Synthesize Smart Test Data & Provision It Your Way
Do you need an easier way to:
- create test databases with referential integrity
- simulate and share file, image, and report layouts
- quality-control and stress-test applications
- benchmark new hardware and software
- conduct data warehouse ETL or Data Vault testing
- put anonymized datasets online for offshore developers
using your data models and metadata, but not your production data?
You need a smart test data anonymization tool. Table views, index orders, key relationships, and file and report contents, must reflect reality to be useful in testing. Generating realistic values and formats with synthetic data in ideal ranges -- and populating large targets -- can take a long time with other tools or programs.
With the IRI RowGen product or IRI Voracity platform, you can generate multiple synthetic targets for test databases loads, file structures, and custom report formats from scratch -- all without access to real data. Or if you want to use and anonymize, subset, or otherwise mask real data from production for on-demand or virtualized testing scenarios, you can do that, too.
IRI offers four methods for producing anonymous but intelligent test data in referentially correct database, flat-file, semi-structured file, formatted report, and even unstructured file targets:
- DB or file synthesis (via random data generation or selection) in IRI RowGen
- Prod or test data masking in IRI FieldShield, CellShield EE, or DarkShield
- RDB table subsetting and masking using RowGen or FieldShield
- Any combination of the above in IRI Voracity (which includes it all)
Data Synthesis Capabilities
RowGen can create structurally and referentially correct synthetic test data for every popular RDBMS with defined constraints, plus test data in custom report layouts or popular file/feed formats like these:
- Record, line, or variable sequential
- ASN.1 CDRs
- COBOL index (MF ISAM, Vision)
- CSV, LDIF, JSON, and XML
- Excel (XLS/X)
- FHIR, HL/7 and X12 EDI
- Fixed position text and mainframe blocked
- HDFS
- Image files and PDFs (using DarkShield with RowGen)
- MQTT and Kafka topics
- BIRT (via ODA) and KNIME (analytic & visualization nodes) in Eclipse
RowGen randomly generates field values in more than 100 data types. It can also randomly select data from set files at the field level. That, along with custom/compound data values, value ranges, and distributions, improve test data realism.
Support for standard and complex data transformations, set files, and conditional selection also contribute to RowGen's value in simulating production table and file formats for a variety of applications.
For database users, RowGen leverages the DDL information for Oracle, DB2 UDB, SQL Server, Sybase, Teradata, and other platforms to create realistic tables with structural and referential integrity. Use RowGen to populate an entire test enterprise data warehouse (EDW) or Data Vault 2.0 environment.
Data Masking Capabilities
Use any of the static data masking tools available in the IRI Data Protector Suite, or included free in the IRI Voracity platform:
- IRI FieldShield for structured files and databases
- IRI DarkShield for structured sources as well as many semi-structured and unstructured data sources
- IRI CellShield for Excel spreadsheets
to discover (profile, search, and classify), de-identify (encrypt, pseudonymize, blur, redact, etc.), data in production systems and replicate it anonymized in lower dev, test and QA environments.
If you use IRI Voracity, you can use its included RowGen synthesis and FieldShield data masking capabilities to find, classify, subset, and mask data, and integrate that data for static development use in lower environments or virtual use in live testing environments.
Consider our test data management advice as you scope out your requirements and plan your strategy, and see these links for more information on using safe test data for: