Though XML is a popular file interchange format for SOA, database, and other applications, it has not been a practical format for carrying large data volumes. In addition, conversions between legacy index or flat files and XML rely on slower transformation technologies like XSLT, which do not let you process the data while it is being converted.
Transforms using XQuery cannot turn high volumes of XML data into meaningful information in XML (or any other format) quickly, if at all. There has been no efficient way to rapidly convert, process, protect, or create huge XML files.
For example, you may need to:
- Sort a huge XML file
- Extract data from, or report on, an XML file
- Convert a CSV, LDIF, or other file to XML
- Convert XML to text, CSV, LDIF, ISAM, etc.
- Encrypt or de-identify fields in an XML file
- Load XML data to a spreadsheet or database
- Create an XML file from a legacy or extract file
- Generate test data in XML file formats
You may even need to perform more than one of these functions at the same time, against many massive source and target files.
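To make the "convert a CSV file to XML" case concrete, here is a minimal, generic Python sketch that uses only the standard library. It is not IRI code and does not represent how any IRI product works internally. The function name `csv_to_flat_xml` and the `records`/`record` tag names are hypothetical, and the sketch assumes the CSV column headers are already valid XML element names.

```python
import csv
import io
from xml.sax.saxutils import escape


def csv_to_flat_xml(csv_text, root_tag="records", row_tag="record"):
    """Turn CSV text into a flat XML document: one element per row,
    one child element per column. Assumes headers are valid XML names."""
    reader = csv.DictReader(io.StringIO(csv_text))
    lines = [f"<{root_tag}>"]
    for row in reader:
        lines.append(f"  <{row_tag}>")
        for name, value in row.items():
            # Escape &, <, > so field values cannot break the markup.
            lines.append(f"    <{name}>{escape(value or '')}</{name}>")
        lines.append(f"  </{row_tag}>")
    lines.append(f"</{root_tag}>")
    return "\n".join(lines)


sample_csv = "id,name\n1,Ada & Co\n2,Bob\n"
xml_out = csv_to_flat_xml(sample_csv)
print(xml_out)
```

A script like this builds the whole document in memory, so it only illustrates the shape of the output; it is not a way to handle the massive files discussed here.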
IRI delivers XML and other file conversion functionality in several products. Choose based on need:
XML File Conversion Only
Use the free Lite edition of IRI NextForm to convert huge, flat XML files* to other formats (like CSV, LDIF, COBOL, text, etc.), or from those other formats into XML. If your XML files are semi-structured or unstructured, use the dark data discovery wizard to search for and structure the elements you need based on pattern matches, or upgrade to the NextForm Legacy Edition to view your source files in the data source explorer using a specialized XML driver.
NextForm includes an XML file parser to automatically create the XDEF field layouts used in the file conversion scripts. NextForm also supports data type conversion at the field level, and the remapping of record layouts. NextForm file definitions also work in the CoSort SortCL program. Easily upgrade to CoSort for fast transformation and custom reporting.
XML File Conversion, Transformation, and Reporting
Use the SortCL program in IRI CoSort to convert, transform, report from, and create valid, well-formed XML files that represent structured data. Declare one or more XML and non-XML files for input and output as part of any SortCL job involving data:
- filtering (select, scrub, links to DQ tools)
- transformation (sort, join, aggregate, calc, etc.)
- conversion (data-type and file-format migrations)
- reporting (CDC, detail and summary formats)
- protection (field encryption, de-ID, masking)
CoSort SortCL makes all of these capabilities, one or more at a time, available to data architects who need to work with CSV, XML, LDIF, ISAM, Vision, text, and other large flat files.
XML Data Protection
Use IRI FieldShield if you need to encrypt, mask, or otherwise de-identify the field data within your XML files. FieldShield uses the same layout metadata as NextForm and RowGen, so you can easily move between format conversion and XML test data generation or XML field protection.
XML Test Data
Use IRI RowGen if you need test data in XML file formats. RowGen uses the same layout metadata as CoSort and NextForm, so you can easily move between test data generation and real data transformation.
* XML data elements must conform to a flattened structure, with one element of a given name extracted at each level. If multiple tags share the same name, IRI engines extract the last occurrence of that name. Field names must be unique, and together the fields must comprise a single record with no additional dependencies.
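The "last occurrence" rule in the footnote above can be pictured with a short, generic Python sketch. This is only an illustration of the stated behavior, not IRI's implementation. The function name `flatten_record` and the sample `customer` data are hypothetical.

```python
import xml.etree.ElementTree as ET


def flatten_record(xml_text):
    """Flatten one XML record into a dict of field name -> text.
    If a tag name repeats at the same level, the later value
    overwrites the earlier one, so the last occurrence is kept."""
    root = ET.fromstring(xml_text)
    record = {}
    for child in root:
        record[child.tag] = (child.text or "").strip()
    return record


sample = """<customer>
  <name>Ada</name>
  <phone>555-0100</phone>
  <phone>555-0199</phone>
</customer>"""

print(flatten_record(sample))
# prints {'name': 'Ada', 'phone': '555-0199'}
```

In this sample the first `phone` value is dropped, which is why repeated tags at the same level should be renamed or restructured before conversion if every value matters.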