Take a Closer Look at Voracity


Next Steps
Voracity Overview Features Technical Details GUI Platforms & Pricing Why It's Better Resources

IRI Voracity® uniquely delivers proven, high-performance data manipulation and management in one pane of glass. Voracity's versatility (and affordability) helps you tackle the challenges of data discovery, integration, migration, and governance -- and the changing analytic needs of digital business -- all at once.

Check out Voracity's capabilities, the challenges it now and will continue to address, and its components, below. Explore the other tabs in this section, and the solution areas throughout this web site to understand just how much your teams can cooperatively accomplish with the state-of-the-art technology in this platform.

Voracity uniquely combines the discovery, integration, migration, governance, and analysis of data in a variety of sources ... all from one place, and often in one pass. Manipulate, migrate, mask, munge, and map structured, semi-structured, and unstructured data into multiple targets at once.

Read More

Voracity's core data management capabilities leverage the functionality of the IRI CoSort SortCL data definition and manipulation program.

And as one of the original and few remaining Hadoop alternatives, CoSort SortCL also packages, presents and provisions big data. It combines: data cleansing, extraction, transformation, loading, masking, reporting -- and even synthetic test data generation -- in the same job script and multi-threaded I/O pass in your existing file system.

If you still need the scalability and capability of Hadoop, however, you are covered. Voracity supports the execution of SortCL jobs in MapReduce 2, Spark, Spark Stream, Storm, and Tez. Compare that to Hadoop distributions you're considering, or disjointed Apache projects you're trying to coordinate.

All that work in the middle starts with data discovery. Only Voracity provides four data profiling tools. And it ends with analytics, where you have three choices: embedded BI, BIRT and Splunk integrations, and/or robust data preparation for your chosen data visualization platform.

Voracity flyer

As the above schematic illustrates, Voracity delivers these capabilities from a single GUI, IRI Workbench:

native, parallel VLDB unload
multi-threaded, combinatory data transformations
pre-sorted, facilitated VLDB loads
single-pass, high-performance ETL
third-party ETL tool acceleration
static and dynamic data masking
smart, synthetic test data generation
enterprise metadata management ( EMM)
master data management ( MDM)
metadata/task lineage, sharing & security
data quality: search, validate, cleanse, enrich
forensic metadata discovery and job auditing
detail and summary reports ( embedded BI)
change data capture ( CDC)
clickstreams (web log data webhousing)
customer data ( CDI) & segments
slowly changing dimensions ( SCD) & trends
BIRT integration
data preparation for BI/analytic tools

Only Voracity delivers multiple job design and deployment options in the same Eclipse GUI. And only Voracity uses the latest CoSort engines while also supporting multiple Hadoop engine alternatives from that same GUI.

So by embedding CoSort's mission-critical data integration, migration, and governance capabilities, supporting Hadoop engines, and front-ending discovery, EMM, MDM and workflow in a continually developed Eclipse IDE, Voracity is not only functionally comprehensive, it's uniquely ergonomic, scalable, affordable, and future-proofed for growing data sources and enterprise information needs.

Voracity addresses the challenges of data volume, variety, velocity, veracity, and value with a comprehensive data management platform that eliminates multi-tool complexity and bends the cost curve away from megavendor ETL packages and Hadoop distributions.

Read More


Data from internal and public sources is growing exponentially.

Can your tools handle tomorrow's loads?
Prepare big data subsets for analytics fast by accelerating and combining transforms in you file system - not in the BI or DB layer. Use Voracity to de-duplicate and filter, sort and join, aggregate and segment, reformat and hand-off ... all in one pass. Send prepared data in memory to BIRT at reporting time, or into cubes your app wants. See IRI's approach here, or in the 2015 DBTA webinar " Accelerating Data Processing for Analytics."
The CoSort engine in Voracity processed big data long before it was called big data, running and combining multi-gigabyte transforms in seconds, and besting 3rd-party sort, BI, DB, and ETL tools 2-20X.

And now there are Hadoop options in Voracity too, distributing and scaling huge workloads across commodity hardware via MapReduce 2, Spark, Spark Stream, Storm, and Tez.


The myriad of structured and unstructured sources is beyond most tools.

Can you acquire and mashup internal and external sources in one place?
What tools are you using now to discover, extract, process, and analyze all the data you gather or buy? Can you reach and process it all in one pane of glass? Can you quality-control and manage its metadata and master data in that same place? Can you analyze the data there too, or at least rapidly integrate and prepare it for external applications? If you use multiple tools, can you manage the expertise they require? Or if you use a legacy ETL platform, can you bear its cost?
Voracity analyzes, integrates, migrates, governs, profiles, and connects to some 150 different data sources and targets ... structured, semi-structured, and unstructured.

That includes legacy files, data and endian types, as well as popular flat and document file formats, every RDB, and newer big data and cloud/SaaS sources.


CDR, IOT, social, and other data come fast, and at different intervals.

Are you ready for streaming, near-real-time, and batched data?
The biggest data volumes are still processed in regular batch cycles, something Voracity's native CoSort and Hadoop MapReduce and Tez options will optimize. But what about the need to process (transform, mask, reformat) and analyze data in real-time for instant promotional campaigns (think mobile devices), or alerts (like traffic and weather notices) that can help drivers or event-goers?
Voracity includes CoSort to integrate data in memory and files, so you can process big data 6x faster than ETL tools, 10x SQL and 20X faster than BI/analytic tools. Its typical mode, including CDC, is batch.

Voracity can process real-time, near-real-time, and streaming data in memory via input procedure calls to CoSort, or in Hadoop Spark or Storm engines ... all from the same Eclipse GUI, IRI Workbench. Other options include using the built-in job launcher to spawn Voracity jobs in near-real-time intervals, or using specialized BAM or CEP tools for managing event-driven activity.


Low quality data jeopardizes apps and analytic value. PII is another data risk.

How do you maintain the reliability and security of your data?
Garbage in = garbage out, and thus data in doubt. Data quality suffers from inconsistent, inaccurate, or incomplete values. Social media data can be deceptive, unstructured data imprecise, and data ambiguity plagues MDM. Survey data can be biased, noisy or abnormal. Meanwhile PII and secrets contained in all that data mean you have to mask it prior to shared use. Do you have a central point of control for cleaning data and making it safe?
Voracity's data discovery, fuzzy matching, value validation, scrubbing, enrichment, and unification features all improve data quality.

Voracity's comprehensive data masking functions and synthetic test data generation capabilities remove the risk of data breaches and poor prototypes.


And the point of it all ... getting analytic value from big data.

Are you getting the insights you need to make decisions?
Consider your information and decision needs from data. For example, are you tracking consumer behavior, weather patterns, device or web log activity so that you can change promotions, make predictions, or diagnose problems? Do you see the value in an IDE easy enough for self-service data preparation and presentation, but powerful enough for IT and business user collaboration in data lifecycle management?
Voracity is the one tool that provides access to, and discovery across, the disparate data sources behind these analyses.

Only Voracity allows you blend, cleanse, mask and munge tons of data fast, and feed the results to algorithmic and visualization applications -- within the same, or another environment in the right format.

Voracity is powered by IRI CoSort or Hadoop engines, and everything it does is front-ended in one Eclipse™ GUI. Beyond a massive amount of included features, a plethora of free Eclipse plug-ins and proven partner technology expand what you can do with Voracity.

Read More

The default Voracity stack uses IRI Workbench for client-side design of data-driven jobs defined in portable scripts represented in multiple graphical UIs.

Many of the same jobs also run interchangeably in Hadoop MR2, Spark, Spark Stream, Storm or Tez.

Voracity metadata and related job script parameters are fully supported in the Workbench data model and optionally in AnalytiX DS Mapping Manager, for graphical creation, modification, and management.

Voracity Stack Diagram

Within the base Voracity package are:

  1. DB, flat-file, and dark data profiling, ERD, and metadata definition wizards
  2. key data processing features of constituent IRI Data Manager products
  3. key data security features of constituent IRI Data Protector products
  4. multiple, re-entrant job design options and execution paradigms
  5. runtime and metadata SDKs for application development
  6. robust GUI help content and CLI reference manuals
Voracity Flyer Back

More specifically, Voracity includes free use of a rich, familiar front-end job design and management environment called (IRI Workbench), built on Eclipse™. Together with Voracity's back-end production engine you can run anywhere, IRI Workbench supports the capabilities of:

  • IRI CoSort for big data manipulation and movement, including EDW integration (ETL) and data preparation for DBs and analytic tools, data quality, embedded BI, metadata and master data management, legacy sort migration, and data governance
  • IRI NextForm for data and DB migration, data replication, remapping, and federation
  • IRI FieldShield for masking PII in files and databases, and IRI CellShield EE for Excel
  • IRI RowGen for generating synthetic but realistic file, database, and report test data

Additional capabilities in the IRI Workbench include:

  • default and plug-in shell UIs for command line execution and interaction
  • multiple data profiling and metadata discovery and definition wizards
  • metadata management and master data management ( MDM)
  • ODA-integration for BIRT (visual analytics)
  • Sirius workflow, transform mapping, and E-R diagrams

Beyond the base edition, premium options include:

Integrated Capability
ADS ETL Conversion
automated job/metadata migration from other ETL tools
ADS Mapping Manager
code-free source-target ETL mapping/flow generation
CONNX Drivers
move/manipulate mainframe and other proprietary data
DataDirect Drivers
move/manipulate big data and cloud/SaaS data
cloud dashboard for interactive BI
parallel unload of Oracle and 6 other VLDB tables to files
IRI Hadoop Runtimes
MapReduce 2, Spark, Spark Stream, Storm, and Tez options
IRI Chakra Max
DB activity monitoring, auditing & protection (DAM/DAP)
WalWare StatET for R
Eclipse environment for Revolution analytics

Whether included with the base Voracity package, or installed as partner technology, everything runs in the IRI Workbench and leverages the same open data and manipulation metadata infrastructure for job management and deployment ... inside or outside the GUI.

See the product-function matrix on this page to see what's provided at a glance, and review the feature-benefit charts on this page for more details and links.

Request More Information

Live Chat

* indicates a required field.
IRI does NOT share your information.

Try Voracity Free

Discover, Integrate, Migrate, Govern, Analyze

Get Info See Demo