
Challenges

While Pentaho Data Integration (PDI) is a powerful tool for preparing and integrating data, it has sizable shortcomings on its own. Notably, the following issues keep it from living up to its potential as an integration tool:

Slow Transforms

  • Native sort may not run fast enough for high-volume inputs

Limited Masking Features

  • Does not appear to have a native ability to mask or encrypt data flowing through Kettle, a problem given the growing number of data breaches

Limited Test Data

  • No apparent native ability to prototype ETL operations without relying on production data

Solutions

PDI supports third-party functionality, so data can be processed externally without disruption. IRI Voracity and its component software significantly boost data integration through Pentaho.

Speed Your Sort

  • Use PDI's shell script step to call a CoSort job (e.g., SortCL script) to dramatically reduce sorting times
  • Run multiple sorts on the same batch file
  • Get results 14-16 times faster than Pentaho alone
  Blog: Using CoSort to Speed up the Sort Process in Pentaho
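
As a rough illustration of calling a sort job from the Shell step, the script that step runs can be a thin wrapper around the external tool. The Python sketch below assumes a `sortcl` executable on the PATH and a `/SPECIFICATION=` argument naming the job script; both are assumptions to check against your CoSort installation, and the helper names are hypothetical.

```python
import shutil
import subprocess
import sys


def build_sort_command(spec_path, executable="sortcl"):
    """Return the argument list for an external sort job.

    The /SPECIFICATION= form is an assumption about the SortCL command line;
    confirm it in the documentation for your installed version.
    """
    return [executable, f"/SPECIFICATION={spec_path}"]


def run_sort_job(spec_path, executable="sortcl"):
    """Run the job and return its exit code (127 if the tool is not on PATH)."""
    if shutil.which(executable) is None:
        print(f"{executable} not found on PATH", file=sys.stderr)
        return 127
    completed = subprocess.run(build_sort_command(spec_path, executable))
    return completed.returncode
```

A Shell step could call `run_sort_job("sort_orders.scl")` (a hypothetical job file) and treat any non-zero return value as a failed step, so the rest of the PDI job only continues once the sorted output exists.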


Mask Your Data

  • Use IRI FieldShield job scripts from the Shell step in Pentaho to protect data at rest
  • Mask, encrypt, encode, or otherwise protect data in the format you need
  • Secure your data at the field level
  Blog: Masking Data in Pentaho
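
To make "field-level" protection concrete, here is a minimal Python sketch of the general idea: each sensitive column gets its own rule, and all other columns pass through unchanged. This is not how FieldShield is implemented or configured; the function names, the redaction format, and the keyed-hash pseudonym are illustrative assumptions only.

```python
import hashlib
import hmac


def redact(value, keep_last=4, fill="*"):
    """Replace all but the last `keep_last` characters with `fill`."""
    if keep_last <= 0 or len(value) <= keep_last:
        return fill * len(value)
    return fill * (len(value) - keep_last) + value[-keep_last:]


def pseudonymize(value, key):
    """Keyed hash (HMAC-SHA256): equal inputs map to equal tokens."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:16]


def mask_row(row, rules):
    """Apply each field's rule; fields without a rule are left untouched."""
    return {
        name: rules[name](value) if name in rules else value
        for name, value in row.items()
    }


# Hypothetical record and rules, for illustration only.
SECRET_KEY = b"replace-with-a-managed-secret"
RULES = {
    "card_number": redact,
    "email": lambda v: pseudonymize(v, SECRET_KEY),
}
```

Calling `mask_row({"name": "Ann", "card_number": "4111111111111111", "email": "ann@example.com"}, RULES)` leaves `name` as is, keeps only the last four card digits, and swaps the email for a repeatable token, which preserves joins on that column.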


Test Your Apps

  • IRI RowGen populates tables, files and reports with synthetic test data that mimics production data
  • Generate structurally and referentially correct database test data for an entire enterprise data warehouse (EDW)
  • Keep your production data safe
  Blog: Creating Test Data for Pentaho
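
"Referentially correct" test data means every foreign key in a child table points at a row that actually exists in its parent table. The Python sketch below illustrates that property with two made-up tables; it is not RowGen syntax, and the table layouts, column names, and value ranges are assumptions chosen for the example.

```python
import random

REGIONS = ["North", "South", "East", "West"]  # hypothetical lookup values


def make_customers(count, rng):
    """Generate parent rows with unique, sequential primary keys."""
    return [
        {"customer_id": i, "name": f"Customer {i}", "region": rng.choice(REGIONS)}
        for i in range(1, count + 1)
    ]


def make_orders(count, customers, rng):
    """Generate child rows whose customer_id is always drawn from the parents."""
    parent_ids = [c["customer_id"] for c in customers]
    return [
        {
            "order_id": i,
            "customer_id": rng.choice(parent_ids),
            "amount": round(rng.uniform(5.0, 500.0), 2),
        }
        for i in range(1, count + 1)
    ]


# A fixed seed makes the synthetic data set reproducible between test runs.
rng = random.Random(42)
customers = make_customers(10, rng)
orders = make_orders(50, customers, rng)
```

Because the order rows draw their keys from the generated customers, a join between the two sets never produces orphans, and no production record is involved at any point.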
