"More Hadoop projects will be swept under the rug as businesses devote major resources to their big data projects before doing their due diligence, which results in a costly, disillusioning project failure."
- Gary Nakamura, Concurrent
"Spend (on big data) wisely. Follow a CRAWL - WALK - RUN strategy."
- Peter Aiken, Data Blueprint
To mine big data, you must smelt it first. Hadoop distributions and specialty software will not access or handle all the data you need, mash it, or prepare it thoroughly enough (cleansing, masking, reformatting). IRI software on the other hand, handles both big and small data sets, and lets you choose between multi-threaded file system processing, or Hadoop MR2, Spark, Storm, or Tez processing in HDFS... all in the same Eclipse GUI with the same job design and metadata.
For more than three dozen years, IRI has been the proven performer for preparing and manipulating multiple data sources across industries, geographies, and Unix/Windows platforms. Find out why you may only need:
- one affordable product, either CoSort or Voracity, both of which use:
- one simple place, a free Eclipse GUI supporting a simple 4GL, and,
- one I/O pass, combining data transformation, protection, and reporting.
Here's what you can do with IRI:
Big Data Protection - mask, encrypt, pseudonymize, de-ID, hash, tokenize, etc. data as you transform and provide it.
Big Data Provisioning - bulk load with pre-sorted data, create replicas and federated views, franchise (munge, subset) for BI/analytic tools, write reports , feed BIRT directly, or create big test data.
Design and manage your jobs in your choice of UIs from the same Eclipse IDE. Share, version-control, secure, and run the jobs from the GUI, or build them into batch scripts, applications, or distributed computing environments like Hadoop for even more speed and scalability.
Browse this section and its links for more details, or request a free trial.
Did you know?
CoSort, typically used for data transformation, staging and reporting, can also do what its spin-offs do; i.e. data migration (IRI NextForm), data masking (IRI FieldShield), and test data generation (IRI RowGen).
IRI Voracity uses the same metatada and Eclipse GUI as CoSort and its spin-offs, but also lets you design and schedule jobs with state-of-the-art ETL worfklow and built-in automation tools, and run them with CoSort or Hadoop engines.