Are you sure you need another IT fabric like: Hadoop or Teradata, an in-memory or columnar database like HANA or Vertica, a DB or ELT appliance like Exadata or Netezza, or a complex ETL tool like DataStage Parallel Extender or Ab Initio? And do you have the time, money, and expertise to take it on?
If you answered no, see below, because 5-figure IRI software integrates and transforms big data in the file system you already have. It's extremely fast, has a low learning curve, and carries far less risk.
If you answered yes, see below, because those platforms may not access or handle all the data you need, mash it up, or easily prepare it through cleansing, masking, or reformatting. To mine big data, you must smelt it first.
"More Hadoop projects will be swept under the rug as businesses devote major resources to their big data projects before doing their due diligence, which results in a costly, disillusioning project failure."
Gary Nakamura, CEO of Concurrent
"Spend (on big data projects) wisely. Follow a CRAWL - WALK - RUN strategy."
Dr. Peter Aiken, Data Blueprint
For more than 35 years, IRI has been the proven performer for preparing and manipulating multiple data sources data across industries, geographies, and Unix/Windows platforms. Find out why you may only need:
- one affordable product, IRI CoSort, performing in ...
- one simple place, an explicit 4GL job script (supported in a free Eclipse GUI), and ...
- one I/O pass, that combines data transformation, conversion, protection, and reporting.
Here's what you can do with CoSort (click to enlarge):
Big Data Protection - mask, encrypt, pseudonymize, de-ID, hash, tokenize, etc. data as you transform and provide it.
Big Data Provisioning - bulk load DBs with pre-sorted files, create replicas and federated views, franchise (munge, subset, wrangle) for BI/analytic tools, generate reports (via SortCL scripts or ODA into BIRT), and create test data.
CoSort and its spin-offs use a simple, self-documenting 4GL called SortCL for data definition, manipulation, masking and reporting. The SortCL jobs are text scripts supported in Eclipse where they can be created, shared, version-controlled, and run in both graphical or syntax-aware editing modes.
You can also run in batch, from applications, or within distributed computing environments like Hadoop for even more speed later. Browse this section and its links for more details, or request a free trial.
Did You Know?
CoSort, typically used for data transformation, staging and reporting, can also do what its spin-offs do; i.e. data migration (IRI NextForm), data masking (IRI FieldShield), and test data generation (IRI RowGen).