Building & Loading ‘Big Test Data’ into MarkLogic
Just as production data processing tools like IRI CoSort must handle big data in NoSQL DB environments, so too must a big test data generation tool like IRI RowGen. Read More
Just as production data processing tools like IRI CoSort must handle big data in NoSQL DB environments, so too must a big test data generation tool like IRI RowGen. Read More
The IRI Voracity data management platform now supports the MarkLogic NoSQL database as a source for structured data discovery (classification, profiling, and search), integration (ETL, CDC, SCD), migration (conversion and replication , governance (data cleansing and masking), and analytic (reporting and wrangling) jobs. Read More
This is the third in a series of articles for creating an IRI Voracity ETL flow of a month-end job for processing sales transactions.
In the first article, we brought an existing CoSort SortCL job script that processes month-end sales transactions into Voracity and made modifications. Read More
This is the first of two articles showing how to create and use job flows in the IRI Workbench GUI for Voracity. It follows two other series on creating flows automatically using new job wizards. Read More
The traditional or Enterprise Data Warehouse (EDW) has been at the center of data’s transformation to business intelligence (BI) for years. An EDW involves a centralized data repository (traditionally, a relational database) from which data marts and reports are built. Read More
Has your organization considered using a data lake? This article explains what a data lake is, and posits a data lake architecture optimized for analytic results. Read More
Very large legacy IT vendors, or what we’ll call megavendors, provide valuable hardware, software, and services to companies worldwide. Often however, their technical approach, product roadmap, and price point will not be the best fit for your use case. Read More
This is the second in series of articles illustrating on how to use existing IRI CoSort (SortCL) jobs in graphical IRI Voracity ETL workflows, or more simply, flows. Read More
Abstract: The IRI Voracity data management platform provides data integration tools, including data pipeline automation for high-performance ETL (Extract-Transform-Load) operations. This article is the first in another series on how to create and use high-performance ETL workflows in the IRI Workbench GUI for Voracity. Read More
Comparing Filter/Sort/Join/Aggregate Performance
ITKeySource, an ETL consultancy in Jacksonville, FL, recently benchmarked relative performance gains running IRI CoSort — and its SortCL program in particular — alongside IBM DataStage. Read More
IRI CoSort continues to be a low-cost way to accelerate Informatica ETL via pushdown optimization, and IRI RowGen can generate safe, referentially correct test data for any EDW. Read More