A Fresh Look at Data Preparation
To analyze data successfully, it must first be prepared successfully. Poor quality data creates poor results. Worse yet is data that takes too long to collect and clean because it is too big or too foreign. Read More
To analyze data successfully, it must first be prepared successfully. Poor quality data creates poor results. Worse yet is data that takes too long to collect and clean because it is too big or too foreign. Read More
IRI Workbench not only has several ways to create jobs, but also several ways to execute them.
This article focuses on IRI Workbench execution options for job scripts based on the SortCL program language, which covers IRI Voracity ETL, CDC, SDC, pivoting and subsetting jobs, as well as its constituent product functions; i.e., Read More
The “New Reformat Job” wizard in the IRI Workbench NextForm menu virtualizes disparate data sources to produce immediate, fit-for-purpose views. This feature provides a simple and free or low-cost data federation vehicle for IRI NextForm, IRI CoSort, or IRI Voracity platform users. Read More
The traditional or Enterprise Data Warehouse (EDW) has been at the center of data’s transformation to business intelligence (BI) for years. An EDW involves a centralized data repository (traditionally, a relational database) from which data marts and reports are built. Read More
Has your organization considered using a data lake? This article explains what a data lake is, and posits a data lake architecture optimized for analytic results. Read More
Update Q3’2019: Subsequent to the development of the IRI Voracity Add-On for Splunk described below, there is now also a Splunkbase-registered IRI Voracity App for Splunk available for Seamless Data Preparation, Indexing, and Visualization…
After our first examples of external unstructured data preparation and PII data masking for Splunk generated interest in these capabilities, IRI wanted to develop a direct integration from the Splunk user interface (UI). Read More
Linear regression is a staple data analysis function for financial, economic, research, and many other disciplines, that helps discover new data correlations. Users of the IRI Voracity platform can now simultaneously process big data from any number of sources and present customized trend lines to help business users make predictions. Read More
“Progress is impossible without change, and those who cannot change their minds cannot change anything.”
-George Bernard Shaw
The mathematical symbol for change is the Greek uppercase letter delta: Δ. Read More
There is now a bulk/batch Change Data Capture (CDC) wizard in the IRI Workbench GUI for Voracity, built on Eclipse™. This new graphical report builder compares previous and current data sources based on a flag field you select, and gives you the option of producing five different targets — Cumulative, Delete, Equal, Insert, and Update — or any combination thereof. Read More
A dimension is a structure that categorizes a collection of information so that meaningful answers to questions regarding that information may be obtained. Dimensions in data management and data warehouses contain relatively static data; however, this dimensional data can change slowly over time and at unpredictable intervals. Read More
In ancient times, the ability to process and analyze information was invaluable. Greek knowledge of astronomy gave rise to the Antikythera Mechanism, an analog computer with sophisticated bronze gears that predicted astronomical events like lunar phases and eclipses. Read More