Splunk is a robust analytic tool for a variety of data sources. However, Splunk cannot ingest dark data in a comprehensive way, and it lacks the kinds of data protection capabilities that compliance-minded data scientists require.
1) Index Splunk Immediately with Prepared and Protected Data
Using the new IRI add-on for Splunk, you can use the fast data preparation and protection capabilities of IRI Voracity from inside Splunk. Integrate and mask raw data sets in a variety of formats, and pass them into Splunk indexes immediately, through memory.
You can also use IRI software to discover dark data and to protect sensitive fields before Splunk indexes them:
The dark data discovery wizard in the IRI Workbench GUI can search and structure data in MS Office documents, email repositories, and .pdf, .rtf, and .xml files all at once. It can also collect forensic metadata about each data source that contains an element matching the user's search pattern(s).
The resulting flat file contains all of the matching data (and, optionally, the metadata), which Splunk can index easily. You can even display the results in the same GUI as your data preparation and management activities.
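The search-and-structure step described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the IRI wizard's implementation: it assumes the text has already been extracted from each document, and the function names, sample sources, and search pattern are all hypothetical.

```python
import csv
import io
import re


def search_sources(sources, pattern):
    """Return one row per match: source name, line number, matched text."""
    regex = re.compile(pattern)
    rows = []
    for name, text in sources.items():
        for line_no, line in enumerate(text.splitlines(), start=1):
            for match in regex.finditer(line):
                rows.append(
                    {"source": name, "line": line_no, "match": match.group(0)}
                )
    return rows


def to_flat_file(rows):
    """Serialize the rows as a header line plus comma-delimited records."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["source", "line", "match"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical text already extracted from two unstructured documents.
SAMPLE_SOURCES = {
    "memo.txt": "Contact: 123-45-6789\nNo identifiers here",
    "mail.txt": "SSNs 987-65-4321 and 111-22-3333",
}
# Assumed search pattern: US Social Security number layout.
SSN_PATTERN = r"\b\d{3}-\d{2}-\d{4}\b"

if __name__ == "__main__":
    print(to_flat_file(search_sources(SAMPLE_SOURCES, SSN_PATTERN)), end="")
```

Each output record pairs a matched value with where it was found, which is the kind of delimited, self-describing layout that an indexer can consume without further parsing.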
Splunk has identified a number of data encryption scenarios, but it does not offer the format-preserving encryption of IRI FieldShield (available in the same GUI above or as part of a Voracity subscription), which maintains realism for data entering Splunk and referential integrity for data leaving it. Use the IRI add-on for Splunk to run a FieldShield (or IRI RowGen test data generation) job script that indexes this data into Splunk as it is protected (or produced).
Because IRI's data protection functions apply at the field level, they are more secure. Unlike in Splunk, if one encryption key is disclosed, fields protected with other keys or algorithms remain safe.
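To make the per-field idea concrete, here is a minimal Python sketch. It is not FieldShield's algorithm and it is not true format-preserving encryption: it is a one-way, keyed digit substitution built on HMAC-SHA256, chosen only because it fits in a few lines. The field names, keys, and helper functions are hypothetical. It does show the two properties discussed above: the masked value keeps the original layout, and the same input always yields the same output under the same key, so joins on the masked field still line up.

```python
import hashlib
import hmac

DIGITS = "0123456789"


def mask_digits(value, key):
    """Replace each ASCII digit with a keyed pseudorandom digit.

    Non-digit characters (dashes, spaces, letters) are kept, so the
    layout of the value is preserved. The mapping is deterministic for
    a given key and input, and it cannot be reversed to the original.
    """
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).digest()
    out = []
    used = 0
    for ch in value:
        if ch in DIGITS:
            # Slightly biased toward low digits; acceptable for a sketch.
            out.append(str(digest[used % len(digest)] % 10))
            used += 1
        else:
            out.append(ch)
    return "".join(out)


def protect_record(record, field_keys):
    """Mask only the fields that have a key; pass other fields through."""
    return {
        name: mask_digits(value, field_keys[name]) if name in field_keys else value
        for name, value in record.items()
    }


# Hypothetical keys: each sensitive field gets its own secret.
FIELD_KEYS = {
    "ssn": b"hypothetical-ssn-key",
    "phone": b"hypothetical-phone-key",
}

if __name__ == "__main__":
    record = {"name": "Ann", "ssn": "123-45-6789", "phone": "555-0100"}
    print(protect_record(record, FIELD_KEYS))
```

Because the "ssn" and "phone" fields use separate keys, learning one key tells an attacker nothing about how the other field was masked.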