{"id":8439,"date":"2015-12-15T11:15:36","date_gmt":"2015-12-15T16:15:36","guid":{"rendered":"http:\/\/www.iri.com\/blog\/?p=8439"},"modified":"2025-02-12T14:04:05","modified_gmt":"2025-02-12T19:04:05","slug":"iris-new-database-subsetting-wizard","status":"publish","type":"post","link":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/","title":{"rendered":"What Is Database Subsetting? How to Subset a Database."},"content":{"rendered":"<p>Once a database exceeds a certain size, it becomes expensive \u2014 and risky from a security perspective \u2014 to provide full-size copies for development, testing, and training. Most teams need smaller copies of the larger database, and often have the PII within masked.<\/p>\n<p>Database subsetting is that process of creating a smaller, referentially-correct copy of a larger database schema from real table extracts. Subsets can be used with, or in lieu of, masking or synthesizing test data to reduce the costs and risks associated with full sets.<\/p>\n<p>The process of creating meaningful subsets manually is complex and laborious. Normally you would have to populate smaller databases with random samples from each production table, and make sure that any relational structure between the tables was still correct in the subset.<\/p>\n<p>This article shows you how to subset a database in the fit-for-purpose, end-to-end database subsetting tool in the <a href=\"\/products\/workbench\">IRI Workbench<\/a> GUI, which simplifies and accelerates this process. The DB subsetting job wizard is available to licensed users of the <a href=\"\/products\/voracity\">IRI Voracity<\/a> data management platform, <a href=\"\/products\/rowgen\">IRI RowGen<\/a> for test data generation, or <a href=\"\/products\/fieldshield\">IRI FieldShield<\/a> for data masking.<\/p>\n<p>By using the wizard, you can create secure database subsets for testing that are structurally and referentially correct.\u00a0 This is because the wizard preserves primary and foreign key relationships, and allows you to simultaneously obfuscate PII with <a href=\"https:\/\/www.iri.com\/solutions\/data-masking\/static-data-masking\">data masking functions<\/a> like format-preserving encryption.<\/p>\n<p align=\"left\">The wizard for subset creation has the user select the source of the subset, the size, content, and sorting thereof, plus any masking or other functions that should apply to the target parent and child table data. After those options are specified, the wizard builds a series of job scripts (and optional workflow and transform mapping diagrams) which when run will generate and populate either subset tables or flat files.<\/p>\n<h2 align=\"left\">Job Options<\/h2>\n<p>This page defines the output type. If the loader is left empty, a flat file with a tab delimiter will be created. For database options, either ODBC or the specific database loader can be selected.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16617 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/1.png\" alt=\"Define Destination\" width=\"860\" height=\"327\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/1.png 860w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/1-300x114.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/1-768x292.png 768w\" sizes=\"(max-width: 860px) 100vw, 860px\" \/><\/p>\n<h2>Subset Options<\/h2>\n<p>This page is where you specify the details of the subset. Select the connection profile and the table that will \u201cdrive\u201d the created subset. Think of the driving table as the main table from where you want the subset to originate.<\/p>\n<p>For example, if you want to subset a table of sales and all tables connected to it, you would select the table with sales info here. Also, select the size of the subset. For example, to get a subset of the 100 highest quantities sold, you would sort on the quantity sold field (as in this example) and enter 100 in the number of rows. A filter can also be added on the driver table.<\/p>\n<p>A qualitative filter can also be added on the driver table to customize the subset based on business criteria. Such as the example below, allowing the filter to omit or include a condition type, that is either manually created to fit all needs, or can use an existing condition.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16618 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/2.png\" alt=\"Subsetting Options\" width=\"862\" height=\"835\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/2.png 862w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/2-300x291.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/2-768x744.png 768w\" sizes=\"(max-width: 862px) 100vw, 862px\" \/><\/p>\n<h2>Sorting<\/h2>\n<p>This page is where you specify the sort order of the subset. If you want the rows selected randomly, however, leave the Key Fields list blank.<\/p>\n<p>The Sort page has 3 notable sections:<\/p>\n<ol>\n<li>The Input Fields list<\/li>\n<li>The Key Fields list<\/li>\n<li>Key Options<\/li>\n<\/ol>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16619 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/3.png\" alt=\"Sort\" width=\"860\" height=\"838\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/3.png 860w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/3-300x292.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/3-768x748.png 768w\" sizes=\"(max-width: 860px) 100vw, 860px\" \/><\/p>\n<p>Steps:<\/p>\n<ol>\n<li>If you want a subset of sorted data, select the column to sort on and click &#8220;Add Key.&#8221;<\/li>\n<li>A few options are available in the Key Options box:\n<ul>\n<li>Select the Stable check box to duplicate records to load in the order they are in the production table.<\/li>\n<li>If you want no duplicates or only duplicates in your subset, select the Duplicates check box and the appropriate radio button.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<h2>Target Naming<\/h2>\n<p>This page is displayed if the output type is a database. It has two functions. The first allows you to view the relations of your driver table. The second is to name the output targets for the subset.<\/p>\n<p>The Target Naming page has 5 notable sections:<\/p>\n<ol>\n<li>The target profile and schema<\/li>\n<li>The output mode is always create as this job will truncate any existing tables of the same name<\/li>\n<li>SQL options for the job<\/li>\n<li>The naming options<\/li>\n<li>The references list<\/li>\n<\/ol>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16620 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\" alt=\"Data Loading\" width=\"857\" height=\"837\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png 857w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4-300x293.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4-768x750.png 768w\" sizes=\"(max-width: 857px) 100vw, 857px\" \/><\/p>\n<p>Steps:<\/p>\n<ol>\n<li>Select the connection profile.<\/li>\n<li>Select the schema.<\/li>\n<li>Specify the SQL options.<\/li>\n<li>Select the naming option to name the subset targets. If a different profile or schema are used, this step is optional:\n<ul>\n<li><em>Prefix all<\/em> adds a prefix to every table name and sets it as that table&#8217;s target.<\/li>\n<li><em>Postfix all<\/em> adds a postfix to every table name and sets it as that table&#8217;s target.<\/li>\n<li><em>Name individually<\/em> allows each subset target to be named independently of the other targets.<\/li>\n<\/ul>\n<\/li>\n<li>Review the\u00a0<em>Referenced by<\/em> and\u00a0<em>References\u00a0<\/em>lists to verify that the table&#8217;s relations have been found correctly:\n<ul>\n<li>In the\u00a0<em>Imported by\u00a0<\/em>list, all of the tables that your chosen table is imported by are listed.<\/li>\n<li>In the\u00a0<em>Imports<\/em> list, all of the tables that your chosen table has a relationship with are listed.<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<h2>Data Classes &amp; (Masking) Rules<\/h2>\n<p>On this page, previously created data classes can be selected and used. Data Masking functions are typically linked to these data classes as data privacy rules, and you can apply those same rules IRI FieldShield, DarkShield, or subsetting jobs.<\/p>\n<p>Data masking in database subsetting also supports referential integrity enterprise wide when you apply deterministic masking functions as rules. So be mindful, and see article on data masking tool functions, regarding the ciphertext results you want to apply.<\/p>\n<p>For a more in depth look at creating a data class and masking-rule pairing, see <a href=\"https:\/\/www.iri.com\/blog\/data-protection\/iri-data-classification\/\">this article<\/a>. And to see how all this works in practice, watch the video linked below.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16621 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/5.png\" alt=\"Data Classes\" width=\"860\" height=\"246\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/5.png 860w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/5-300x86.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/5-768x220.png 768w\" sizes=\"(max-width: 860px) 100vw, 860px\" \/><\/p>\n<h2>Summary<\/h2>\n<p>This page summarizes the DB subsetting job configuration. It clearly displays the columns that matched a rule, and the target tables matching the source tables if they do not exist.<\/p>\n<p>The DDL will also be created for the target tables. You should use that DLL to <a href=\"https:\/\/www.iri.com\/blog\/iri\/iri-workbench\/create-new-tables-in-iri-workbench\/\">create tables in Workbench<\/a> prior to executing your subsetting workflow (batch script) if they do not exist.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16622 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/6.png\" alt=\"\" width=\"858\" height=\"523\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/6.png 858w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/6-300x183.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/6-768x468.png 768w\" sizes=\"(max-width: 858px) 100vw, 858px\" \/><\/p>\n<p>Once all of the subsetting wizard pages have been completed, click <em>Finish<\/em>. The wizard will then create job scripts and an executable batch file for Windows or Unix that you can launch from Workbench or the command line to extract subsets from the driver table and its related tables.<\/p>\n<p>Here is a flow diagram showing the tasks created for this simple job in Voracity:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-16623 size-full\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/7.png\" alt=\"Visual\" width=\"839\" height=\"320\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/7.png 839w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/7-300x114.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/7-768x293.png 768w\" sizes=\"(max-width: 839px) 100vw, 839px\" \/><\/p>\n<p style=\"text-align: center;\" align=\"left\"><strong>Follow along with our YouTube video!<\/strong><\/p>\n<p><iframe loading=\"lazy\" title=\"Database Subsetting Tutorial (IRI Voracity)\" width=\"1140\" height=\"641\" src=\"https:\/\/www.youtube.com\/embed\/eUSctzHUJVU?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p>Contact <a href=\"mailto:info@iri.com\">info@iri.com<\/a> if you have any questions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Once a database exceeds a certain size, it becomes expensive \u2014 and risky from a security perspective \u2014 to provide full-size copies for development, testing, and training. Most teams need smaller copies of the larger database, and often have the PII within masked. Database subsetting is that process of creating a smaller, referentially-correct copy of<\/p>\n<div><a class=\"btn-filled btn\" href=\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\" title=\"What Is Database Subsetting? How to Subset a Database.\">Read More<\/a><\/div>\n","protected":false},"author":43,"featured_media":16620,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[8,91,29,3],"tags":[970,968,14,966,173,2009,971,520,526,789,221,967,969,191,166,2010],"class_list":["post-8439","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-protection","category-iri-workbench","category-test-data","category-vldb-operations","tag-column-masking","tag-data-management-platform","tag-data-masking","tag-database-subsetting","tag-database-testing","tag-db-subsets","tag-field-preserving-encryption","tag-iri-fieldshield","tag-iri-rowgen","tag-iri-voracity","tag-iri-workbench-gui","tag-job-wizard","tag-referentially-correct-subsets","tag-test-data-generation","tag-test-database","tag-test-subsets"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.4 (Yoast SEO v23.4) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What Is Database Subsetting? How to Subset a Database. - IRI<\/title>\n<meta name=\"description\" content=\"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What Is Database Subsetting? How to Subset a Database.\" \/>\n<meta property=\"og:description\" content=\"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\" \/>\n<meta property=\"og:site_name\" content=\"IRI\" \/>\n<meta property=\"article:published_time\" content=\"2015-12-15T16:15:36+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-02-12T19:04:05+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\" \/>\n\t<meta property=\"og:image:width\" content=\"857\" \/>\n\t<meta property=\"og:image:height\" content=\"837\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Claudia Irvine and Tylor Quinley\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Claudia Irvine and Tylor Quinley\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\"},\"author\":{\"name\":\"Claudia Irvine and Tylor Quinley\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\"},\"headline\":\"What Is Database Subsetting? How to Subset a Database.\",\"datePublished\":\"2015-12-15T16:15:36+00:00\",\"dateModified\":\"2025-02-12T19:04:05+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\"},\"wordCount\":1100,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\",\"keywords\":[\"column masking\",\"data management platform\",\"data masking\",\"database subsetting\",\"database testing\",\"DB subsets\",\"field-preserving encryption\",\"IRI FieldShield\",\"IRI RowGen\",\"IRI Voracity\",\"IRI Workbench GUI\",\"job wizard\",\"referentially-correct subsets\",\"test data generation\",\"test database\",\"test subsets\"],\"articleSection\":[\"Data Masking\/Protection\",\"IRI Workbench\",\"Test Data\",\"VLDB\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\",\"url\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\",\"name\":\"What Is Database Subsetting? How to Subset a Database. - IRI\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\",\"datePublished\":\"2015-12-15T16:15:36+00:00\",\"dateModified\":\"2025-02-12T19:04:05+00:00\",\"description\":\"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png\",\"width\":857,\"height\":837},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.iri.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What Is Database Subsetting? How to Subset a Database.\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.iri.com\/blog\/#website\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"name\":\"IRI\",\"description\":\"Total Data Management Blog\",\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.iri.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\",\"name\":\"IRI\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"width\":750,\"height\":206,\"caption\":\"IRI\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\"}},[{\"@type\":[\"Person\"],\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\",\"name\":\"Claudia Irvine\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/\",\"inLanguage\":\"en_US\",\"url\":\"\",\"caption\":\"Claudia Irvine\"}},{\"@type\":[\"Person\"],\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\",\"name\":\"Tylor Quinley\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/\",\"inLanguage\":\"en_US\",\"url\":\"\",\"caption\":\"Tylor Quinley\"}}]]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"What Is Database Subsetting? How to Subset a Database. - IRI","description":"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/","og_locale":"en_US","og_type":"article","og_title":"What Is Database Subsetting? How to Subset a Database.","og_description":"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.","og_url":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/","og_site_name":"IRI","article_published_time":"2015-12-15T16:15:36+00:00","article_modified_time":"2025-02-12T19:04:05+00:00","og_image":[{"width":857,"height":837,"url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","type":"image\/png"}],"author":"Claudia Irvine and Tylor Quinley","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Claudia Irvine and Tylor Quinley","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#article","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/"},"author":{"name":"Claudia Irvine and Tylor Quinley","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0"},"headline":"What Is Database Subsetting? How to Subset a Database.","datePublished":"2015-12-15T16:15:36+00:00","dateModified":"2025-02-12T19:04:05+00:00","mainEntityOfPage":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/"},"wordCount":1100,"commentCount":0,"publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","keywords":["column masking","data management platform","data masking","database subsetting","database testing","DB subsets","field-preserving encryption","IRI FieldShield","IRI RowGen","IRI Voracity","IRI Workbench GUI","job wizard","referentially-correct subsets","test data generation","test database","test subsets"],"articleSection":["Data Masking\/Protection","IRI Workbench","Test Data","VLDB"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/","url":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/","name":"What Is Database Subsetting? How to Subset a Database. - IRI","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage"},"image":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","datePublished":"2015-12-15T16:15:36+00:00","dateModified":"2025-02-12T19:04:05+00:00","description":"Learn how to effectively perform database subsetting to reduce database testing costs and risks associated with large, unmasked databases.","breadcrumb":{"@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#primaryimage","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","width":857,"height":837},{"@type":"BreadcrumbList","@id":"https:\/\/www.iri.com\/blog\/vldb-operations\/iris-new-database-subsetting-wizard\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.iri.com\/blog\/"},{"@type":"ListItem","position":2,"name":"What Is Database Subsetting? How to Subset a Database."}]},{"@type":"WebSite","@id":"https:\/\/www.iri.com\/blog\/#website","url":"https:\/\/www.iri.com\/blog\/","name":"IRI","description":"Total Data Management Blog","publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.iri.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.iri.com\/blog\/#organization","name":"IRI","url":"https:\/\/www.iri.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","width":750,"height":206,"caption":"IRI"},"image":{"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/"}},[{"@type":["Person"],"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0","name":"Claudia Irvine","image":{"@type":"ImageObject","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/","inLanguage":"en_US","url":"","caption":"Claudia Irvine"}},{"@type":["Person"],"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0","name":"Tylor Quinley","image":{"@type":"ImageObject","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/","inLanguage":"en_US","url":"","caption":"Tylor Quinley"}}]]}},"jetpack_featured_media_url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2015\/12\/4.png","_links":{"self":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/8439"}],"collection":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/users\/43"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/comments?post=8439"}],"version-history":[{"count":60,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/8439\/revisions"}],"predecessor-version":[{"id":18226,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/8439\/revisions\/18226"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media\/16620"}],"wp:attachment":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media?parent=8439"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/categories?post=8439"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/tags?post=8439"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}