{"id":13416,"date":"2020-01-09T14:06:33","date_gmt":"2020-01-09T19:06:33","guid":{"rendered":"http:\/\/www.iri.com\/blog\/?p=13416"},"modified":"2026-02-23T17:18:57","modified_gmt":"2026-02-23T22:18:57","slug":"masking-pii-amazon-s3","status":"publish","type":"post","link":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/","title":{"rendered":"PII Masking in Amazon S3 Files"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">Just as users of the <\/span><a href=\"https:\/\/www.iri.com\/products\/fieldshield\"><span style=\"font-weight: 400;\">IRI FieldShield<\/span><\/a><span style=\"font-weight: 400;\"> data masking tool can <\/span><a href=\"https:\/\/www.iri.com\/solutions\/data-masking\"><span style=\"font-weight: 400;\">mask<\/span><\/a><span style=\"font-weight: 400;\"> personally identifiable information (PII) in flat files held in <\/span><span style=\"font-weight: 400;\">Amazon Simple Storage Service (<\/span><a href=\"https:\/\/aws.amazon.com\/s3\/\"><span style=\"font-weight: 400;\">Amazon S3<\/span><\/a><span style=\"font-weight: 400;\">)<\/span><span style=\"font-weight: 400;\"> buckets, <\/span><a href=\"https:\/\/www.iri.com\/products\/darkshield\"><span style=\"font-weight: 400;\">IRI DarkShield<\/span><\/a><span style=\"font-weight: 400;\"> users can now find and mask PII in <em>structured, semi-structured, and <\/em><\/span><i><span style=\"font-weight: 400;\">unstructured <\/span><\/i><span style=\"font-weight: 400;\">files stored in S3, too. DarkShield can also protect data in Azure Blob, GCP and SharePoint Online, as well as on-premise folders. This article covers S3 only.<\/span><\/p>\n<h4><b>What is Amazon S3?<\/b><\/h4>\n<p><span style=\"font-weight: 400;\">Amazon S3 is an object storage service designed to make web-scale computing easier for developers. It is used to securely store any amount of data for a range of use cases, including websites, mobile applications and big data analytics.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In terms of implementation, the Amazon S3 operates as a simple <\/span><a href=\"https:\/\/aws.amazon.com\/nosql\/key-value\/\"><span style=\"font-weight: 400;\">key-value<\/span><\/a><span style=\"font-weight: 400;\"> store designed to store as many objects as you want. Objects are placed into user-defined containers called \u201cBuckets\u201d and can be retrieved or manipulated using the Amazon S3 API.\u00a0<\/span><\/p>\n<h4><b>Example: Amazon S3 Data Masking with DarkShield<\/b><\/h4>\n<p><span style=\"font-weight: 400;\">This example uses some elements of Dark Data Discovery<\/span><i><span style=\"font-weight: 400;\">. <\/span><\/i><span style=\"font-weight: 400;\">The general idea is that, after parsing through data in unstructured sources, you can output what you\u2019re looking for in a structured text (flat) file, with its layouts automatically defined in a data definition file (.DDF).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">While this example provides a brief introduction to dark data discovery, you may find it useful to read <\/span><a href=\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/finding-dark-data-unstructured-sources\/\"><span style=\"font-weight: 400;\">this<\/span><\/a> <span style=\"font-weight: 400;\">three part blog series that explores the feature in depth.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For this tutorial, I created a new Amazon S3 bucket containing a text file named \u201chelloS3.txt\u201d. The file contains \u201cHello, S3 Bucket!\u201d. There is also a folder named \u201cfolder\u201d within the bucket as well, containing the same file.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The goal is to use the Dark Data Discovery wizard to create a search and remediation job that will mask all of the data within the specified bucket and upload the masked data to another bucket within the same account.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For reference, this is how the Amazon S3 Bucket is setup:<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_setup.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-13418 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_setup-1024x292.png\" alt=\"\" width=\"922\" height=\"263\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_setup-1024x292.png 1024w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_setup-300x86.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_setup-768x219.png 768w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_setup.png 1600w\" sizes=\"(max-width: 922px) 100vw, 922px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">To access the <\/span><i><span style=\"font-weight: 400;\">Dark Data Discovery wizard<\/span><\/i><span style=\"font-weight: 400;\">, select the DarkShield dropdown menu from the IRI Workbench toolbar and select <\/span><i><span style=\"font-weight: 400;\">Dark Data Discovery Job<\/span><\/i><span style=\"font-weight: 400;\">. The wizard will open this way. Define a destination for the job and navigate to the <\/span><i><span style=\"font-weight: 400;\">Data Sources <\/span><\/i><span style=\"font-weight: 400;\">window. Selecting <\/span><i><span style=\"font-weight: 400;\">Add <\/span><\/i><span style=\"font-weight: 400;\">will take you to the window displayed below.<\/span><\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_source.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-13419 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_source.png\" alt=\"\" width=\"525\" height=\"450\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_source.png 525w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_source-300x257.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_source-350x300.png 350w\" sizes=\"(max-width: 525px) 100vw, 525px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Input the relevant data from your S3 Bucket. The access and security key are obtained from <\/span><a href=\"https:\/\/docs.aws.amazon.com\/general\/latest\/gr\/aws-sec-cred-types.html\"><span style=\"font-weight: 400;\">your AWS credentials<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">An optional prefix can also be specified in a URI. Doing so will filter only on the objects that appear under the given prefix. For example, inputting \u201cfolder\u201d into the prefix field will only return objects that appear within \u201cfolder\u201d in the S3 Bucket. Otherwise, all objects are returned.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">After all the relevant information is selected, select <\/span><i><span style=\"font-weight: 400;\">OK. <\/span><\/i><span style=\"font-weight: 400;\">The next window will prompt you to specify a data target as seen below.\u00a0<\/span><\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_target.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-13420 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_target.png\" alt=\"\" width=\"523\" height=\"448\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_target.png 525w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_target-300x257.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_target-350x300.png 350w\" sizes=\"(max-width: 523px) 100vw, 523px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">For the data target, we will specify another bucket within our Amazon S3 account named \u201cdarkshield-test-remediated\u201d using the same access and secret key as our source. If there is no bucket with that name contained within the account, a new bucket with the same name will be automatically created.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It also should be noted that data sources and targets can be mixed. For example, you can establish any combination of routes such as \u201cFile to S3 Bucket\u201d, \u201cS3 Bucket to S3 Bucket\u201d and \u201cS3 Bucket to File\u201d depending on your use case.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When transferring contents from an S3 Bucket from one AWS account to a bucket on a different AWS account, certain permissions have to be set in each account in order for the transfer to be successful. For a guide on how to do that, please refer to <\/span><a href=\"https:\/\/aws.amazon.com\/premiumsupport\/knowledge-center\/copy-s3-objects-account\/\"><span style=\"font-weight: 400;\">this<\/span><\/a><span style=\"font-weight: 400;\"> article.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We will create a simple Search Matcher for matching the entire contents of our sample files. We can create an ad-hoc \u201cALL\u201d Data Class loaded with the regex pattern \u201c.+\u201d to match on the entire contents of the document.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For a more detailed overview of Data Classes and search methodologies used in DarkShield and its sister products FieldShield and CellShield EE, refer to the <\/span><a href=\"https:\/\/www.iri.com\/blog\/vldb-operations\/data-classification-in-iri-workbench\/\"><span style=\"font-weight: 400;\">following<\/span><\/a><span style=\"font-weight: 400;\"> article on Data Classification.<\/span><\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_group.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-13421 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_data_group.png\" alt=\"\" width=\"524\" height=\"391\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_group.png 603w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_data_group-300x224.png 300w\" sizes=\"(max-width: 524px) 100vw, 524px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">We will attach a simple Data Redaction rule to our Search Matcher to replace every matched character, including whitespace, with an asterisk \u2018*\u2019. The screenshot below shows the final state of our Search Matcher.<\/span><\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_search_matcher.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-13422 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_search_matcher.png\" alt=\"\" width=\"524\" height=\"483\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_search_matcher.png 536w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_search_matcher-300x276.png 300w\" sizes=\"(max-width: 524px) 100vw, 524px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">Note that for the purposes of this basic AWS S3 data protection demo, we will not be going into detail regarding the optional <\/span><i><span style=\"font-weight: 400;\">Filters <\/span><\/i><span style=\"font-weight: 400;\">field. For more information, please check our upcoming Filters <a href=\"http:\/\/www.iri.com\/blog\/data-protection\/masking-pii-xml-json\/\">blog<\/a>.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Finishing the wizard generates a <\/span><i><span style=\"font-weight: 400;\">.search <\/span><\/i><span style=\"font-weight: 400;\">file, which you can right-click and select <\/span><i><span style=\"font-weight: 400;\">Run As -&gt; IRI Search and Remediate Job <\/span><\/i><span style=\"font-weight: 400;\">to execute. You can see our .<\/span><i><span style=\"font-weight: 400;\">darkdata <\/span><\/i><span style=\"font-weight: 400;\">search results in the image below, where we matched on the entire contents of the two files within our S3 bucket.<\/span><\/p>\n<p><a href=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_new_search.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-13423 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_new_search.png\" alt=\"\" width=\"379\" height=\"167\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_new_search.png 379w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_new_search-300x132.png 300w\" sizes=\"(max-width: 379px) 100vw, 379px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">DarkShield will create the <\/span><i><span style=\"font-weight: 400;\">darkshield-test-remediated <\/span><\/i><span style=\"font-weight: 400;\">bucket if it doesn\u2019t already exist, and place the remediated files within while retaining the same folder structure as the original:<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-13424 aligncenter\" src=\"\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated-1024x321.png\" alt=\"\" width=\"922\" height=\"289\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated-1024x321.png 1024w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated-300x94.png 300w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated-768x241.png 768w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/s3_remadiated.png 1600w\" sizes=\"(max-width: 922px) 100vw, 922px\" \/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">If you have any questions about, or need help using, DarkShield for protecting PII in AWS S3 sources and\/or targets, please email <\/span><a href=\"mailto:darkshield@iri.com\"><span style=\"font-weight: 400;\">darkshield@iri.com<\/span><\/a><span style=\"font-weight: 400;\"> or contact your <\/span><a href=\"https:\/\/www.iri.com\/partners\/resellers\"><span style=\"font-weight: 400;\">IRI nearest representative<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Just as users of the IRI FieldShield data masking tool can mask personally identifiable information (PII) in flat files held in Amazon Simple Storage Service (Amazon S3) buckets, IRI DarkShield users can now find and mask PII in structured, semi-structured, and unstructured files stored in S3, too. DarkShield can also protect data in Azure Blob,<\/p>\n<div><a class=\"btn-filled btn\" href=\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\" title=\"PII Masking in Amazon S3 Files\">Read More<\/a><\/div>\n","protected":false},"author":122,"featured_media":13426,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[8,2255],"tags":[2084,2085,1456,2081,1367,311,1454,1386,1388,850,149,1306,2082,2083,95,1455,1453],"class_list":["post-13416","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-protection","category-archived-articles","tag-amazon-s3-pii-protection","tag-aws-s3-data-protection","tag-amazon-s3","tag-amazon-s3-data-masking","tag-aws","tag-bucket","tag-dark-data-discovery","tag-darkshield","tag-iri-darkshield","tag-iri-workbench","tag-pii","tag-pii-masking","tag-pii-masking-in-amazon-s3","tag-protecting-pii-in-aws-s3","tag-redaction","tag-remdiation","tag-s3"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.4 (Yoast SEO v23.4) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>PII Masking in Amazon S3 Files - IRI<\/title>\n<meta name=\"description\" content=\"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"PII Masking in Amazon S3 Files\" \/>\n<meta property=\"og:description\" content=\"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\" \/>\n<meta property=\"og:site_name\" content=\"IRI\" \/>\n<meta property=\"article:published_time\" content=\"2020-01-09T19:06:33+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-23T22:18:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png\" \/>\n\t<meta property=\"og:image:width\" content=\"580\" \/>\n\t<meta property=\"og:image:height\" content=\"200\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Avory Richie\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Avory Richie\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\"},\"author\":{\"name\":\"Avory Richie\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c43f5ebe932234ac4883cdf194d144c3\"},\"headline\":\"PII Masking in Amazon S3 Files\",\"datePublished\":\"2020-01-09T19:06:33+00:00\",\"dateModified\":\"2026-02-23T22:18:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\"},\"wordCount\":859,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png\",\"keywords\":[\"\u200b Amazon S3 PII protection\",\"\u200b AWS S3 data protection\",\"Amazon S3\",\"Amazon S3 data masking\u200b\",\"aws\",\"bucket\",\"Dark Data Discovery\",\"DarkShield\",\"IRI DarkShield\",\"IRI Workbench\",\"PII\",\"pii masking\",\"PII masking in Amazon S3\u200b\",\"Protecting PII in AWS S3\",\"redaction\",\"remdiation\",\"s3\"],\"articleSection\":[\"Data Masking\/Protection\",\"Archived Articles\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\",\"url\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\",\"name\":\"PII Masking in Amazon S3 Files - IRI\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png\",\"datePublished\":\"2020-01-09T19:06:33+00:00\",\"dateModified\":\"2026-02-23T22:18:57+00:00\",\"description\":\"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png\",\"width\":580,\"height\":200},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.iri.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"PII Masking in Amazon S3 Files\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.iri.com\/blog\/#website\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"name\":\"IRI\",\"description\":\"Total Data Management Blog\",\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.iri.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\",\"name\":\"IRI\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"width\":750,\"height\":206,\"caption\":\"IRI\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c43f5ebe932234ac4883cdf194d144c3\",\"name\":\"Avory Richie\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/3b0bf7484bebc331040f119a6578c5e7?s=96&d=blank&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/3b0bf7484bebc331040f119a6578c5e7?s=96&d=blank&r=g\",\"caption\":\"Avory Richie\"},\"url\":\"https:\/\/www.iri.com\/blog\/author\/avoryr\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"PII Masking in Amazon S3 Files - IRI","description":"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/","og_locale":"en_US","og_type":"article","og_title":"PII Masking in Amazon S3 Files","og_description":"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.","og_url":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/","og_site_name":"IRI","article_published_time":"2020-01-09T19:06:33+00:00","article_modified_time":"2026-02-23T22:18:57+00:00","og_image":[{"width":580,"height":200,"url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","type":"image\/png"}],"author":"Avory Richie","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Avory Richie","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#article","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/"},"author":{"name":"Avory Richie","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c43f5ebe932234ac4883cdf194d144c3"},"headline":"PII Masking in Amazon S3 Files","datePublished":"2020-01-09T19:06:33+00:00","dateModified":"2026-02-23T22:18:57+00:00","mainEntityOfPage":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/"},"wordCount":859,"commentCount":0,"publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","keywords":["\u200b Amazon S3 PII protection","\u200b AWS S3 data protection","Amazon S3","Amazon S3 data masking\u200b","aws","bucket","Dark Data Discovery","DarkShield","IRI DarkShield","IRI Workbench","PII","pii masking","PII masking in Amazon S3\u200b","Protecting PII in AWS S3","redaction","remdiation","s3"],"articleSection":["Data Masking\/Protection","Archived Articles"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/","url":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/","name":"PII Masking in Amazon S3 Files - IRI","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage"},"image":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","datePublished":"2020-01-09T19:06:33+00:00","dateModified":"2026-02-23T22:18:57+00:00","description":"Learn how to protect sensitive data in Amazon S3 buckets. Find and mask personally identifiable information (PII) in various file formats.","breadcrumb":{"@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#primaryimage","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","width":580,"height":200},{"@type":"BreadcrumbList","@id":"https:\/\/www.iri.com\/blog\/data-protection\/masking-pii-amazon-s3\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.iri.com\/blog\/"},{"@type":"ListItem","position":2,"name":"PII Masking in Amazon S3 Files"}]},{"@type":"WebSite","@id":"https:\/\/www.iri.com\/blog\/#website","url":"https:\/\/www.iri.com\/blog\/","name":"IRI","description":"Total Data Management Blog","publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.iri.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.iri.com\/blog\/#organization","name":"IRI","url":"https:\/\/www.iri.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","width":750,"height":206,"caption":"IRI"},"image":{"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c43f5ebe932234ac4883cdf194d144c3","name":"Avory Richie","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/3b0bf7484bebc331040f119a6578c5e7?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/3b0bf7484bebc331040f119a6578c5e7?s=96&d=blank&r=g","caption":"Avory Richie"},"url":"https:\/\/www.iri.com\/blog\/author\/avoryr\/"}]}},"jetpack_featured_media_url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2020\/01\/amazon-s3.png","_links":{"self":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/13416"}],"collection":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/users\/122"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/comments?post=13416"}],"version-history":[{"count":9,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/13416\/revisions"}],"predecessor-version":[{"id":18349,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/13416\/revisions\/18349"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media\/13426"}],"wp:attachment":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media?parent=13416"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/categories?post=13416"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/tags?post=13416"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}