{"id":1214,"date":"2012-06-25T15:35:53","date_gmt":"2012-06-25T15:35:53","guid":{"rendered":"http:\/\/www.iri.com\/blog\/?p=1214"},"modified":"2017-11-07T09:47:01","modified_gmt":"2017-11-07T14:47:01","slug":"what-is-unicode","status":"publish","type":"post","link":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/","title":{"rendered":"What is Unicode?"},"content":{"rendered":"<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1233\" title=\"unicode\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode.jpg\" alt=\"unicode\" width=\"273\" height=\"206\" \/><\/a>Unicode began as a project in 1987 between Apple and Xerox engineers in response to a need for an international standard of representation for every character in all major languages of the world. As the exchange of information and data became more prevalent electronically and internationally, there was a need for a unified code that could be read on any platform running any program. Prior to the development of Unicode, the primary ASCII coding scheme which used an 8-bit character representation only allowed for 256 characters.<\/p>\n<p>These early Unicode pioneers discovered that there were about 27,000 characters in the modern world and this resulted in a 16-bit fixed length character code which allowed for 65,000 characters, enough even for future expansion. Joe Becker, one of the Xerox engineers, coined the term Unicode\u009d from their requirements for a universal,\u00a0 uniform\u009d, and unique bit sequence to represent characters.<\/p>\n<p>The initial success of Unicode naturally relied on its adoption by other companies. Early in its development, major computer manufacturers, networking and software companies began making significant contributions to the design. In addition to Xerox and Apple, participating companies included Metaphor, Claris, Research Libraries Group, Sun, Microsoft, SHARE, IBM, Pacific Rim, Aldus, NeXT, and Novell.<\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif;\">By 1991, <a href=\"http:\/\/www.unicode.org\/history\/summary.html\">Unicode, Inc<\/a>. was incorporated with the original purpose to standardize, extend, and promote the Unicode character encoding. The original release date of Unicode was in October of that year. Version 1.0.0 contained codes for 7,161 characters. The most recent version, 6.0.0, was released in October 2010 and provided codes for 109,449 characters from the world&#8217;s alphabets, ideograph sets, and symbol collections.<\/span><\/p>\n<p>IRI first began developing support for Unicode data in CoSort Version 7.5. But with the release of <a title=\"CoSort 9.5 \u00e2\u20ac\u00a6 New Features\" href=\"http:\/\/www.iri.com\/blog\/miscellaneous\/how-cosort-transforms-and-manipulates-data\/\" target=\"_blank\" rel=\"noopener\">CoSort 9.5<\/a> there was a major re-design necessary to support the updates in characters that occurred.<\/p>\n<p>CoSort&#8217;s Sort Control Language (<a title=\"SortCL Product Tool Page\" href=\"http:\/\/www.iri.com\/products\/CoSort\/SortCL\" target=\"_blank\" rel=\"noopener\">SortCL<\/a>) program supports Unicode files and fields which may be mapped to database tables. SortCL can collate (sort), merge, join or convert Unicode characters and numerals in delimited or fixed-position fields.<\/p>\n<p>Conversion between Unicode and single-byte (e.g. ASCII) or native multi-byte characters (e.g. Chinese GBK\/Big5, Japanese, and Korean) is supported.\u00a0 Conversion between a variety of numeric data formats and Unicode digits is supported in both CoSort (SortCL) and <a title=\"NextForm\" href=\"http:\/\/www.iri.com\/products\/nextform\" target=\"_blank\" rel=\"noopener\">NextForm<\/a>, IRI&#8217;s standalone data migration package.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Unicode began as a project in 1987 between Apple and Xerox engineers in response to a need for an international standard of representation for every character in all major languages of the world. As the exchange of information and data became more prevalent electronically and internationally, there was a need for a unified code that<\/p>\n<div><a class=\"btn-filled btn\" href=\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\" title=\"What is Unicode?\">Read More<\/a><\/div>\n","protected":false},"author":5,"featured_media":11861,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[31],"tags":[44,70,69,68,67],"class_list":["post-1214","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-migration","tag-cosort","tag-data-exchange","tag-programming-language","tag-sortcl","tag-unicode"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.4 (Yoast SEO v23.4) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What is Unicode? - IRI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What is Unicode?\" \/>\n<meta property=\"og:description\" content=\"Unicode began as a project in 1987 between Apple and Xerox engineers in response to a need for an international standard of representation for every character in all major languages of the world. As the exchange of information and data became more prevalent electronically and internationally, there was a need for a unified code thatRead More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\" \/>\n<meta property=\"og:site_name\" content=\"IRI\" \/>\n<meta property=\"article:published_time\" content=\"2012-06-25T15:35:53+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-11-07T14:47:01+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"1200\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Jason Koivu\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Jason Koivu\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\"},\"author\":{\"name\":\"Jason Koivu\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c60bc4ff5919427034376979fb2cc8df\"},\"headline\":\"What is Unicode?\",\"datePublished\":\"2012-06-25T15:35:53+00:00\",\"dateModified\":\"2017-11-07T14:47:01+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\"},\"wordCount\":372,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png\",\"keywords\":[\"CoSort\",\"data exchange\",\"programming language\",\"SortCL\",\"Unicode\"],\"articleSection\":[\"Data Migration\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\",\"url\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\",\"name\":\"What is Unicode? - IRI\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png\",\"datePublished\":\"2012-06-25T15:35:53+00:00\",\"dateModified\":\"2017-11-07T14:47:01+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png\",\"width\":1200,\"height\":1200,\"caption\":\"unicode logo\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.iri.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"What is Unicode?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.iri.com\/blog\/#website\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"name\":\"IRI\",\"description\":\"Total Data Management Blog\",\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.iri.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\",\"name\":\"IRI\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"width\":750,\"height\":206,\"caption\":\"IRI\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c60bc4ff5919427034376979fb2cc8df\",\"name\":\"Jason Koivu\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/01e97234ff964558ca620a43a0506ef0?s=96&d=blank&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/01e97234ff964558ca620a43a0506ef0?s=96&d=blank&r=g\",\"caption\":\"Jason Koivu\"},\"url\":\"https:\/\/www.iri.com\/blog\/author\/jasonk\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"What is Unicode? - IRI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/","og_locale":"en_US","og_type":"article","og_title":"What is Unicode?","og_description":"Unicode began as a project in 1987 between Apple and Xerox engineers in response to a need for an international standard of representation for every character in all major languages of the world. As the exchange of information and data became more prevalent electronically and internationally, there was a need for a unified code thatRead More","og_url":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/","og_site_name":"IRI","article_published_time":"2012-06-25T15:35:53+00:00","article_modified_time":"2017-11-07T14:47:01+00:00","og_image":[{"width":1200,"height":1200,"url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","type":"image\/png"}],"author":"Jason Koivu","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Jason Koivu","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#article","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/"},"author":{"name":"Jason Koivu","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c60bc4ff5919427034376979fb2cc8df"},"headline":"What is Unicode?","datePublished":"2012-06-25T15:35:53+00:00","dateModified":"2017-11-07T14:47:01+00:00","mainEntityOfPage":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/"},"wordCount":372,"commentCount":0,"publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","keywords":["CoSort","data exchange","programming language","SortCL","Unicode"],"articleSection":["Data Migration"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/","url":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/","name":"What is Unicode? - IRI","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage"},"image":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","datePublished":"2012-06-25T15:35:53+00:00","dateModified":"2017-11-07T14:47:01+00:00","breadcrumb":{"@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#primaryimage","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","width":1200,"height":1200,"caption":"unicode logo"},{"@type":"BreadcrumbList","@id":"https:\/\/www.iri.com\/blog\/migration\/data-migration\/what-is-unicode\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.iri.com\/blog\/"},{"@type":"ListItem","position":2,"name":"What is Unicode?"}]},{"@type":"WebSite","@id":"https:\/\/www.iri.com\/blog\/#website","url":"https:\/\/www.iri.com\/blog\/","name":"IRI","description":"Total Data Management Blog","publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.iri.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.iri.com\/blog\/#organization","name":"IRI","url":"https:\/\/www.iri.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","width":750,"height":206,"caption":"IRI"},"image":{"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/c60bc4ff5919427034376979fb2cc8df","name":"Jason Koivu","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/01e97234ff964558ca620a43a0506ef0?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/01e97234ff964558ca620a43a0506ef0?s=96&d=blank&r=g","caption":"Jason Koivu"},"url":"https:\/\/www.iri.com\/blog\/author\/jasonk\/"}]}},"jetpack_featured_media_url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2012\/06\/unicode-logo.png","_links":{"self":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/1214"}],"collection":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/comments?post=1214"}],"version-history":[{"count":43,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/1214\/revisions"}],"predecessor-version":[{"id":11862,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/1214\/revisions\/11862"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media\/11861"}],"wp:attachment":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media?parent=1214"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/categories?post=1214"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/tags?post=1214"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}