{"id":11364,"date":"2017-08-23T10:06:53","date_gmt":"2017-08-23T14:06:53","guid":{"rendered":"http:\/\/www.iri.com\/blog\/?p=11364"},"modified":"2026-02-23T16:49:16","modified_gmt":"2026-02-23T21:49:16","slug":"connecting-voracity-to-cloudera","status":"publish","type":"post","link":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/","title":{"rendered":"Connecting Voracity to Cloudera"},"content":{"rendered":"<p class=\"c1\"><span class=\"c2\">This article, along with counterpart articles for <a href=\"http:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-hortonworks\/\">HortonWorks <\/a>(<\/span><span class=\"c2\">Ambari<\/span><span class=\"c2\">) and\u00a0<\/span><a href=\"http:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-mapr\/\"><span class=\"c2\">MapR<\/span><\/a><span class=\"c2\">, describes the simple 3-step process to connect the\u00a0<\/span><span class=\"c16 c2\"><a class=\"c11\" href=\"https:\/\/www.google.com\/url?q=http:\/\/www.iri.com\/products\/voracity&amp;sa=D&amp;ust=1503499267892000&amp;usg=AFQjCNETY2iuTpszvpu1c1wNfNRLoXPEzg\">IRI Voracity<\/a><\/span><span class=\"c2\">\u00a0big data management platform to a Cloudera distribution through the\u00a0<\/span><span class=\"c16 c2\"><a class=\"c11\" href=\"https:\/\/www.google.com\/url?q=http:\/\/www.iri.com\/blog\/data-transformation2\/running-voracity-jobs-in-hadoop\/&amp;sa=D&amp;ust=1503499267893000&amp;usg=AFQjCNEZSagx1TykGK6mL8HgsfU6ehpBDw\">VGrid<\/a><\/span><span class=\"c0\">\u00a0Gateway.<\/span><\/p>\n<p class=\"c1\"><span class=\"c2\">After connecting, data can be conveniently moved between HDFS and other systems. Furthermore, that data can be manipulated and transformed in Hadoop via MR2, Spark, Spark Stream, Storm or Tez using the jobs created in Voracity\u2019s Eclipse IDE,\u00a0<\/span><span class=\"c2 c16\"><a class=\"c11\" href=\"https:\/\/www.google.com\/url?q=http:\/\/www.iri.com\/products\/workbench\/voracity-gui&amp;sa=D&amp;ust=1503499267893000&amp;usg=AFQjCNFDuLa95jL6ixBREBYDNKCvogXzBw\">IRI Workbench<\/a><\/span><span class=\"c0\">.<\/span><\/p>\n<h2 class=\"c1\"><span class=\"c3 c18\">Step 1 &#8211; Collect Information from Cloudera Manager Dashboard<\/span><\/h2>\n<p class=\"c1 c6\"><span class=\"c0\">1. Log into Cloudera Manager.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11365\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\" alt=\"\" width=\"552\" height=\"378\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1-300x205.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">2. Click\u00a0<\/span><strong><span class=\"c3 c2\">HDFS<\/span><\/strong><span class=\"c2\">. Then, click\u00a0<\/span><strong><span class=\"c3 c2\">Configuration<\/span><\/strong><span class=\"c0\">.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-2.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11366\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-2.png\" alt=\"\" width=\"552\" height=\"233\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-2.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-2-300x127.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">3. Make a note of the\u00a0<\/span><strong><span class=\"c3 c2\">NameNode Web UI Port<\/span><\/strong><span class=\"c2\">\u00a0(<\/span><em><span class=\"c17 c2\">dfs.http.address, dfs.namenode.http-address<\/span><\/em><span class=\"c0\">) value (in this case: 50070)<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-3.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11367\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-3.png\" alt=\"\" width=\"552\" height=\"223\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-3.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-3-300x121.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">4. Make a note of the\u00a0<\/span><strong><span class=\"c3 c2\">NameNode Port<\/span><\/strong><span class=\"c2\">\u00a0(<\/span><em><span class=\"c17 c2\">fs.default.name<\/span><\/em><span class=\"c0\">) value (in this case: 8020)<\/span><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">5. Open Cloudera Manager and click\u00a0<\/span><strong><span class=\"c3 c2\">YARN<\/span><\/strong><span class=\"c2\">. Then, click\u00a0<\/span><strong><span class=\"c3 c2\">Configuration<\/span><\/strong><span class=\"c0\">.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-4.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11368\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-4.png\" alt=\"\" width=\"552\" height=\"262\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-4.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-4-300x142.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">6. Make a note of the\u00a0<\/span><strong><span class=\"c3 c2\">ResourceManager Web Application HTTP Port<\/span><\/strong><span class=\"c2\"><strong>\u00a0<\/strong>(<\/span><em><span class=\"c17 c2\">yarn.resourcemanager.webapp.address<\/span><\/em><span class=\"c0\">) value (in this case: 8088)<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-5.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11369\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-5.png\" alt=\"\" width=\"552\" height=\"244\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-5.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-5-300x133.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">7. Make a note of the\u00a0<\/span><strong><span class=\"c3 c2\">MapReduce JobHistory Web Application HTTP Port<\/span><\/strong><span class=\"c2\">\u00a0(<\/span><em><span class=\"c17 c2\">mapreduce.jobhistory.webapp.address<\/span><\/em><span class=\"c0\">) value (in this case: 19888)<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-6.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11370\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-6.png\" alt=\"\" width=\"552\" height=\"222\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-6.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-6-300x121.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">8. Make a note of the<strong>\u00a0<\/strong><\/span><strong><span class=\"c3 c2\">Resource Manager Address<\/span><\/strong><span class=\"c2\">\u00a0(<\/span><em><span class=\"c2 c17\">yarn.resourcemanager.address<\/span><\/em><span class=\"c0\">) value (in this case: 8032)<\/span><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">9. Go back to the main Cloudera Manager screen and click on\u00a0<\/span><strong><span class=\"c3 c2\">Oozie<\/span><\/strong><span class=\"c2\">. Then, click\u00a0<\/span><strong><span class=\"c3 c2\">Configuration<\/span><\/strong><span class=\"c0\">.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-7.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11371\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-7.png\" alt=\"\" width=\"552\" height=\"218\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-7.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-7-300x118.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">10. Make a note of the\u00a0<\/span><span class=\"c3 c2\">Oozie HTTP Port<\/span><span class=\"c0\">\u00a0value (in this case: 11000)<\/span><\/p>\n<h2 id=\"h.latmhhjxt4z8\" class=\"c15\"><span class=\"c18 c3 c19\">Step 2- Enter configuration details in VGrid Dashboard<\/span><\/h2>\n<p class=\"c1 c6\"><span class=\"c0\">1. Log into the VGrid Gateway.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-8.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11372\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-8.png\" alt=\"\" width=\"360\" height=\"326\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-8.png 360w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-8-300x272.png 300w\" sizes=\"(max-width: 360px) 100vw, 360px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">2. Click\u00a0<\/span><strong><span class=\"c3 c2\">User &gt; Add User<\/span><\/strong><span class=\"c0\">\u00a0and enter the user information.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-9.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11373\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-9.png\" alt=\"\" width=\"456\" height=\"285\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-9.png 456w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-9-300x188.png 300w\" sizes=\"(max-width: 456px) 100vw, 456px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">3. Click the\u00a0<\/span><strong><span class=\"c3 c2\">X<\/span><\/strong><span class=\"c0\"><strong>\u00a0<\/strong>in the success banner to refresh the screen.<\/span><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">4. Click\u00a0<\/span><strong><span class=\"c3 c2\">Detail\u00a0<\/span><\/strong><span class=\"c0\">in the Action section of the new user.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-10.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11374\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-10.png\" alt=\"\" width=\"552\" height=\"163\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-10.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-10-300x89.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c0\">5. Make note of the generated API key shown. It will be needed in the VGrid Gateway setup in the Workbench preferences screen.<\/span><\/p>\n<p class=\"c1\"><span class=\"c0\">\u00a0<\/span><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-11.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11375\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-11.png\" alt=\"\" width=\"552\" height=\"118\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-11.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-11-300x64.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">6. Click\u00a0<\/span><strong><span class=\"c3 c2\">HadoopConfig<\/span><\/strong><span class=\"c2\"><strong>\u00a0<\/strong>and\u00a0<\/span><strong><span class=\"c3 c2\">Add Hadoop Config<\/span><\/strong><span class=\"c0\">.<\/span><\/p>\n<ul>\n<li><strong>Cluster\u00a0<\/strong>= Cluster Name<\/li>\n<li><strong>User<\/strong> = User name that will be used as the user in the Hadoop file system when working in Workbench<\/li>\n<li><strong>Hdfs<\/strong> = NameNode Web UI Port<\/li>\n<li><strong>Namenode<\/strong> = NameNode Port<\/li>\n<li><strong>Proxy<\/strong> = Resource Manager Web Application HTTP Port<\/li>\n<li><strong>History<\/strong> = MapReduce JobHistory Web Application HTTP Port<\/li>\n<li><strong>Jobtracker<\/strong> = Resource Manager Address<\/li>\n<li><strong>Oozie<\/strong> = Oozie HTTP Port<\/li>\n<\/ul>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-12.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11376\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-12.png\" alt=\"\" width=\"456\" height=\"434\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-12.png 456w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-12-300x286.png 300w\" sizes=\"(max-width: 456px) 100vw, 456px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">7. Click the\u00a0<\/span><strong><span class=\"c3 c2\">X<\/span><\/strong><span class=\"c0\"><strong>\u00a0<\/strong>in the success banner to refresh the screen.<\/span><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">8. Click\u00a0<\/span><strong><span class=\"c3 c2\">HadoopConfig<\/span><\/strong><span class=\"c2\">\u00a0and click\u00a0<\/span><strong><span class=\"c3 c2\">inactive<\/span><\/strong><span class=\"c0\"><strong>\u00a0<\/strong>to activate that configuration.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-13.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11377\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-13.png\" alt=\"\" width=\"553\" height=\"125\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-13.png 553w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-13-300x68.png 300w\" sizes=\"(max-width: 553px) 100vw, 553px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c0\">9. Multiple configurations can be associated with each user; however, only one can be active at any given time.<\/span><\/p>\n<h2 id=\"h.mpxgru96urtb\" class=\"c15\"><span class=\"c19 c18 c3\">Step 3 &#8211; Enter configuration details in IRI Workbench<\/span><\/h2>\n<p class=\"c1 c6\"><span class=\"c2\">1. Open\u00a0<\/span><strong><span class=\"c3 c2\">IRI Workbench<\/span><\/strong><span class=\"c2\">. On the\u00a0<\/span><strong><span class=\"c3 c2\">IRI &gt; VGrid Gateway<\/span><\/strong><span class=\"c0\">\u00a0preferences screen, enter the details of the connection.<\/span><\/p>\n<p><a href=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-14.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-11378\" src=\"http:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-14.png\" alt=\"\" width=\"552\" height=\"331\" srcset=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-14.png 552w, https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-14-300x180.png 300w\" sizes=\"(max-width: 552px) 100vw, 552px\" \/><\/a><\/p>\n<p class=\"c1 c6\"><span class=\"c2\">2. Click\u00a0<\/span><strong><span class=\"c3 c2\">Test Connection<\/span><\/strong><span class=\"c0\">\u00a0to confirm that a successful connection is made. Included engines can be narrowed down here if only certain engines are being used in the Hadoop environment. Also, a default engine can be selected for Hadoop run configurations.<\/span><\/p>\n<p class=\"c13\"><span class=\"c7\">Once connected, you should be able to interact with HDFS and run compatible Voracity jobs seamlessly per\u00a0<\/span><span class=\"c16 c7\"><a class=\"c11\" href=\"https:\/\/www.google.com\/url?q=http:\/\/www.iri.com\/blog\/data-transformation2\/running-voracity-jobs-in-hadoop\/&amp;sa=D&amp;ust=1503499267902000&amp;usg=AFQjCNEnPAcE03-luAbwHkpM7Xo5tGvU-Q\">this article<\/a><\/span><span class=\"c7\">. If you have any questions or need assistance, contact\u00a0<\/span><span class=\"c16 c7\"><a class=\"c11\" href=\"mailto:voracity@iri.com\">voracity@iri.com<\/a><\/span><span class=\"c7 c9\">.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This article, along with counterpart articles for HortonWorks (Ambari) and\u00a0MapR, describes the simple 3-step process to connect the\u00a0IRI Voracity\u00a0big data management platform to a Cloudera distribution through the\u00a0VGrid\u00a0Gateway. After connecting, data can be conveniently moved between HDFS and other systems. Furthermore, that data can be manipulated and transformed in Hadoop via MR2, Spark, Spark Stream,<\/p>\n<div><a class=\"btn-filled btn\" href=\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\" title=\"Connecting Voracity to Cloudera\">Read More<\/a><\/div>\n","protected":false},"author":43,"featured_media":11365,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[108,1,776,91,2255],"tags":[],"class_list":["post-11364","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-big-data-2","category-data-transformation2","category-etl","category-iri-workbench","category-archived-articles"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.4 (Yoast SEO v23.4) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Connecting Voracity to Cloudera - IRI<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Connecting Voracity to Cloudera\" \/>\n<meta property=\"og:description\" content=\"This article, along with counterpart articles for HortonWorks (Ambari) and\u00a0MapR, describes the simple 3-step process to connect the\u00a0IRI Voracity\u00a0big data management platform to a Cloudera distribution through the\u00a0VGrid\u00a0Gateway. After connecting, data can be conveniently moved between HDFS and other systems. Furthermore, that data can be manipulated and transformed in Hadoop via MR2, Spark, Spark Stream,Read More\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\" \/>\n<meta property=\"og:site_name\" content=\"IRI\" \/>\n<meta property=\"article:published_time\" content=\"2017-08-23T14:06:53+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-02-23T21:49:16+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"552\" \/>\n\t<meta property=\"og:image:height\" content=\"378\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Claudia Irvine\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Claudia Irvine\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\"},\"author\":{\"name\":\"Claudia Irvine\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\"},\"headline\":\"Connecting Voracity to Cloudera\",\"datePublished\":\"2017-08-23T14:06:53+00:00\",\"dateModified\":\"2026-02-23T21:49:16+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\"},\"wordCount\":474,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\",\"articleSection\":[\"Big Data\",\"Data Transformation\",\"ETL\",\"IRI Workbench\",\"Archived Articles\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\",\"url\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\",\"name\":\"Connecting Voracity to Cloudera - IRI\",\"isPartOf\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\",\"datePublished\":\"2017-08-23T14:06:53+00:00\",\"dateModified\":\"2026-02-23T21:49:16+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png\",\"width\":552,\"height\":378},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.iri.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Connecting Voracity to Cloudera\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.iri.com\/blog\/#website\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"name\":\"IRI\",\"description\":\"Total Data Management Blog\",\"publisher\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.iri.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.iri.com\/blog\/#organization\",\"name\":\"IRI\",\"url\":\"https:\/\/www.iri.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"contentUrl\":\"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png\",\"width\":750,\"height\":206,\"caption\":\"IRI\"},\"image\":{\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0\",\"name\":\"Claudia Irvine\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g\",\"caption\":\"Claudia Irvine\"},\"url\":\"https:\/\/www.iri.com\/blog\/author\/claudiai\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Connecting Voracity to Cloudera - IRI","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/","og_locale":"en_US","og_type":"article","og_title":"Connecting Voracity to Cloudera","og_description":"This article, along with counterpart articles for HortonWorks (Ambari) and\u00a0MapR, describes the simple 3-step process to connect the\u00a0IRI Voracity\u00a0big data management platform to a Cloudera distribution through the\u00a0VGrid\u00a0Gateway. After connecting, data can be conveniently moved between HDFS and other systems. Furthermore, that data can be manipulated and transformed in Hadoop via MR2, Spark, Spark Stream,Read More","og_url":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/","og_site_name":"IRI","article_published_time":"2017-08-23T14:06:53+00:00","article_modified_time":"2026-02-23T21:49:16+00:00","og_image":[{"width":552,"height":378,"url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","type":"image\/png"}],"author":"Claudia Irvine","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Claudia Irvine","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#article","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/"},"author":{"name":"Claudia Irvine","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0"},"headline":"Connecting Voracity to Cloudera","datePublished":"2017-08-23T14:06:53+00:00","dateModified":"2026-02-23T21:49:16+00:00","mainEntityOfPage":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/"},"wordCount":474,"commentCount":0,"publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","articleSection":["Big Data","Data Transformation","ETL","IRI Workbench","Archived Articles"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/","url":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/","name":"Connecting Voracity to Cloudera - IRI","isPartOf":{"@id":"https:\/\/www.iri.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage"},"image":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage"},"thumbnailUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","datePublished":"2017-08-23T14:06:53+00:00","dateModified":"2026-02-23T21:49:16+00:00","breadcrumb":{"@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#primaryimage","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","width":552,"height":378},{"@type":"BreadcrumbList","@id":"https:\/\/www.iri.com\/blog\/data-transformation2\/connecting-voracity-to-cloudera\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.iri.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Connecting Voracity to Cloudera"}]},{"@type":"WebSite","@id":"https:\/\/www.iri.com\/blog\/#website","url":"https:\/\/www.iri.com\/blog\/","name":"IRI","description":"Total Data Management Blog","publisher":{"@id":"https:\/\/www.iri.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.iri.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.iri.com\/blog\/#organization","name":"IRI","url":"https:\/\/www.iri.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","contentUrl":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2019\/02\/iri-logo-total-data-management-small-1.png","width":750,"height":206,"caption":"IRI"},"image":{"@id":"https:\/\/www.iri.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/72af50bbb317610e193e96392081f9b0","name":"Claudia Irvine","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.iri.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/fb33dee44756bd13ac23f1342c2bb7b2?s=96&d=blank&r=g","caption":"Claudia Irvine"},"url":"https:\/\/www.iri.com\/blog\/author\/claudiai\/"}]}},"jetpack_featured_media_url":"https:\/\/www.iri.com\/blog\/wp-content\/uploads\/2017\/08\/cloudera-image-1.png","_links":{"self":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/11364"}],"collection":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/users\/43"}],"replies":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/comments?post=11364"}],"version-history":[{"count":3,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/11364\/revisions"}],"predecessor-version":[{"id":11408,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/posts\/11364\/revisions\/11408"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media\/11365"}],"wp:attachment":[{"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/media?parent=11364"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/categories?post=11364"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.iri.com\/blog\/wp-json\/wp\/v2\/tags?post=11364"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}