{"id":1949,"date":"2015-01-05T10:02:47","date_gmt":"2015-01-05T10:02:47","guid":{"rendered":"http:\/\/joapen.com\/blog\/?p=1949"},"modified":"2015-01-05T10:02:47","modified_gmt":"2015-01-05T10:02:47","slug":"hadoop-pig","status":"publish","type":"post","link":"http:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/","title":{"rendered":"Hadoop, Pig"},"content":{"rendered":"<p>My learning on the <a href=\"http:\/\/joapen.com\/blog\/2014\/12\/15\/hadoop-components\/\">basic concepts of Hadoop<\/a> continues.<\/p>\n<p>Pig has 2 basic elements:<\/p>\n<ul>\n<li>Pig Latin: a data flow language used by programmers to write Pig programs.<\/li>\n<li>Pig Latin compiler: converts Pig Latin code into executable code. The executable code takes the form of MapReduce jobs, or the compiler can spawn a process in which a virtual Hadoop instance is created to run the Pig code on a single node.<\/li>\n<\/ul>\n<p>Pig works alongside other Hadoop elements such as HDFS, the MapReduce framework, YARN&#8230;<\/p>\n<p>You can create macros in Pig Latin, and you can also access the Piggybank to reuse standard code.<\/p>\n<p>The main difference between MapReduce V1 and V2 is the existence of YARN.<\/p>\n<p>Pig vs. SQL<\/p>\n<ul>\n<li>Pig Latin is procedural; SQL is declarative.<\/li>\n<li>In Pig you can have a bag of tuples, and they can be duplicated; in the relational model behind SQL, a relation is a set of tuples, so every tuple is unique.<\/li>\n<li>In Pig, tuples in the same relation can have different numbers of columns.<\/li>\n<li>Pig handles ETL natively; SQL requires a separate ETL tool.<\/li>\n<li>Pig uses lazy evaluation. In an RDBMS, commands are invoked immediately.<\/li>\n<li>In Pig there are no control statements such as &#8220;if&#8221; and &#8220;else&#8221;.<\/li>\n<li>Pig Latin allows pipeline developers to decide where to checkpoint data in the pipeline, and you can store data at any point during a pipeline. Most RDBMSs have limited or no pipeline support. 
SQL is oriented around queries that produce a single result.<a href=\"http:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/apache_hadoop_ecosystem\/\" rel=\"attachment wp-att-1960\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-1960\" src=\"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg\" alt=\"Apache_Hadoop_Ecosystem\" width=\"690\" height=\"350\" srcset=\"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg 690w, http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem-300x152.jpg 300w, http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem-500x253.jpg 500w\" sizes=\"auto, (max-width: 690px) 100vw, 690px\" \/><\/a><\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>My learning on the basic concepts of Hadoop continues. Pig has 2 basic elements: Pig Latin, it&#8217;s a data flow language used by programmers to write pig programs Pig Latin compiler: converts pig latin code into executable code. Executable code is in form of MapReduce jobs or it can spawn a process where virtual Hadoop &#8230; <a title=\"Hadoop, Pig\" class=\"read-more\" href=\"http:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/\" aria-label=\"Read more about Hadoop, Pig\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[65],"tags":[],"class_list":["post-1949","post","type-post","status-publish","format-standard","hentry","category-hadoop"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Hadoop, Pig -<\/title>\n<meta name=\"description\" content=\"My learning on the basic concepts of Hadoop continues. 
Pig has 2 basic elements: Pig Latin, it&#039;s a data flow language used by programmers to write pig - joapen projects\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Hadoop, Pig -\" \/>\n<meta property=\"og:description\" content=\"My learning on the basic concepts of Hadoop continues. Pig has 2 basic elements: Pig Latin, it&#039;s a data flow language used by programmers to write pig - joapen projects\" \/>\n<meta property=\"og:url\" content=\"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/\" \/>\n<meta property=\"og:site_name\" content=\"joapen projects\" \/>\n<meta property=\"article:published_time\" content=\"2015-01-05T10:02:47+00:00\" \/>\n<meta property=\"og:image\" content=\"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg\" \/>\n<meta name=\"author\" content=\"joapen\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"joapen\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/\"},\"author\":{\"name\":\"joapen\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#\\\/schema\\\/person\\\/23919df2312175fe9c4609203595b217\"},\"headline\":\"Hadoop, Pig\",\"datePublished\":\"2015-01-05T10:02:47+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/\"},\"wordCount\":232,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#\\\/schema\\\/person\\\/23919df2312175fe9c4609203595b217\"},\"image\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#primaryimage\"},\"thumbnailUrl\":\"http:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2015\\\/01\\\/Apache_Hadoop_Ecosystem.jpg\",\"articleSection\":[\"Hadoop\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/\",\"url\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/\",\"name\":\"Hadoop, Pig 
-\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#primaryimage\"},\"thumbnailUrl\":\"http:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2015\\\/01\\\/Apache_Hadoop_Ecosystem.jpg\",\"datePublished\":\"2015-01-05T10:02:47+00:00\",\"description\":\"My learning on the basic concepts of Hadoop continues. Pig has 2 basic elements: Pig Latin, it's a data flow language used by programmers to write pig - joapen projects\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#primaryimage\",\"url\":\"http:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2015\\\/01\\\/Apache_Hadoop_Ecosystem.jpg\",\"contentUrl\":\"http:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2015\\\/01\\\/Apache_Hadoop_Ecosystem.jpg\",\"width\":690,\"height\":350},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/2015\\\/01\\\/05\\\/hadoop-pig\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/joapen.com\\\/blog\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Hadoop, Pig\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/joapen.com\\\/blog\\\/\",\"name\":\"joapen projects\",\"description\":\"Just a place to 
write\",\"publisher\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#\\\/schema\\\/person\\\/23919df2312175fe9c4609203595b217\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/joapen.com\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/#\\\/schema\\\/person\\\/23919df2312175fe9c4609203595b217\",\"name\":\"joapen\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/joapen-mini.jpeg\",\"url\":\"https:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/joapen-mini.jpeg\",\"contentUrl\":\"https:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/joapen-mini.jpeg\",\"width\":400,\"height\":400,\"caption\":\"joapen\"},\"logo\":{\"@id\":\"https:\\\/\\\/joapen.com\\\/blog\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/joapen-mini.jpeg\"},\"sameAs\":[\"http:\\\/\\\/www.joapen.com\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Hadoop, Pig -","description":"My learning on the basic concepts of Hadoop continues. Pig has 2 basic elements: Pig Latin, it's a data flow language used by programmers to write pig - joapen projects","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/","og_locale":"en_US","og_type":"article","og_title":"Hadoop, Pig -","og_description":"My learning on the basic concepts of Hadoop continues. 
Pig has 2 basic elements: Pig Latin, it's a data flow language used by programmers to write pig - joapen projects","og_url":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/","og_site_name":"joapen projects","article_published_time":"2015-01-05T10:02:47+00:00","og_image":[{"url":"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg","type":"","width":"","height":""}],"author":"joapen","twitter_misc":{"Written by":"joapen","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#article","isPartOf":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/"},"author":{"name":"joapen","@id":"https:\/\/joapen.com\/blog\/#\/schema\/person\/23919df2312175fe9c4609203595b217"},"headline":"Hadoop, Pig","datePublished":"2015-01-05T10:02:47+00:00","mainEntityOfPage":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/"},"wordCount":232,"commentCount":0,"publisher":{"@id":"https:\/\/joapen.com\/blog\/#\/schema\/person\/23919df2312175fe9c4609203595b217"},"image":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#primaryimage"},"thumbnailUrl":"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg","articleSection":["Hadoop"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/","url":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/","name":"Hadoop, Pig 
-","isPartOf":{"@id":"https:\/\/joapen.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#primaryimage"},"image":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#primaryimage"},"thumbnailUrl":"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg","datePublished":"2015-01-05T10:02:47+00:00","description":"My learning on the basic concepts of Hadoop continues. Pig has 2 basic elements: Pig Latin, it's a data flow language used by programmers to write pig - joapen projects","breadcrumb":{"@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#primaryimage","url":"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg","contentUrl":"http:\/\/joapen.com\/blog\/wp-content\/uploads\/2015\/01\/Apache_Hadoop_Ecosystem.jpg","width":690,"height":350},{"@type":"BreadcrumbList","@id":"https:\/\/joapen.com\/blog\/2015\/01\/05\/hadoop-pig\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/joapen.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Hadoop, Pig"}]},{"@type":"WebSite","@id":"https:\/\/joapen.com\/blog\/#website","url":"https:\/\/joapen.com\/blog\/","name":"joapen projects","description":"Just a place to 
write","publisher":{"@id":"https:\/\/joapen.com\/blog\/#\/schema\/person\/23919df2312175fe9c4609203595b217"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/joapen.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/joapen.com\/blog\/#\/schema\/person\/23919df2312175fe9c4609203595b217","name":"joapen","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/joapen.com\/blog\/wp-content\/uploads\/2021\/04\/joapen-mini.jpeg","url":"https:\/\/joapen.com\/blog\/wp-content\/uploads\/2021\/04\/joapen-mini.jpeg","contentUrl":"https:\/\/joapen.com\/blog\/wp-content\/uploads\/2021\/04\/joapen-mini.jpeg","width":400,"height":400,"caption":"joapen"},"logo":{"@id":"https:\/\/joapen.com\/blog\/wp-content\/uploads\/2021\/04\/joapen-mini.jpeg"},"sameAs":["http:\/\/www.joapen.com"]}]}},"_links":{"self":[{"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/posts\/1949","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/comments?post=1949"}],"version-history":[{"count":3,"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/posts\/1949\/revisions"}],"predecessor-version":[{"id":1962,"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/posts\/1949\/revisions\/1962"}],"wp:attachment":[{"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/media?parent=1949"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/joapen.com\/blog\/wp-json\/wp\/v2\/categories?post=1949"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/joapen.com\/blog\/
wp-json\/wp\/v2\/tags?post=1949"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}