{"id":222,"date":"2009-06-12T16:23:16","date_gmt":"2009-06-12T21:23:16","guid":{"rendered":"http:\/\/blog.espol.edu.ec\/hadoop\/?p=222"},"modified":"2009-06-12T16:24:01","modified_gmt":"2009-06-12T21:24:01","slug":"mas-data-sets-de-la-wikipedia","status":"publish","type":"post","link":"https:\/\/blog.espol.edu.ec\/hadoop\/2009\/06\/12\/mas-data-sets-de-la-wikipedia\/","title":{"rendered":"More Wikipedia data sets"},"content":{"rendered":"<p>A post on the <a href=\"http:\/\/www.datawrangling.com\/wikipedia-page-traffic-statistics-dataset\">Data Wrangling<\/a> blog describes three Wikipedia data sets: the already well-known raw dump, one containing page-view frequency statistics for Wikipedia pages over 7 months (which has already been uploaded to AWS), and one with the list of links from pages to other pages.<\/p>\n<p>I invite you to send me ideas for interesting uses of these data sets.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A post on the Data Wrangling blog describes three Wikipedia data sets: the already well-known raw dump, one containing page-view frequency statistics for Wikipedia pages over 7 months (which has already been uploaded to AWS), and one with the list of links from
[&hellip;]<\/p>\n","protected":false},"author":1510,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[945,6,1465],"tags":[6047,110],"class_list":["post-222","post","type-post","status-publish","format-standard","hentry","category-desarrollo","category-espol","category-investigacion","tag-aws","tag-wikipedia"],"_links":{"self":[{"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/posts\/222","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/users\/1510"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/comments?post=222"}],"version-history":[{"count":2,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/posts\/222\/revisions"}],"predecessor-version":[{"id":224,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/posts\/222\/revisions\/224"}],"wp:attachment":[{"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/media?parent=222"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/categories?post=222"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.espol.edu.ec\/hadoop\/wp-json\/wp\/v2\/tags?post=222"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}