{"id":592,"date":"2007-09-10T00:56:00","date_gmt":"2007-09-10T06:56:00","guid":{"rendered":"http:\/\/adrianmonck.com\/?p=592"},"modified":"2007-09-10T00:56:00","modified_gmt":"2007-09-10T06:56:00","slug":"the-new-journalism","status":"publish","type":"post","link":"https:\/\/adrianmonck.com\/about\/2007\/09\/the-new-journalism\/","title":{"rendered":"The new journalism?"},"content":{"rendered":"<p><span class=\"dropcaps\">A<\/span> lot of messianic stuff is written about what the <span style=\"font-weight: bold;\">future of journalism<\/span> looks like, but new site <a href=\"http:\/\/wikileaks.org\/\" target=\"_blank\"><span style=\"font-weight: bold;\">Wikileaks<\/span><\/a> has a pre-launch taster of the kind of materials and treatment that could underpin it.<span id=\"fullpost\"><\/p>\n<p>It has a leak of what it claims is a <a href=\"http:\/\/wikileaks.org\/wiki\/Afghanistan_OEF_Property_List.xls\" target=\"_blank\">list<\/a> of <a style=\"font-weight: bold;\" href=\"http:\/\/wikileaks.org\/wiki\/US_Military_Equipment_in_Afghanistan\" target=\"_blank\">US Military Equipment in Afghanistan<\/a>. Wikileaks applies a lot of <span style=\"font-weight: bold;\">Computer Assisted Reporting<\/span> techniques to analysing the data. Here is how they did it:<\/p>\n<blockquote>\n<p>The analysis proceeded as follows: <\/p>\n<ol>\n<li> Understand the abbreviations, acronyms numbers and other nomenclature in the leak (specifically <a href=\"http:\/\/wikileaks.org\/wiki\/NSN\" title=\"NSN\">NSN<\/a>, <a href=\"http:\/\/wikileaks.org\/wiki\/index.php?title=LIN&#038;action=edit\" class=\"new\" title=\"LIN\">LIN<\/a>, <a href=\"http:\/\/wikileaks.org\/wiki\/UIC\" title=\"UIC\">UIC<\/a>) using publicly available source information. The results of which have been documented in <a href=\"http:\/\/wikileaks.org\/wiki\/US_Military_Logistics\" title=\"US Military Logistics\">US Military Logistics<\/a> and elsewhere.  <\/li>\n<li> Discover various public <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Stock_Number\" title=\"NATO Stock Number\">NATO Stock Number<\/a> catalogues. Confirm the the validity of random samples of the leak using these databases and other deployment references. <\/li>\n<li> By hand create tallies for a few interesting items observed by inspection. Write up an initial draft of the high-level analysis. <\/li>\n<li> Learn <a href=\"http:\/\/pythong.org\/\" class=\"external text\" title=\"http:\/\/pythong.org\" rel=\"nofollow\">Python<\/a>. Using <a href=\"http:\/\/vim.org\/\" class=\"external text\" title=\"http:\/\/vim.org\" rel=\"nofollow\">vim<\/a> macros, <a href=\"http:\/\/perl.org\/\" class=\"external text\" title=\"http:\/\/perl.org\" rel=\"nofollow\">perl<\/a> and a couple of Python programs, put the material into more presentable form, i.e <a href=\"http:\/\/wikileaks.org\/wiki\/Afghanistan_OEF_Property_List\" title=\"Afghanistan OEF Property List\">Afghanistan OEF Property List<\/a> and <a href=\"http:\/\/wikileaks.org\/wiki\/Afghanistan_OEF_Property_List.html\" title=\"Afghanistan OEF Property List.html\">Afghanistan OEF Property List.html<\/a>. <\/li>\n<li> Write additional code to split out the <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Group\" title=\"NATO Supply Group\">NATO Supply Group<\/a> and <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Classification\" title=\"NATO Supply Classification\">NATO Supply Classification<\/a> from the <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Stock_Number\" title=\"NATO Stock Number\">NATO Stock Number<\/a> (NSN) <\/li>\n<li> Obtain a list of <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Group\" title=\"NATO Supply Group\">NATO Supply Group<\/a> and <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Classification\" title=\"NATO Supply Classification\">NATO Supply Classification<\/a> codes from public US Military logistics sources <\/li>\n<li> Learn <a href=\"http:\/\/wikipedia.org\/wiki\/Structured_Query_Language\" class=\"extiw\" title=\"wikipedia:Structured_Query_Language\">Structured Query Language<\/a> and install a database program. <\/li>\n<li> Pull the original leak, the group and classification code tables into a <a href=\"http:\/\/wikipedia.org\/wiki\/SQL\" class=\"extiw\" title=\"wikipedia:SQL\">SQL<\/a> database, in this case, <a href=\"http:\/\/wikipedia.org\/wiki\/sqlite\" class=\"extiw\" title=\"wikipedia:sqlite\">sqlite<\/a>, but any SQL database would have sufficed. <\/li>\n<li> Experiment with SQL. Merge in <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Classification\" title=\"NATO Supply Classification\">NATO Supply Classifications<\/a> into the main leak for extra context and generate <a href=\"http:\/\/wikileaks.org\/wiki\/Afghanistan_OEF_Property_List-extended.html\" title=\"Afghanistan OEF Property List-extended.html\">Afghanistan OEF Property List-extended.html<\/a>. <\/li>\n<li> Experiment with SQL and discover how to generate several different tallies for the leaked items; by <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Group\" title=\"NATO Supply Group\">NATO Supply Group<\/a>, <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Supply_Classification\" title=\"NATO Supply Classification\">NATO Supply Classification<\/a> and <a href=\"http:\/\/wikileaks.org\/wiki\/NATO_Stock_Number\" title=\"NATO Stock Number\">NATO Stock Number<\/a>. Convert to HTML and place into the <a href=\"http:\/\/wikileaks.org\/wiki\/US_Military_Equipment_in_Afghanistan#Appendix\" title=\"\"> Appendix <\/a> . <\/li>\n<li> Using SQL, generate a unique list of <a href=\"http:\/\/wikileaks.org\/wiki\/NSN\" title=\"NSN\">NSNs<\/a>. Write a program to concurrently query the US Logistics web-query NSN search for pricing information and extract the price for every <a href=\"http:\/\/wikileaks.org\/wiki\/NSN\" title=\"NSN\">NSN<\/a> on the list (except alphanumerical NSN&#8217;s which are not listed, probably due to being <a href=\"http:\/\/wikileaks.org\/wiki\/Management_Control_Number\" title=\"Management Control Number\">Management Control Numbers<\/a>). <\/li>\n<li> Pull in the pricing information to the SQL database. <\/li>\n<li> Using SQL, generate a new tally by <a href=\"http:\/\/wikileaks.org\/wiki\/NSN\" title=\"NSN\">NSN<\/a>, join this together with the pricing information for each NSN, sort by total price, convert to HTML and place it into the <a href=\"http:\/\/wikileaks.org\/wiki\/US_Military_Equipment_in_Afghanistan#Appendix\" title=\"\"> Appendix <\/a> . <\/li>\n<li> Using SQL calculate the total value of <b>all<\/b> equipment for which we have prices. <\/li>\n<li> By inspection extract additional features of interest &#8211; <a href=\"http:\/\/wikileaks.org\/wiki\/US_Military_Equipment_in_Afghanistan#Notable_Units\" title=\"\"> Notable Units <\/a> , and items. <\/li>\n<\/ol>\n<\/blockquote>\n<p>If you want to help, they have a list of tasks to move things on&#8230;<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A lot of messianic stuff is written about what the future of journalism looks like, but new site Wikileaks has a pre-launch taster of the kind of materials and treatment that could underpin it. It has a leak of what it claims is a list of US Military Equipment in Afghanistan. Wikileaks applies a lot [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"nf_dc_page":"","footnotes":""},"categories":[7,50],"tags":[],"class_list":["post-592","post","type-post","status-publish","format-standard","hentry","category-journalism","category-journalists"],"_links":{"self":[{"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/posts\/592","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/comments?post=592"}],"version-history":[{"count":0,"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/posts\/592\/revisions"}],"wp:attachment":[{"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/media?parent=592"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/categories?post=592"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/adrianmonck.com\/about\/wp-json\/wp\/v2\/tags?post=592"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}