{"id":916,"date":"2005-05-30T00:18:28","date_gmt":"2005-05-30T04:18:28","guid":{"rendered":"http:\/\/www.theoreti.ca\/?p=916"},"modified":"2005-05-30T00:18:28","modified_gmt":"2005-05-30T04:18:28","slug":"text-analysis-of-e-mail","status":"publish","type":"post","link":"https:\/\/theoreti.ca\/?p=916","title":{"rendered":"Text Analysis of E-Mail"},"content":{"rendered":"<p><a title=\"scribblings &amp; musings @ sgs online\" href=\"http:\/\/www.humanities.mcmaster.ca\/~sgs\/index.php?p=61\">St\u00e9fan Sinclair<\/a> has blogged an interesting story from the New York Times on how <a title=\"Enron Offers an Unlikely Boost to E-Mail Surveillance - New York Times\" href=\"http:\/\/www.nytimes.com\/2005\/05\/22\/weekinreview\/22kola.html?ei=5070&amp;en=27fef3a4a12df3ad&amp;ex=1117598400&amp;adxnnl=1&amp;oref=login&amp;adxnnlx=1117426125-SjWiImXotLi3LnesKLuUIg\">Enron Offers an Unlikely Boost to E-Mail Surveillance<\/a>. Researchers, including Dr. Skillicorn at Queen&#8217;s, are using a large collection of Enron e-mail posted by the Federal Energy Regulatory Commission to experiment with e-mail tracking and analysis. A large corpus like the Enron one (over a million messages) can be used as a testbed for social network analysis or diachronic trend analysis. The article also talks about fears that government Echelon-style surveillance of e-mail may become available to corporate intelligence types. I wonder if we can develop useful text analysis tools optimized for e-mail collections, such as a dialogue of messages on a subject or the Humanist archives. Something for <a href=\"http:\/\/taporware.mcmaster.ca\">TAPoRware<\/a>. <\/p>\n<p>Scientists had long theorized that tracking the e-mailing and word usage patterns within a group over time &#8211; without ever actually reading a single e-mail &#8211; could reveal a lot about what that group was up to. The Enron material gave Mr. 
Skillicorn&#8217;s group and a handful of others a chance to test that theory, by seeing, first of all, if they could spot sudden changes.<\/p>\n<p>For example, would they be able to find the moment when someone&#8217;s memos, which were routinely read by a long list of people who never responded, suddenly began generating private responses from some recipients? Could they spot when a new person entered a communications chain, or if old ones were suddenly shut out, and correlate it with something significant?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>St\u00e9fan Sinclair has blogged an interesting story from the New York Times on how Enron Offers an Unlikely Boost to E-Mail Surveillance. Researchers, including Dr. Skillicorn at Queen&#8217;s, are using a large collection of Enron e-mail posted by the Federal Energy Regulatory Commission to experiment with e-mail tracking and analysis. A large corpus like the &hellip; <a href=\"https:\/\/theoreti.ca\/?p=916\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Text Analysis of 
E-Mail<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[16],"tags":[],"class_list":["post-916","post","type-post","status-publish","format-standard","hentry","category-text-analysis"],"_links":{"self":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts\/916","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=916"}],"version-history":[{"count":0,"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts\/916\/revisions"}],"wp:attachment":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=916"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=916"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=916"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}