txtkit – Visual Text Mining

txtkit – Visual Text Mining Tool is a networked application for Mac OS X 10.3 that lets you explore and interact with texts through a command-line interface and visualizations. The visualizations, if I understand them, weave the text and user behaviour together with information about other users. It produces some of the most beautiful visualizations I have seen in a while.

See also their Related Links.

Principal Component Analysis Online

At the Centre for Literary and Linguistic Computing (sounds like the title of a journal 🙂) they have mounted a web-accessible apparatus for computational-stylistics exploration. See PCA Online Introduction | CLLC where you can launch an applet that will work with Shakespeare texts.

This is a good example of the sophisticated tools available online. What they need is a model that allows us to use it with our own texts. I am also not sure about the step-by-step interface where you march through pages of decisions. (There is probably no easy way to make it direct manipulation.)

Network Fragments, Analysis of E-Mail

One emerging form of structured text analysis is the analysis of large corpora of e-mail. See Social Network Fragments and InFlow and Email Datamining. Both projects create visualizations of networks much as Steve Ramsay does with StageGraph.
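To make the idea concrete, here is a minimal sketch of the kind of data such network visualizations start from: a count of who wrote to whom. It is not how either project works; the function name `email_edges` and the sample messages are hypothetical, and it uses only Python's standard `email` package.

```python
from collections import Counter
from email import message_from_string
from email.utils import getaddresses

# Hypothetical sample messages; a real corpus would be read from a mail store.
SAMPLE = [
    "From: Ann <ann@example.org>\nTo: bob@example.org\nSubject: Draft\n\nHere is the draft.\n",
    "From: ann@example.org\nTo: Bob <bob@example.org>, cy@example.org\n"
    "Subject: Re: Draft\n\nComments attached.\n",
]

def email_edges(raw_messages):
    """Count (sender, recipient) pairs across raw message strings."""
    edges = Counter()
    for raw in raw_messages:
        msg = message_from_string(raw)
        # getaddresses() turns header values into (display name, address) pairs.
        senders = [addr.lower() for _, addr in getaddresses(msg.get_all("From", []))]
        recipient_headers = msg.get_all("To", []) + msg.get_all("Cc", [])
        recipients = [addr.lower() for _, addr in getaddresses(recipient_headers)]
        for sender in senders:
            for recipient in recipients:
                if sender and recipient:
                    edges[(sender, recipient)] += 1
    return edges

if __name__ == "__main__":
    for (sender, recipient), count in sorted(email_edges(SAMPLE).items()):
        print(f"{sender} -> {recipient}: {count}")
```

The weighted pairs could then be handed to whatever graph-drawing tool one prefers; the layout and visualization are the hard part, and this sketch does not attempt them.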

An interesting question for TAPoR is whether we can build an aggregator that assembles a corpus from e-mail for use by other tools.
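As a rough sketch of what such an aggregator might do, assuming the messages are available as raw text: parse each one, keep a little metadata, and extract the plain-text body as a document. The function name `build_corpus`, the record fields, and the sample messages are all hypothetical; nothing here reflects an actual TAPoR interface.

```python
from email import message_from_string, policy

# Hypothetical sample messages standing in for a real mailbox.
SAMPLE = [
    "From: ann@example.org\nTo: bob@example.org\nSubject: Draft\n\nHere is the draft.\n",
    "From: bob@example.org\nTo: ann@example.org\nSubject: Re: Draft\n\nComments attached.\n",
]

def build_corpus(raw_messages):
    """Turn raw e-mail messages into a list of simple document records."""
    corpus = []
    for index, raw in enumerate(raw_messages):
        # policy.default yields EmailMessage objects, which provide get_body().
        msg = message_from_string(raw, policy=policy.default)
        body = msg.get_body(preferencelist=("plain",))
        if body is None:
            continue  # skip messages with no plain-text part
        corpus.append({
            "id": index,
            "sender": str(msg["From"]),
            "subject": str(msg["Subject"]),
            "text": body.get_content().strip(),
        })
    return corpus

if __name__ == "__main__":
    for doc in build_corpus(SAMPLE):
        print(doc["id"], doc["subject"], "|", doc["text"])
```

Records like these could be serialized to plain text or XML for other tools; which format would suit them is an open question this sketch leaves alone.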