The Historical Event Markup and Linking Project is a brilliant project by Bruce Robertson that defines a markup language for events in space and can then generate timelines, animated maps and interactive maps. Combines the temporal and geospatial ideas were are working on for the Globalization Compendium.
Category: Text Technology and TAPoR
Text Vectors
Most text analysis techniques present a synchronic view of the text. For example, a list of word frequencies treats the text as a whole. How can we look at change across a text? How can we quantify a text as it progresses, whether in writing or playing? Could we anticipate the sorts of words likely to be used or summarize those used before?
Continue reading Text Vectors
Choosing a Wiki
Wiki Wiki Clones is a page by the original creator of Wikis – Ward Cunningham that lists alternative wiki frameworks. (Thanks to James Chartrand for this.)
Continue reading Choosing a Wiki
Many 2 Many
Many-to-Many is a wiki on Social Software by Sebastian Paquet and friends. The design and idea parallel what we are doing with a private wiki on TAPoR.
What can we learn from it?
Continue reading Many 2 Many
txtkit – Visual Text Mining
txtkit – Visual Text Mining Tool is a Mac OS 10.3 networked application that lets you visualize and interact with texts through a command line interface and visualizations. The visualizations, if I understand them, weave the text and user behaviour together with information about other users. It produces some of the most beautiful visualizations I have seen for a while.
See also their Related Links.
Principal Component Analysis Online
At the Centre for Literary and Linguistic Computing (sounds like the title of a journal 🙂 they have mounted a web accessible apparatus to do computational-stylistics exploration. See PCA Online Introduction | CLLC where you can launch an applet that will work with Shakespeare texts.
This is a good example of sophisticated tools available online. What they need is a model that allows us to use with our own texts. I am also not sure about the step by step interface where you go march through pages of decisions. (There is probably no easy way to make it direct manipulation.)
Network Fragments, Analysis of E-Mail
One emerging form of structured text analysis is the analysis of large corpora of e-mail. See Social Network Fragments and InFlow and Email Datamining. Both projects create visualizations of networks much as Steve Ramsay does with StageGraph.
An interesting question for TAPoR is whether we can build an aggregator that can build a corpus from e-mail that could be used by other tools.
Continue reading Network Fragments, Analysis of E-Mail
Linguistica Annotation
Linguistic Annotation is a list of tools and formats for annotating and studying linguistic features. Included are things like TEI and Multitext. We need to review these tools against what we hope to have on TAPoR.
Annotating Web Pages, Survey
A Survey of Web Annotation Systems is a good short review of various systems for annotating web pages. The question I have is whether blogs are a type of web annotation that organizes around the annotation rather than the site?
RSS Web Feeds
RSS Web feeds are obviously cool, but what can we do with them?
Here is an article with some ideas: Yahoo! News – Enthusiasts Call Web Feed Next Big Thing. From the perspective of text analysis, these are feeds of raw text with which to play. TAPoR needs to think of how to provide accessible tools for these.