Digital Classics Symposium in Buffalo

I am heading home the day after giving the closing remarks at a conference in Buffalo on Word, Space, Time: Digital Perspectives on the Classical World. This is the first conference of the new Digital Classics Association. It was a gem of a conference where I learned about a succession of neat projects. Here are some notes. My laptop ran out of juice at times so I was not able to take notes on everything.

  • Greg Crane gave the opening keynote announcing his new Humbolt appointment and what he is gong to try to do there. He announced that he wanted to: 1) Advance the role of Greco-Roman culture and Classical Greek and Latin in human intellectual life as broadly and as deeply as possible in a global world. And 2) To blow the dust off the simple, cogent and ancient term philology and to support an open philology that can, in turn, support a dialogue among civilizations. He talked about the history and importance of philology and then announced the Open Philology Project. This project has as its goals:
    • Open greek and latin texts (the TLG is not open)
    • Comprehensive open data about the classical world
    • Multitext digital editions
    • Annotations
    • Deep linguistic annotation
    • Full workflow through true digital edition

    This is a worthy and ambitious vision and I tried to remind people of it at the end. Classics is the right size and has the right interdiscplinarity to be able to model a comprehensive system.

  • Crane talked about Alpheios, a text editing and learning system that Perseus is connecting to. Monica Berti showed her work on fragmenta in Alpheios and I later learned that this is a philanthropically funded project. Berti’s demo of how she is handling fragmenta is at http://services.perseus.tufts.edu/berti_demo/
  • Marco Büchler gave a tantalizing paper on “Using Google PageRank to detect text reuse”. His was not the only text reuse project – it is technique that is important to classicists who study how classical authors have been quoted, alluded to, and reused over time. Büchler’s software is TRACER which will be available once he has some documentation. I think the idea of using a PageRank to sort hits is a great idea and would love to play with his tools. He encouraged interested parties to join a Google group on text reuse.
  • Walter Scheidel showed the Orbis system in a paper on “Redrawing the map of the Roman world.” Orbis is a brilliant tool for measuring time and cost for travel in the Roman world. It is a great example of spatial analysis.
  • Tom Elliot talked about the Pleiades project and how they have around 34,000 places registered and linked. He was initially skeptical about semantic web technologies and RDF, but is now using it in a way that shows what we can do in the humanities with this approach. I am struck by how Plieades now provides a service to all sorts of other projects. What Classics now need is similar projects for people, passages (texts), periods (events and time), and other primitives. Classics could set an example of coordinated semantic data.
  • Ryan Horne wrapped a great session on geospatial work with a presentation on “Mapping antiquity a-la-carte: a GIS interface of the ancient world”. He showed Antiquity À la carte which allows you to generate all sorts of maps of the Classical world. Great tool for teachers.
  • Kevin D. Fisher gave a fascinating presentation on “Digital approaches to ancient cities: The Kalavasos and Maroni built environments project, Cyprus.” In The Crane Project they are using all sorts of cool technology like 3D laser scanners and ground penetrating radar to map their dig in Cyprus. I liked how was using techniques to model how the environments were lived in. What could you see from where, what were the accessible rooms in buildings?
  • My favorite project of conference was Christopher Johanson’s visual argument on RomeLab: Performance on the ephemeral stage. He presented an argument about temporary stages in the Roman forum that was made through a virtual Rome that you can travel around through the browser. The argument is in a sequence points that can be opened and which will move you around the world to see what the argument is about. His paper was an example of a visual argument through RomeLab and by extension about RomeLab. Despite a technical glitch, it was an impressive performance that made its point on so many levels.
  • I attended a neat little workshop on R led by Jeff Rydberg-Cox. His learning materials are at http://daedalus.umkc.edu/StatisticalMethods/index.html and he pointed us to a neat tutorial at http://tryr.codeschool.com/.
  • At the end there was a great panel on Literary Criticism and Digital Methods. Matt Jockers presented is work on macroanalysis of 19th century literature. He had a neat word cloud visualization of topic modeling results. Patrick J. Burns was very good on “Distant reading alliteration in Latin poetry.” He was very good on walking us through his method and illustrating it with humour. Neil Bernstein talked about the Tesserae project. The Tesserae project is looking at text reuse and has neat tools online for people to see how author A gets reused in author B.

I gave the closing remarks and I tried to draw attention to the history of the vision of a perfect reading (or philology) machine. I think took advantage of being the last to offer suggestions as to how digital classics might move research forward:

  • The Digital Classics Association should take seriously Greg Cranes invitation to influence his Open Philology Project. Classics is, for various reason, in a unique position to imagine comprehensive research and learning environments.
  • They should think about primitives and how they support them. What Pleiades has done for place other should think of doing for people, periods (events and time), buildings and things, and so on. The idea would be to have a network of projects managing semantic data about the things that matter to Classicists.
  • I encouraged people to think about how to include the larger public into research using crowdsourcing and gaming.
  • I encouraged them to think about how digital research is shared and assessed. They should look at the work from the MLA on assessment and the DCA could adapt stuff for Classics.
  • Finally I talked a bit out infrastructure and the dangers of developing infrastructure prematurely. I called for infrastructure experiments.

I think the DCA will be putting up a video of my closing remarks.

The National Digital Public Library Is Launched! by Robert Darnton

Robert Darnton has written an essay about the launch of the Digital Public Library of America that everyone should read. A great writer and a historian he provides a historical context and a contemporary context. He quotes from the original mission statement to show the ambition,

“an open, distributed network of comprehensive online resources that would draw on the nation’s living heritage from libraries, universities, archives, and museums in order to educate, inform, and empower everyone in the current and future generations.”

The essay, The National Digital Public Library Is Launched! by Robert Darnton is in the New York Review of Books. A lot of it talks about what Harvard is contributing (Darnton is the University Librarian there), which is OK as it is good to see leadership.

He also mentions that Daniel Cohen is the new executive director. Bravo! Great choice!

Cupid Computers – Student Dating service 1966

Victoria led me to this 1966 form for a student computerized dating service called “Cupid Computer.” You can see the form in context in the University of Waterloo student newspaper, Coryphaeus. See 1966-67_v7,n09_Coryphaeus. Click to read and page forward and you will see the two page insert with the questions you have to answer and send in. Here are some examples.

The Cupid Computer service was apparently “run by students” and would, for $3 give you a list of 3 scientifically compatible dates. They mention using an IBM computer and that the Computronics Company is “The Leader by far in Canadian Computer Dating Systems”. The PDF of the insert (and cover page) is here, 1966-67_v7no9_Coryphaeus.

It is worth noting that this was a student developed service. While serious university computer centers were doing other things, students were developing their own social uses of computers … and long before the web.

The Expression of Emotions in 20th Century Books

Emilie pointed me to an NPR strory on mining mood in 20th century books, Mining Books To Map Emotions Through A Century. This story draws on a very readable article The Expression of Emotions in 20th Century Books in PLOS One. The article reports on a study of “mood” or sentiment over time in literature. The used the Google Ngram data. I like how they report first and then discuss methodology at the end.

They mention support from an interesting EU funded project TrendMiner. TrendMiner is developing real-time multi-lingual analysis tools.

Continue reading The Expression of Emotions in 20th Century Books

The Cube at QUT – world’s biggest multitouch installation

Luciano sent me this link to a stunning multipoint touch installation at Queensland University of Technology, Brisbane, The Cube at QUT – world’s biggest multitouch installation. I like how they have the touch at ground level, but the screen extends up.

I note that the Cube folk also have a Lego Education Learning Centre. I’m doubly envious.

Culture Clash: A Truly Bizarre Domino’s Pizza Commercial

From Culture Smash an interesting example of West meets Japan, A Truly Bizarre Domino’s Pizza Commercial. This video nicely captures all sorts of phenomena like:

  • The dangers of cross-cultural interactions. Sometimes it is just weird.
  • The dangers of older men (like me) trying on cool Japan.
  • Vocaloids as a phenomena.
  • Augmented reality (and pizza).

You can download the free Domino’s App here. I plan to try it soon.

Hacker Measures the Internet Illegally with Carna Botnet

Speigel Online has an interesting story about how a Hacker Measures the Internet Illegally with Carna Botnet. The anonymous hacker(s) exploited unprotected devices online to create a botnet with which they then used to take a census of those online.

So what were the actual results of the Internet census? How many IP addresses were there in 2012? “That depends on how you count,” the hacker writes. Some 450 million were “in use and reachable” during his scans. Then there were the firewalled IPs and those with reverse DNS records (which means there are domain names associated with them). In total, this equalled some 1.3 billion IP addresses in use.

Future Hype: Near Futures

I gave a lecture at Kim Solez’s course on the future of medicine and he taped it and put it up on YouTube here:

Geoffrey Rockwell FutureHype LABMP 590 2013 March 7 – YouTube.

This talk came out of a conversation we had at a pub about Ray Kurzweil where I disagreed with Kim about Kurzweil’s predictions. Thinking about Kurzweil I realized how fundamental prediction is. We call it hope. It is easy to make fun of the futurists, but we need to recognize how we always look forward to the near future.

Tool Discourse

Character Density by Year in Tool DiscourseWe are finally getting results in a long slow process of trying to study tool discourse in the digital humanities. Amy Dyrbe and Ryan Chartier are building a corpus of discourse around tools that includes tool reviews, articles about what people are doing with tools, web pages about tools and so on. We took the first coherent chunk and Ryan has been analyzing it with R. The graph above shows which years have the most characters. My hypothesis was that tool reviews and discourse dropped off in the 1990s as the web became more important. This seems to be wrong.

Here are the high-frequency words (with stop words removed). Note the modal verbs “can”, “will”, and “may.” They indicate the potentiality of tools.

“can” 2305
“one” 1996
“text” 1940
“word” 1931
“words” 1859
“program” 1606
“ii” 1514 (Not sure why)
“will” 1361
“language” 1307
“data” 1285
“two” 1188
“system” 1183
“computer” 1116
“used” 1115
“use” 942
“user” 939
“file” 890
“first” 870
“may” 853
“also” 837