Visualization – Theoreti.ca

Constellate Sunset

The neat ITHAKA Constellate project is being shut down. It sounds like it was not financially sustainable.

As of November 2024, ITHAKA made the decision to sunset Constellate on July 1, 2025. While we’re proud of the meaningful impact Constellate has had on individuals and institutions, helping advance computational literacy and text analysis skills across academia, we have concluded that continuing to support the platform and classes is not sustainable for ITHAKA in the long term. As a nonprofit organization, we need to focus our resources on initiatives that can achieve broad-scale impact aligned with our mission. Despite Constellate’s success with its participating institutions, we haven’t found a path to achieve this broader impact.

It sounds like this sort of analytical support is best provided in universities through courses, workshops, etc. Constellate developed cool notebooks (available on GitHub), courses built on the notebooks, and webinar recordings.

South Korea faces deepfake porn ‘emergency’

The president has addressed the growing epidemic after Telegram users were found exchanging doctored photos of underage girls.

Once again, deepfake porn is in the news as South Korea faces deepfake porn ‘emergency’. Teenagers have been posting deepfake porn images of people they know, including minors, on sites like Telegram.

South Korean President Yoon Suk Yeol on Tuesday instructed authorities to “thoroughly investigate and address these digital sex crimes to eradicate them”.

This has gone beyond a rare case in Spain or Winnipeg. In South Korea it has spread to hundreds of schools. Porn is proving to be a major use of AI.

DH 2024: Visualization Ethics and Text Analysis Infrastructure

This week I’m at DH 2024 at George Mason in Washington DC. I presented as part of two sessions.

On Wednesday I presented a short paper with Lauren Klein on work a group of us are doing on Visualization Ethics: A Case Study Approach. We met at a Dagstuhl seminar on Visualization and the Humanities: Towards a Shared Research Agenda. There we developed case studies for teaching visualization ethics, and that’s what our short presentation was about. The link above is to a Google Drive with drafts of our cases.

Thursday morning I was part of a panel on Text Analysis Tools and Infrastructure in 2024 and Beyond. (The link, again, takes you to a web page where you can download the short papers we wrote for this “flipped” session.) This panel brought together a bunch of text analysis projects like WordCruncher and Lexos to talk about how we can maintain and evolve our infrastructure.

Musée d’Orsay’s Van Gogh Exhibition Breaks Historic Attendance Record

The Musée d’Orsay set a record attendance of 793,556 visitors to its exhibition ‘Van Gogh in Auvers-sur-Oise’.

ARTnews has a story about how the Musée d’Orsay’s Van Gogh Exhibition Breaks Historic Attendance Record. The exhibit included a virtual reality component (Virtual Reality – Van Gogh’s Palette) where visitors could put on a headset and interact with the palette of Vincent van Gogh. You can see a 360-degree video of the experience here in French. It takes place in the room of Dr. Gachet, who treated van Gogh. It starts with the piano at which his daughter Marguerite posed for a painting. Her character also narrates. Then you zoom in on a 3D rendered version of his palette where you hear about some of the paintings he did in the last 70 days of his life. They emerge from the palette.

It isn’t clear if the success of the show is due to the VR component or just the chance to see originals. We can only experience the 360 video which has limited interactivity. That said, I don’t find the video of the VR experience convincing. It is a creative documentary and it is hard to see how being immersed would make much of a difference. Was it just a gimmick to get more people to come to the show?

Group hopes to resurrect 128-year-old Cyclorama of Jerusalem, near Quebec City

MONTREAL — The last cyclorama in Canada has been hidden from public view since it closed in 2018, but a small group of people are hoping to revive the unique…

Good News! A Group hopes to resurrect 128-year-old Cyclorama of Jerusalem, near Quebec City. The Cyclorama of Jerusalem is the last/only cyclorama still standing in Canada. I visited and blogged about it back in 2004. Then it closed and now they are trying to restore it and sell it.

Cycloramas are the virtual reality of the 19th century. Long paintings, sometimes with props, were mounted in the round in special buildings that allowed people to feel immersed in a painted space. These remind us of the variety of types of media that have been surpassed: the forgotten types of media.

DH 2023 Graz | Austria

I’m at DH 2023 in Graz, Austria and keeping my live notes here on Philosophi.ca.

The weather is hot and humid, at least in comparison to Edmonton, but the town is lovely. There is a lot of green space and a good tram system.

The conference is not at the university, but in a conference centre that is, thankfully, air conditioned.

The Alt-Right Manipulated My Comic. Then A.I. Claimed It.

AI generated comic in style of Sarah Andersen

My drawings are a reflection of my soul. What happens when artificial intelligence — and anyone with access to it — can replicate them?

Webcomic artist Sarah Andersen has written a timely Opinion for the New York Times on how The Alt-Right Manipulated My Comic. Then A.I. Claimed It. She talks about being harassed by the Alt-Right who created a shadow version of her work full of violent, racist and Nazi motifs. Now she could be haunted by an AI-generated shadow like the image above. Her essay nicely captures the feeling of helplessness that many artists who survive on their work must be feeling before the “research” trick of LAION, the nonprofit arm of Stability AI that scraped copyrighted material under the cover of academic research and then made it available for commercialization as Stable Diffusion.

Andersen links to a useful article on AI Data Laundering which is a good term for what researchers seem to be doing intentionally or not. What is the solution? Datasets gathered with consent? Alas too many of us, including myself, have released images on Flickr and other sites. So, as the article author Andy Baio puts it, “Asking for permission slows technological progress, but it’s hard to take back something you’ve unconditionally released into the world.”

While artists like Andersen may have no legal recourse, that doesn’t make it ethical. Perhaps the academics who are doing the laundering should be called out. Perhaps we should consider boycotting such tools and hiring live artists when we have graphic design work.

Issues around AI text to art generators

A new art-generating AI system called Stable Diffusion can create convincing deepfakes, including of celebrities.

TechCrunch has a nice discussion of Deepfakes for all: Uncensored AI art model prompts ethics questions. The relatively sudden availability of AI text to art generators has provoked discussion on the ethics of creation and of large machine learning models. Here are some interesting links:

Ars Technica has an article on how Artists begin selling AI-generated artwork on stock photography websites. I note that MidJourney generated images all seem to have a similar style. We may find it becomes more and more identifiable like some smell in the background.
Ars Technica has another article on various projects to be able to see what original images might have been used in training AIs like MidJourney. Have AI image generators assimilated your art? New tool lets you check. The provenance of some of the training sets is documented here. It remains to be seen what you can do if your images have been used.
And of course there are art groups that are banning AI generated art: Flooded with AI-generated images, some art communities ban them completely. This raises the question of whether one can tell the difference.

It is worth identifying some of the potential issues:

These art generating AIs may have violated copyright in scraping millions of images. Could artists whose work has been exploited sue for compensation?
The AIs are black boxes that are hard to query. You can’t tell if copyrighted images were used.
These AIs could change the economics of illustration. People who used to commission and pay for custom art for things like magazines, book covers, and posters, could start just using these AIs to save money. Just as Flickr changed the economics of photography, MidJourney could put commercial illustrators out of work.
We could see a lot more “original” art in situations where before people could not afford it. Perhaps poster stores could offer to generate a custom image for you and print it. Get your portrait done as a cyberpunk astronaut.
The AIs could reinforce visual bias in our visual literacy. Systems that always see Philosophers as old white guys with beards could limit our imagination of what could be.
These could be used to create pornographic deepfakes with people’s faces on them or other toxic imagery.

Zampolli Prize Awarded to Voyant Tools

Spyral Notebook Detail (showing code cell and stacked graphs)

Yesterday I gave the triennial Zampolli Prize lecture that honoured Voyant. The lecture is given at the annual ADHO Digital Humanities conference which this year is being hosted by the University of Tokyo. The award notice is here: Zampolli Prize Awarded to Voyant Tools. Some of the things I touched on in the talk included:

The genius of Stéfan Sinclair who passed in August 2020. Voyant was his vision from the time of his dissertation for which he developed HyperPo.
The global team of people involved in Voyant including many graduate research assistants at the U of Alberta. See the About page of Voyant.
How Voyant built on ideas Stéfan and I developed in Hermeneutica about collaborative research as opposed to the inherited solitary paradigm.
How we have now developed an extension to Voyant called Spyral. Spyral is a notebook programming environment built on JavaScript. It allows you to document your Voyant explorations, save parameters for corpora and tools, preprocess texts, postprocess results, and create new visualizations. It is, in short, a full data analysis and visualization environment built into Voyant so you can easily call up and explore results in Voyant’s already rich tool set.
In the image above you can see a Spyral code cell that outputs two stacked graphs where the same pattern of words is graphed over two different, but synchronized, corpora. You can thus compare the use of the pattern over time between the two datasets.
Replication as a practice for recovering an understanding of innovative technologies now taken for granted like tokenization or the KWIC. I talked about how Stéfan and I have been replicating important text processing technologies as a way of understanding the history of computing and the digital humanities. Spyral was the environment we developed for documenting our replications.
I then backed up and talked about the epistemological questions about knowledge and knowledge things in the digital age that grew out of and then inspired our experiments in replication. These go back to attempts to think through tools as knowledge things that bear knowledge in ways that discourse doesn’t. In this context I talked about the DIKW pyramid (data, information, knowledge, wisdom) that captures current views about the relationships between data and knowledge.
Finally I called for help to maintain and extend Voyant/Spyral. I announced the creation of a consortium to bring us together to sustain Voyant.
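
For readers who have not met the technologies named in the replication point above, here is a minimal sketch of tokenization and a KWIC (keyword in context) display. It is written in Python for illustration only; it is not Spyral code (Spyral is built on JavaScript) and not how Voyant implements either step. The function names `tokenize` and `kwic`, the `width` parameter, and the sample sentence are all made up for this example.

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (letters, digits, apostrophes).

    This is one simple tokenization rule among many; real tools make
    different choices about hyphens, punctuation, and case.
    """
    return re.findall(r"[a-z0-9']+", text.lower())


def kwic(tokens, keyword, width=3):
    """Return a (left context, keyword, right context) tuple for each hit."""
    keyword = keyword.lower()
    rows = []
    for i, token in enumerate(tokens):
        if token == keyword:
            # Take up to `width` tokens on either side of the hit.
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            rows.append((left, token, right))
    return rows


# Invented sample text, not drawn from any corpus.
sample = "The whale surfaced. We watched the whale dive, and the sea closed over it."

for left, key, right in kwic(tokenize(sample), "whale", width=2):
    # Right-align the left context so the keywords line up in a column.
    print(f"{left:>20} | {key} | {right}")
```

Lining the keyword up in a central column is what makes a concordance scannable: the eye can run down the hits and compare what comes before and after each one.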

It was an honour to be able to give the Zampolli lecture on behalf of all the people who have made Voyant such a useful tool.

Zampolli Prize Awarded to Voyant Tools

I’m immensely proud to write that the Zampolli Prize Awarded to Voyant Tools. The Zampolli prize is one of the most prestigious in my field. I’m proud to have been part of the team that developed and sustained Voyant. Alas, Stéfan Sinclair, its genius, is not with us to share this.