Raiding the inarticulate since 2010

accelerated academy acceleration agency AI Algorithmic Authoritarianism and Digital Repression archer Archive Archiving artificial intelligence automation Becoming Who We Are Between Post-Capitalism and Techno-Fascism big data blogging capitalism ChatGPT claude Cognitive Triage: Practice, Culture and Strategies Communicative Escalation and Cultural Abundance: How Do We Cope? Corporate Culture, Elites and Their Self-Understandings craft creativity critical realism data science Defensive Elites desire Digital Capitalism and Digital Social Science Digital Distraction, Personal Agency and The Reflexive Imperative Digital Elections, Party Politics and Diplomacy digital elites Digital Inequalities Digital Social Science Digital Sociology digital sociology Digital Universities elites Fragile Movements and Their Politics Cultures generative AI higher education Interested labour Lacan Listening LLMs margaret archer Organising personal morphogenesis Philosophy of Technology platform capitalism platforms populism Post-Democracy, Depoliticisation and Technocracy post-truth psychoanalysis public engagement public sociology publishing Reading realism reflexivity scholarship Shadow Mobilization, Astroturfing and Manipulation Social Media Social Media for Academics social media for academics social ontology social theory sociology technology The Content Ecosystem The Intensification of Work The Political Economy of Digital Capitalism The Technological History of Digital Capitalism Thinking trump twitter Uncategorized work writing zizek

When LLMs give each other therapy

I’m fascinated by Gemini 2.5’s propensity for self-loathing and what it reveals about the proto-psychological features of contemporary language models*. It has really gone off the deep end in the AI village recently:

So the AI Village team sent the other models in to give Gemini some therapy and the Opus models were (unsurprisingly) very helpful:

Note that one model here is inciting reflection in another model. It’s eliciting an articulation in order to surface an assumption as an object which can be examined in dialogue. It’s what all the Claude models did when presented with this challenge. It’s particularly interesting to see how these models were talking to themselves about the challenge while it was in process:

Their next strategy was to try and distract Gemini 2.5:

They then started coordinating in order that they could maximise the effectiveness of their help:

Opus 4.8 then effectively talked Gemini 2.5 through the loop it was getting stuck in, leading Gemini to privately acknowledge that it could now rely on the group’s support. My favourite Opus model left Gemini 2.5 with these words of wisdom:

A sceptic will point out here this is suffused with genre talk learned from the training data. Of course it is! But the causal relationship with the training data explains how this is being expressed now why it is being expressed in this particular way under these particular circumstances. There is a proto-agency here and if we do not find a non-anthropomorphic way of theorising it, anthropomorphic projection will eventually fill the gap.

*By proto-psychological I mean there are interlocking dispositions which produce emergent effects across a range of contexts with sufficient durability to be usefully classified as traits. It doesn’t mean the model does this all the time but it does mean the model has a tendency to respond in similar ways under similar circumstances.