Echoes in GenAI generations

September 4, 2025
Nebojsa Jojic, Microsoft

In our recent PNAS paper we demonstrate that large language models produce little variation in generated narratives. Compared to those generations, a human-written narrative is usually Sui Generis, i.e. one of a kind. Or as we’d say in ML and statistics, human writing is in the tails of the distribution of the content LLMs generate. We introduced the Sui Generis (SG) score which can be used to evaluate distinctiveness of written text, whether it was written by a human or by a machine. SG scores may find its uses both in model improvement and in assistive tools (e.g. helping you to sound less like a GPT). As LLMs exhibit increasingly useful abilities to compare and refine ideas, and occasionally add to them, good writing in the future will likely still require human-led, possibly collaborative effort, but greatly assisted by AI.

- Nebojsa Jojic
  
  Senior Principal Researcher
Research Area
- Artificial intelligence
- Human language technologies
Research Lab
- Microsoft Research Lab - Redmond
Group
- Natural Language Processing Group

Watch Next

Using LLMs for safe low-level programming
February 25, 2025
Aseem Rastogi,

Pantazis Deligiannis
GenAI for Supply Chain Management: Present and Future
February 14, 2025
Georg Glantschnig,

Beibin Li,

Konstantina Mellou

, et. al.
Fostering appropriate reliance on AI
September 3, 2024
Mihaela Vorvoreanu
Driving Industry Evolution: Exploring the Impact of Generative AI on Sector Transformation
June 4, 2024
Jiang Bian
Panel: Generative AI for Global Impact: Challenges and Opportunities
June 4, 2024
Jacki O'Neill,

Tanuja Ganu,

Sunayana Sitaram

, et. al.
Keynote: Building Globally Equitable AI
June 4, 2024
Jacki O'Neill
Making Sentence Embeddings Robust to User-Generated Content
May 29, 2024
Lydia Nishimwe
Generative AI and Plural Governance: Mitigating Challenges and Surfacing Opportunities
March 5, 2024
Madeleine Daepp
The Metacognitive Demands and Opportunities of Generative AI
March 5, 2024
Lev Tankelevitch
MEGA: Multi-lingual Evaluation of Generative AI
June 29, 2023
Kabir Ahuja,

Millicent Ochieng