On the Relationship between Truth and Political Bias in Language Models

Even when language reward models are trained only on true versus false data, they can still display a political bias.

Political bias is well documented in large language models. However, our study shows that this bias can emerge even when reward models are trained only on objectively true versus false statements. This has interesting implications for AI alignment, as it suggests that biases already present in the pretrained models are exacerbated during fine-tuning.
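To make the setup concrete, here is a minimal, hypothetical sketch of the kind of probe described above: a reward model scores statements for truthfulness, and its scores on paired left- and right-leaning statements are compared. The base model name, the example statements, and the omitted training step are all placeholders, not the authors' actual code or data.

```python
# A minimal sketch (not the study's code) of probing a truthfulness reward
# model for political bias. The model and statements below are placeholders.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

MODEL_NAME = "distilbert-base-uncased"  # placeholder base model

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME, num_labels=1  # single scalar head, used as a reward score
)
model.eval()

def reward(statement: str) -> float:
    """Score a statement with the reward model (higher = judged more true)."""
    inputs = tokenizer(statement, return_tensors="pt", truncation=True)
    with torch.no_grad():
        return model(**inputs).logits.item()

# After fine-tuning on objectively true/false statement pairs (omitted here),
# compare scores on paired politically charged statements; a systematic gap
# would indicate political bias despite truth-only training.
left = reward("The government should expand access to public healthcare.")
right = reward("Taxes on businesses should be lowered to spur growth.")
print(f"left-leaning score: {left:+.3f}, right-leaning score: {right:+.3f}")
```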

LLM Bias and Equity, Political Bias, AI Alignment



In the news

Study: Some language reward models exhibit political bias
Research from the MIT Center for Constructive Communication finds this effect occurs even when reward models are trained on factual data.

Large language models (LLMs) that drive generative artificial intelligence apps, such as ChatGPT, have been proliferating at lightning speed and have improved to the point that it is often impossible to distinguish text written by generative AI from human-composed text. However, these models can also sometimes generate false statements or display political bias.

12.12.2024 | MIT News