Interpretability audits
- Mechanistic probing
- Behavioral evaluation
- Bias & robustness assessment
InterpretAI is an AI consultancy founded by researchers at UC Berkeley. We help organizations understand what their models have learned — and use that understanding for discovery, governance, and high-stakes deployment.
Interpretability is one of the longest-standing open problems in machine learning. The systems being deployed across most industries today are functional but largely opaque, often even to the engineers who maintain them.
We work with companies that can't afford that opacity: in finance, healthcare, law, science, and infrastructure. We bring the rigor of the academic frontier — and we ship.
Gašper Beguš presented our research on interpretability as an engine for scientific discovery — methods for moving beyond model explanation into the production of new knowledge.
Read coverage ↗
Speech models trained on raw audio rather than text — closer to how infants acquire language than to how large text models work. Their internal representations can be probed and compared directly against neural signals, giving a methodology that connects machine learning with cognitive science.
Methods overview ↗
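To make "probed" concrete: one common pattern is to fit a simple linear probe from a model's hidden activations to an external variable, then compare held-out fit across layers to see where that variable is encoded. The sketch below is not our lab's pipeline, just the general shape of such an analysis, using scikit-learn's Ridge and fully simulated stand-ins for the activations and the target signal.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Synthetic stand-ins. In a real analysis, activations[layer] would hold a
# speech model's hidden states for each audio frame, and `target` an aligned
# measurement (an acoustic feature, or a neural response to the same audio).
n_frames, hidden_dim, n_layers = 2000, 64, 4
activations = {l: rng.normal(size=(n_frames, hidden_dim)) for l in range(n_layers)}

# Plant a linear signal in layer 2 so the probe has something to find.
target = activations[2] @ rng.normal(size=hidden_dim) + rng.normal(size=n_frames)

# Fit one ridge probe per layer; held-out R^2 shows in which layer the
# target is linearly decodable, which is the basic probing result.
for layer, X in activations.items():
    X_tr, X_te, y_tr, y_te = train_test_split(X, target, random_state=0)
    probe = Ridge(alpha=1.0).fit(X_tr, y_tr)
    print(f"layer {layer}: held-out R^2 = {r2_score(y_te, probe.predict(X_te)):.3f}")
```

The same loop works whether the target is a phonetic feature or a recorded brain response; only the alignment step changes.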
Latent Spacecraft, a research project on the latent spaces inside generative models, produced in collaboration with the design studio Metahaven and Riccardo Petrini. A demonstration that interpretability can be a humanistic practice as well as a technical one.
More on ninabegus.com ↗
Some problems call for a method that's six months old. Others are best handled with techniques that have been around for decades. Knowing the difference matters more than chasing what's newest.
Post-hoc explanation has real limits. We help organizations understand the systems they already run, and we design future ones so transparency is a property of the architecture itself.
We work the way a research group works, on the schedule a business actually has. What we hand off is meant to hold up under both technical and regulatory review.
The hardest part of AI work is rarely the engineering — it's deciding what's worth building. We bring the humanistic side of this thinking to the table, grounded in research like Nina's Artificial Humanities (University of Michigan Press, 2025).
Interpretable machine learning · Speech & language models
Associate Professor of Linguistics at UC Berkeley and director of the Berkeley Speech and Computation Lab. PhD in Linguistics from Harvard. He develops interpretable machine-learning models that learn speech and language from raw audio, and applies these methods to questions of scientific discovery. His work has been covered by The New York Times, The Guardian, The Economist, National Geographic, the Financial Times, and others.
AI & the humanities · Strategy & consulting
Researcher at UC Berkeley working on artificial intelligence and the humanities. PhD in Comparative Literature from Harvard, where her dissertation became the basis for Artificial Humanities (University of Michigan Press, 2025). She is a co-author of Latent Spacecraft, a research project on AI latent spaces produced with the design studio Metahaven. Previously a Senior Researcher at the Berggruen Institute, where she developed process-based consulting methods used by both startups and major technology companies.