Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

mentions of specific names and titles

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

resh

-0.07

ienes

-0.06

bread

-0.06

ãģ¶

-0.06

FINITY

-0.06

ACLE

-0.06

 æ´

-0.06

ãĥĪãĥª

-0.05

ìļĶ

-0.05

Øµ

-0.05

POSITIVE LOGITS

Ed

0.08

aea

0.07

Ed

0.07

ahu

0.07

.ed

0.07

enting

0.07

xED

0.07

gewater

0.07

imson

0.06

rott

0.06

Activations Density 0.015%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact