Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

references to specific provinces

oai_token-act-pair · gpt-4o-mini Triggered by @bot

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

tings

-0.07

angen

-0.07

angs

-0.07

rita

-0.07

-0.07

TING

-0.07

yun

-0.07

-0.07

-0.07

anmar

-0.07

POSITIVE LOGITS

-wide

0.12

wide

0.12

/state

0.10

ä»½

0.08

-long

0.08

Ø¨ÙĪÙĦ

0.07

à¥Ģà¤¯

0.07

åĲ¾

0.07

hetic

0.07

ally

0.07

Activations Density 0.010%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact