Neuronpedia

Explanations

No Explanations Found

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B
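
The configuration above implies a fixed number of token positions behind the dashboard. A minimal sketch of that arithmetic, assuming every prompt contributes exactly 128 tokens with no padding or truncation (the constant and function names are illustrative, not part of Neuronpedia):

```python
# Figures copied from the dashboard configuration; names are hypothetical.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Return the number of token positions covered by the prompt set."""
    return num_prompts * tokens_per_prompt


dashboard_tokens = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
print(f"{dashboard_tokens:,} token positions")
```

Under that assumption the dashboard covers 3,145,728 token positions sampled from the dataset.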

Negative Logits

Token      Logit
bjerg      -0.08
issen      -0.07
antha      -0.06
obel       -0.06
ingers     -0.06
 hafta     -0.06
zim        -0.06
           -0.06
ì©         -0.06
rze        -0.06

Positive Logits

Token      Logit
å½         0.06
asley      0.06
ulong      0.06
ãĥ³ãĤ¹     0.06
eria       0.06
ousse      0.06
nore       0.06
ãĢ         0.06
.win       0.05
Î»Î¹Î¬     0.05

Activations Density 0.000%

No Known Activations

This feature has no known activations.
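
One way to read the density figure is as the share of token positions on which the feature fires. The sketch below assumes that definition (fraction of positions with activation above zero) and a three-decimal percentage display. Both are assumptions for illustration and the function names are hypothetical, not Neuronpedia's documented method.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


def format_density(density: float) -> str:
    """Render a density fraction as a percentage with three decimals."""
    return f"{density * 100:.3f}%"


# A feature that never fires on any sampled position.
silent = [0.0] * 1_000
print(format_density(activation_density(silent)))
```

With these assumptions, a feature that never fires on the sampled positions is reported as 0.000%, which matches a page that lists no known activations.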

© Neuronpedia 2026