Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ãģ¦ãĤĤ

-0.07

 Jenner

-0.06

ferred

-0.06

vs

-0.06

-0.06

 duplicate

-0.06

 Screw

-0.06

="__

-0.05

Cub

-0.05

 exhaust

-0.05

POSITIVE LOGITS

 CONSEQUENTIAL

0.08

Ã¡bado

0.07

autiful

0.07

readystatechange

0.07

tingham

0.07

aptops

0.06

naÄįenÃŃ

0.06

zcze

0.06

 Garrison

0.06

IPPING

0.06

Activations Density 0.000%

No Known Activations

This feature has no known activations.

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact