Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

orthand

-0.08

opsis

-0.07

_errno

-0.07

-tests

-0.07

ÅĤo

-0.07

 Ã§oÄŁ

-0.07

roscope

-0.07

erosis

-0.07

è£ı

-0.07

>Show

-0.07

POSITIVE LOGITS

 basically

0.06

port

0.06

wise

0.06

odie

0.06

ber

0.05

 Wikimedia

0.05

0.05

ger

0.05

Ut

0.05

 stuff

0.05

Activations Density 0.000%

No Known Activations

This feature has no known activations.

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact