Neuronpedia

Explanations

Guinea

np_max-act · gemini-2.0-flash

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token             Logit
" Confederacy"    -0.91
" auffi"          -0.82
" Learner"        -0.79
"EndInit"         -0.79
" guineas"        -0.79
" ―――――"          -0.78
"saraba"          -0.77
" fubject"        -0.77
" ſtand"          -0.77
" ſte"            -0.77

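Positive Logits

Token             Logit
(not rendered)    0.52
(not rendered)    0.51
(not rendered)    0.46
" köz"            0.46
(not rendered)    0.45
(not rendered)    0.44
(not rendered)    0.44
"or"              0.43
"kø"              0.42
(not rendered)    0.42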

Activations Density 0.054%
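
As a rough sanity check on the figures above, the following Python sketch estimates how many dashboard tokens a density of 0.054% corresponds to. It assumes that the density is the fraction of tokens on which the feature is non-zero and that it was measured over the same 16,384 × 128-token prompt set. Neither assumption is stated on this page, and the function names are hypothetical.

```python
# Figures copied from the dashboard configuration and density line.
NUM_PROMPTS = 16_384
TOKENS_PER_PROMPT = 128
DENSITY_PERCENT = 0.054


def total_tokens(num_prompts: int, tokens_per_prompt: int) -> int:
    """Total number of token positions in the dashboard prompt set."""
    return num_prompts * tokens_per_prompt


def expected_active_tokens(n_tokens: int, density_percent: float) -> float:
    """Token positions with a non-zero activation, ASSUMING the density
    is a per-token firing rate over this same prompt set."""
    return n_tokens * density_percent / 100.0


n = total_tokens(NUM_PROMPTS, TOKENS_PER_PROMPT)
active = expected_active_tokens(n, DENSITY_PERCENT)

print(n)              # 2097152
print(round(active))  # 1132
```

Under these assumptions the feature fires on roughly one token in every 1,850, which is why it counts as sparse.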

No Known Activations

© Neuronpedia 2025
