Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

end

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-0.68

↵

-0.60

↵↵

-0.55

-0.52

<eos>

-0.51

an

-0.50

-0.48

-0.47

you

-0.46

las

-0.45

POSITIVE LOGITS

ſelves

1.48

 Houſe

1.44

 myſelf

1.44

Efq

1.43

 ſtate

1.40

 itſelf

1.38

ſelf

1.34

 Reſ

1.34

 Anſ

1.34

 purpoſe

1.32

Activations Density 0.095%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact