Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

Re

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Sa

-0.93

Re

-0.75

Ph

-0.71

sa

-0.70

Qu

-0.65

Res

-0.65

Es

-0.63

Th

-0.61

Pe

-0.61

ph

-0.61

POSITIVE LOGITS

 myſelf

1.23

 raiſ

1.12

Efq

1.06

 himſelf

1.05

 uſed

1.02

 itſelf

1.00

 auffi

0.99

 Jefus

0.99

ſelf

0.97

 ſta

0.94

Activations Density 0.131%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact