Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

more

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

IsContent

-0.99

 Мексичка

-0.99

 pleaſure

-0.97

 myſelf

-0.97

 itſelf

-0.93

########.

-0.92

SequentialGroup

-0.90

+#+

-0.90

dafx

-0.88

Efq

-0.88

POSITIVE LOGITS

↵↵

0.76

↵

0.75

0.75

0.74

’

0.66

<eos>

0.64

0.60

0.59

0.59

‘

0.56

Activations Density 0.023%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact