Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

these

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 these

-2.03

these

-1.93

These

-1.48

 THESE

-1.47

 These

-1.41

これらの

-1.38

 thefe

-1.30

 těchto

-1.24

 theſe

-1.22

 этих

-1.20

POSITIVE LOGITS

two

0.75

 same

0.66

 ideas

0.65

 three

0.64

 days

0.63

 kinds

0.62

 events

0.61

 words

0.60

 questions

0.60

0.60

Activations Density 0.113%

No Known Activations

© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact