Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

addressing an issue

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 myſelf

-1.30

 itſelf

-1.25

Efq

-1.16

ſelves

-1.07

 Jefus

-1.07

 himſelf

-1.05

 themſelves

-1.05

NUMX

-1.05

 Theſe

-1.04

 Cæsar

-1.03

POSITIVE LOGITS

the

0.90

 some

0.68

 this

0.66

all

0.66

0.65

 their

0.60

0.59

and

0.59

0.59

0.59

Activations Density 0.067%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact