© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
Gemma-4-31B
30-RES-MATRYOSHKA-131K
130929

INDEX

Explanations

bool type attributes

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Top Features by Cosine Similarity

Configuration

decoderesearch/gemma-4-saes/gemma-4-31b

Prompts (Dashboard)

16,384 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

No Configuration Found

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

can

-0.11

 could

-0.08

 dapat

-0.08

 può

-0.08

 können

-0.08

 aún

-0.08

 میتوان

-0.07

 можем

-0.07

できます

-0.07

したいと思います

-0.07

POSITIVE LOGITS

通常の

0.06

 دعم

0.06

 சிலர்

0.06

رفع

0.06

当時の

0.05

வந்த

0.05

 সহ্য

0.05

 lieu

0.05

 normalement

0.05

慣

0.05

Activations Density 0.002%

No Known Activations