© Neuronpedia 2026

Privacy & Terms Blog GitHub Slack Twitter Contact

Neuronpedia

Natural Language

NEW Assistant AxisNEW Circuit TracerUPDATESteer SAE Evals ExportsAPI Community Blog Privacy & Terms Contact

Home
Gemma-3-4B-IT
22-GEMMASCOPE-2-RES-65K
1202

INDEX

Explanations

items in lists

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Top Features by Cosine Similarity

Configuration

google/gemma-scope-2-4b-it/resid_post/layer_22_width_65k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

No Configuration Found

Embeds

Show PlotsShow ExplanationShow ActivationsShow Test FieldShow SteerShow Link

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

奻

0.72

 हमेशा

0.71

 એટલે

0.70

 ainult

0.67

ようになりました

0.67

»;

0.65

 означает

0.65

apä

0.64

!;

0.64

 начинается

0.63

POSITIVE LOGITS

 등을

2.17

etc

2.15

،

2.12

2.10

 등이

2.02

 എന്നിവ

2.00

 และ

1.95

 등의

1.95

,…

1.91

,...

1.87

Activations Density 1.225%

No Known Activations