Neuronpedia

Explanations

updates and operations

np_acts-logits-general · gemini-2.5-flash-lite

Configuration

google/gemma-scope-2-27b-it/resid_post/layer_31_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1
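
The configuration above can be read mechanically. The following is a minimal sketch, assuming the source ID follows an `org/release/hook/layer_N_width_W_l0_L` layout (inferred only from the single ID shown here, not from a documented naming scheme); `parse_source_id` is a hypothetical helper, not part of any Neuronpedia API. It also works out the total token count implied by 238,145 prompts of 512 tokens each.

```python
import re

# Source ID copied from the Configuration field above.
SOURCE_ID = "google/gemma-scope-2-27b-it/resid_post/layer_31_width_262k_l0_medium"


def parse_source_id(source_id: str) -> dict:
    """Split a source ID into its parts (hypothetical helper; layout is an assumption)."""
    org, release, hook, sae = source_id.split("/")
    match = re.fullmatch(r"layer_(\d+)_width_(\w+?)_l0_(\w+)", sae)
    if match is None:
        raise ValueError(f"unrecognized SAE name: {sae!r}")
    layer, width, l0 = match.groups()
    return {
        "org": org,
        "release": release,
        "hook": hook,
        "layer": int(layer),
        "width": width,
        "l0": l0,
    }


config = parse_source_id(SOURCE_ID)

# Dashboard prompt set: 238,145 prompts, 512 tokens each.
total_tokens = 238_145 * 512

print(config)
print(f"total dashboard tokens: {total_tokens:,}")
```

Under these assumptions the ID resolves to layer 31, hook `resid_post`, width `262k`, and the prompt set covers 121,930,240 tokens.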

Negative Logits

"APs"        0.46
"outed"      0.44
"shifted"    0.43
"varlak"     0.42
"Ob"         0.42
"ž"          0.42
"insel"      0.42
"changed"    0.41
"NIH"        0.41
"RE"         0.41

Positive Logits

" communion"    0.49
"зион"          0.45
"ственным"      0.45
" folder"       0.44
"<0xEC>"        0.44
" разум"        0.44
" ма"           0.44
" rhythm"       0.44
" patriotic"    0.44
" Маке"         0.43

Activations Density: 0.005%

No Known Activations

© Neuronpedia 2025