Neuronpedia



Explanations

function calls with parentheses

np_acts-logits-general · gemini-2.5-flash-lite


Configuration

google/gemma-scope-2-4b-it/resid_post/layer_22_width_65k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1
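The configuration identifier above packs several fields into one path. The following is a minimal sketch of splitting it apart, assuming the segments mean organization, release, hook point, and a layer/width/L0 variant. That reading is inferred from the string itself, and `parse_source_id` is a hypothetical helper, not part of any Neuronpedia API.

```python
import re

# Identifier copied verbatim from the Configuration field.
SOURCE_ID = "google/gemma-scope-2-4b-it/resid_post/layer_22_width_65k_l0_medium"


def parse_source_id(source_id: str) -> dict:
    """Split a configuration path into labeled parts (field names are assumptions)."""
    org, release, hook, variant = source_id.split("/")
    match = re.fullmatch(r"layer_(\d+)_width_(\w+?)_l0_(\w+)", variant)
    if match is None:
        raise ValueError(f"unexpected variant format: {variant!r}")
    return {
        "org": org,
        "release": release,
        "hook": hook,
        "layer": int(match.group(1)),
        "width": match.group(2),
        "l0": match.group(3),
    }


if __name__ == "__main__":
    for key, value in parse_source_id(SOURCE_ID).items():
        print(f"{key}: {value}")
```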


Negative Logits

Token       Logit
'';         0.98
"";         0.98
 '.';       0.96
';",        0.94
❞           0.93
{};         0.90
;";         0.86
='';        0.83
="";        0.82
_;          0.82

Positive Logits

Token       Logit
())         3.09
(not shown) 2.99
.)          2.91
!)          2.89
):          2.87
?)          2.81
[])         2.81
).          2.79
)。         2.70
"")         2.69

Activations Density 0.498%
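To put the density figure in context, here is a rough back-of-the-envelope sketch. It assumes that activation density is the fraction of token positions on which the feature fires, and that every dashboard prompt contributes the full 512 tokens. Neither assumption is stated on this page, so the result is only an order-of-magnitude estimate.

```python
# Figures taken from the dashboard fields on this page.
NUM_PROMPTS = 238_145
TOKENS_PER_PROMPT = 512
DENSITY_PERCENT = 0.498

# Assumption: every prompt is exactly TOKENS_PER_PROMPT tokens long.
total_tokens = NUM_PROMPTS * TOKENS_PER_PROMPT

# Assumption: density = active token positions / total token positions.
estimated_active = round(total_tokens * DENSITY_PERCENT / 100)

if __name__ == "__main__":
    print(f"total tokens:            {total_tokens:,}")
    print(f"estimated active tokens: {estimated_active:,}")
```

Under these assumptions the feature would fire on roughly 600,000 of about 122 million token positions.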

No Known Activations

© Neuronpedia 2026
