Neuronpedia

Explanations

rubbing and related words

np_acts-logits-general · gemini-2.5-flash-lite

Configuration

google/gemma-scope-27b-pt-res/layer_22/width_131k

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

-2.45

 neue

-2.44

The

-2.44

ve

-2.34



-2.33



-2.30

-2.28

我们

-2.22

小時

-2.22

-2.20

Positive Logits

耵  2.73
FirstName  2.48
鷓  2.31
嫱  2.28
蹕  2.28
絎  2.25
睺  2.19
妧  2.06
»;  2.05
硨  2.05

Activations Density 0.014%

No Known Activations

© Neuronpedia 2025
