Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

Douglas Adams

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-2-27b-it/resid_post/layer_16_width_262k_l0_medium

Prompts (Dashboard)

238,145 prompts, 512 tokens each

Dataset (Dashboard)

lmsys + oasst1

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 cabbage

0.48

ធី

0.48

)}$

0.47

 seabed

0.47

Yvette

0.46

 నీ

0.45

 अवस्थ

0.45

$&$-

0.44

Indexing

0.44

నీ

0.43

POSITIVE LOGITS

っています

0.54

Мы

0.50

Ἱ

0.49

Всем

0.49

 artificially

0.48

স্ট্র

0.47

 również

0.47

fst

0.47

 İran

0.46

 dans

0.46

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact