Neuronpedia

INDEX

Explanations

store account

np_acts-logits-general · gemini-2.5-flash-lite

Configuration: google/gemma-scope-2-27b-it/resid_post/layer_40_width_262k_l0_medium
Prompts (Dashboard): 238,145 prompts, 512 tokens each
Dataset (Dashboard): lmsys + oasst1

Negative Logits

'uteurs'       0.41
' decomposes'  0.41
' össz'        0.40
' මිල'         0.39
' ferm'        0.39
'鉀'           0.39
'resents'      0.38
'તું'          0.36
' комите'      0.36
' agama'       0.36

Positive Logits

'scrollView'   0.41
' khắc'        0.37
' ресур'       0.37
'kräft'        0.36
' বির'         0.36
'ʍ'            0.35
'indrical'     0.35
' Simplified'  0.35
'hotite'       0.35
'"./'          0.35

Activations Density 0.000%

No Known Activations

© Neuronpedia 2025
