Neuronpedia



Explanations

social or emotional states (np_acts-logits-general · gemini-2.5-flash-lite)

ready and prepared (np_acts-logits-general · gemini-2.5-flash-lite)


Configuration

google/gemma-scope-2-27b-pt/resid_post/layer_16_width_16k_l0_medium

Prompts (Dashboard): 392,802 prompts, 256 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted
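The identifier listed under Configuration appears to pack several fields into one slash-separated path. The sketch below splits it into parts. The field names (`org`, `release`, `hook`) and the helper `parse_sae_id` are hypothetical labels chosen for illustration, and the reading of the last segment as alternating key/value pairs is an assumption based only on how the string looks, not on any documented Neuronpedia or Gemma Scope schema.

```python
def parse_sae_id(source: str) -> dict:
    """Split an identifier like the one under Configuration into labeled parts.

    Assumption: the path has exactly four slash-separated segments and the last
    segment alternates key/value tokens joined by underscores.
    """
    org, release, hook, sae = source.split("/")
    tokens = sae.split("_")
    # Pair up tokens: layer -> 16, width -> 16k, l0 -> medium
    fields = dict(zip(tokens[0::2], tokens[1::2]))
    return {
        "org": org,                      # hypothetical label
        "release": release,              # hypothetical label
        "hook": hook,                    # hypothetical label
        "layer": int(fields["layer"]),
        "width": fields["width"],        # kept as a string; "16k" is not converted
        "l0": fields["l0"],
    }


parsed = parse_sae_id(
    "google/gemma-scope-2-27b-pt/resid_post/layer_16_width_16k_l0_medium"
)
print(parsed)
```

The width is left as the literal string because the page does not say whether "16k" means 16,000 or 16,384.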


Negative Logits

esehen  0.70
с       0.67
miner   0.66
rie     0.65
they    0.64
ight    0.63
ة       0.62
nett    0.62
rund    0.62
ranet   0.62

Positive Logits

el      0.93
as      0.91
ב       0.88
on      0.80
ే       0.80
on      0.79
ด       0.76
ed      0.76
ब       0.76
ר       0.75

Activations Density 8.943%
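To give the density figure a sense of scale, the sketch below converts it into an approximate token count. It rests on two assumptions that this page does not confirm: that activation density means the fraction of tokens on which the feature has a nonzero activation, and that it was measured over the same dashboard prompt set of 392,802 prompts at 256 tokens each. The function name `estimated_active_tokens` is hypothetical.

```python
PROMPTS = 392_802          # from "Prompts (Dashboard)"
TOKENS_PER_PROMPT = 256    # from "Prompts (Dashboard)"
DENSITY = 0.08943          # 8.943%, from "Activations Density"


def estimated_active_tokens(prompts: int, tokens_per_prompt: int, density: float):
    """Return (total tokens, estimated tokens with nonzero activation).

    Assumption: density is the fraction of all dashboard tokens that activate.
    """
    total = prompts * tokens_per_prompt
    return total, round(total * density)


total_tokens, active_tokens = estimated_active_tokens(
    PROMPTS, TOKENS_PER_PROMPT, DENSITY
)
print(f"total tokens: {total_tokens:,}")
print(f"estimated active tokens: {active_tokens:,}")
```

If either assumption is wrong, the estimate does not apply; the density percentage itself is the only figure the page reports.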

No Known Activations

© Neuronpedia 2025
