Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

NFL news and analysis

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-27b-pt-res/layer_34/width_131k

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Martial

-0.84

wahati

-0.80

Sit

-0.78

 Odor

-0.77

sit

-0.75

 Martial

-0.73

rendon

-0.71

 шпа

-0.70

Joyce

-0.69

경

-0.68

POSITIVE LOGITS

ESI

0.86

 calib

0.80

 angele

0.79

π

0.78

 giac

0.78

APIC

0.75

 približ

0.74

psz

0.73

邪魔

0.73

齧

0.73

Activations Density 0.025%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact