Neuronpedia

APIAssistant AxisNEW Circuit TracerNEW Steer SAE Evals Exports Community Blog Privacy & Terms Contact

INDEX

Explanations

phone numbers

np_acts-logits-general · gemini-2.5-flash-lite

New Auto-Interp

Configuration

google/gemma-scope-27b-pt-res/layer_34/width_131k

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

গ

-0.90

 warned

-0.83

 those

-0.81

 noel

-0.77

 calculating

-0.77

 cera

-0.75

tisgarh

-0.75

lą

-0.75

 Mahesh

-0.74

ään

-0.71

POSITIVE LOGITS

0.90

"+

0.86

>>(

0.82

:(

0.82

$(

0.81

Tel

0.80

Tel

0.79

0.78

이다

0.76

}(

0.76

Activations Density 0.035%

No Known Activations

© Neuronpedia 2025

Privacy & Terms Blog GitHub Slack Twitter Contact