Explanations

ag followed by specific suffixes

np_acts-logits-general · gemini-2.5-flash-lite

The neuron activates on any token beginning with the letters “Ag,” i.e. words or names prefixed by “Ag.”

oai_token-act-pair · o4-mini Triggered by @jyhe0408

words beginning with the “Ag”/“AG” prefix, often as the start of proper names or technical terms.

oai_token-act-pair · gpt-5 Triggered by @jyhe0408

the beginning of proper nouns or brand names starting with "Ag".

oai_token-act-pair · claude-4-5-sonnet Triggered by @jyhe0408

Configuration

google/gemma-scope-27b-pt-res/layer_34/width_131k

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Negative Logits

Token               Logit
"hler"              -0.89
"更に"              -0.84
"ół"                -0.82
" further"          -0.81
"どうやら"          -0.80
"therosclerosis"    -0.78
"mahami"            -0.77
" Satt"             -0.75
"FURTHER"           -0.75
" furthermore"      -0.75

Positive Logits

Token               Logit
"Ag"                1.45
"ag"                1.37
"Ag"                1.32
"AG"                1.13
"Agn"               1.01
"这些"              0.92
"AGR"               0.89
" Agenda"           0.87
"Agenda"            0.85
"Aging"             0.85
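
The explanations all describe an "Ag" prefix, so a quick sanity check is to count how many of the ten positive-logit tokens listed above share it. The sketch below is illustrative only: the token list is copied from this page, while the helper name `starts_with_ag` and the choice to ignore case and leading whitespace are assumptions, not part of the dashboard.

```python
# Top positive-logit tokens as listed above (leading spaces preserved).
positive_logits = [
    ("Ag", 1.45), ("ag", 1.37), ("Ag", 1.32), ("AG", 1.13), ("Agn", 1.01),
    ("这些", 0.92), ("AGR", 0.89), (" Agenda", 0.87), ("Agenda", 0.85),
    ("Aging", 0.85),
]


def starts_with_ag(token: str) -> bool:
    """Hypothetical helper: case-insensitive 'ag' prefix check.

    Leading whitespace is ignored, which is an assumption about how
    space-prefixed tokens such as ' Agenda' should be treated.
    """
    return token.lstrip().lower().startswith("ag")


matches = [tok for tok, _ in positive_logits if starts_with_ag(tok)]
exceptions = [tok for tok, _ in positive_logits if not starts_with_ag(tok)]

print(f"{len(matches)} of {len(positive_logits)} tokens start with 'ag'")
print("exceptions:", exceptions)
```

Under these assumptions, nine of the ten tokens match and the only exception is "这些".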

Activations Density 0.056%
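
To get a feel for what a density of 0.056% means, the arithmetic below converts it into an approximate token count. It assumes, and the page does not confirm this, that the density was measured over the dashboard prompt set listed under Configuration (24,576 prompts of 128 tokens each); the variable names are illustrative.

```python
# Figures taken from this page.
prompts = 24_576
tokens_per_prompt = 128
density = 0.056 / 100  # 0.056% expressed as a fraction

# Assumption: the density refers to this prompt set.
total_tokens = prompts * tokens_per_prompt
expected_active = total_tokens * density

print(f"total tokens: {total_tokens:,}")
print(f"tokens with nonzero activation (approx.): {round(expected_active):,}")
```

If that assumption holds, the feature fires on roughly 1,762 of 3,145,728 tokens.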
