INDEX

Explanations

This neuron seems to activate on somewhat random words and phrases, perhaps short function words or verb phrases, and the content doesn't appear to create a coherent meaning.

oai_token-act-pair · gemini-2.0-flash

New Auto-Interp

Configuration

fnlp/Llama-Scope-R1-Distill/400M-Slimpajama-400M-OpenR1-Math-220k/L21R

Prompts (Dashboard)

24,576 prompts, 128 tokens each

Dataset (Dashboard)

Hzfinfdu/SlimPajama-3B and open-r1/OpenR1-Math-220k

Features

32,768

Data Type

float32

Hook Name

blocks.21.hook_resid_post

Architecture

jumprelu

Context Size

1,024

Dataset

cerebras/SlimPajama-627B

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 mÃ³

-0.07

moid

-0.06

 slightly

-0.06

ãĤįãģĨ

-0.06

åħį

-0.06

invalid

-0.06

 Ð²Ð¿Ð¾Ð»Ð½Ðµ

-0.06

irim

-0.06

 Invalid

-0.06

æľīäºº

-0.06

POSITIVE LOGITS

 limited

0.17

 lack

0.17

 absence

0.15

 lacking

0.15

limited

0.14

 lacks

0.14

lack

0.13

 minimal

0.13

 lacked

0.13

 Lack

0.12

Activations Density 0.054%

This neuron seems to activate on somewhat random words and phrases, perhaps short function words or verb phrases, and the content doesn't appear to create a coherent meaning.

No Comments

No Known Activations