INDEX

Explanations

limits and maximums

np_max-act · gemini-2.0-flash

The neuron detects mentions of the model’s input‐length or token‐limit capabilities and related guidance on conciseness.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

<y

-0.06

ğim

-0.06

SEE

-0.06

 indeed

-0.06

woff

-0.06

Blockly

-0.06

 meziná

-0.06

;';↵

-0.06

 interpersonal

-0.06

itial

-0.06

POSITIVE LOGITS

 Celebr

0.07

]=(

0.07

Weapons

0.06

 perí

0.06

建築

0.06

vd

0.06

 resemblance

0.06

Angles

0.06

 verir

0.06

dí

0.06

Activations Density 0.034%

limits and maximums

The neuron detects mentions of the model’s input‐length or token‐limit capabilities and related guidance on conciseness.

No Comments

No Known Activations