Explanations

Code/identifiers
(np_max-act · gemini-2.0-flash)

The neuron strongly lights up on numeric tokens (sequences of digits).
(oai_token-act-pair · o4-mini, triggered by @xinyanhu8)

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

.Go    -0.07
sos    -0.06
(isolate    -0.06
dfd    -0.06
.MAX    -0.06
減    -0.06
 possível    -0.06
 Womens    -0.06
 організації    -0.06
|--    -0.06

Positive Logits

ming    0.07
trag    0.07
MING    0.07
pare    0.07
aktion    0.06
ıl    0.06
 jednání    0.06
 @"↵    0.06
mış    0.06
elog    0.06

Activations Density 0.027%
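
To put the density figure in perspective, here is a minimal arithmetic sketch. It assumes that "Activations Density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page does not define the term, so treat that reading as an assumption. The variable and function names are illustrative only. Under that assumption the feature fires on roughly 0.28 tokens per 1,024-token context, or about once every 3,700 tokens.

```python
# Values copied from the dashboard above.
density_percent = 0.027  # "Activations Density 0.027%"
context_size = 1024      # "Context Size: 1,024"

# Assumption: density is the fraction of tokens with nonzero activation.
density = density_percent / 100.0


def expected_active_tokens(n_tokens: int, fraction: float) -> float:
    """Expected number of tokens on which the feature fires (hypothetical helper)."""
    return n_tokens * fraction


# Expected firings within a single context window.
per_context = expected_active_tokens(context_size, density)

# Average number of tokens between firings.
tokens_per_activation = 1.0 / density

print(f"expected active tokens per context: {per_context:.3f}")
print(f"about one activation every {tokens_per_activation:.0f} tokens")
```

A feature this sparse will often produce no activations at all in a small sample of text.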

No Known Activations