INDEX

Explanations

IS

np_max-act · gemini-2.0-flash

The neuron detects short all-caps two-letter strings (e.g. “IS,” “SM,” “QC”)—i.e. brief uppercase abbreviations.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Fo

-0.07

.Description

-0.07

มากกว

-0.07

pea

-0.06

ocre

-0.06

 decreased

-0.06

 гриб

-0.06

lasting

-0.06

 buflen

-0.06

 multip

-0.06

POSITIVE LOGITS

.act

0.07

0.06

チ

0.06

xmin

0.06

주소

0.06

反

0.06

gens

0.06

 shocks

0.06

goal

0.06

 elektron

0.06

Activations Density 0.000%

IS

The neuron detects short all-caps two-letter strings (e.g. “IS,” “SM,” “QC”)—i.e. brief uppercase abbreviations.

No Comments

No Known Activations