INDEX

Explanations

bound

np_max-act · gemini-2.0-flash

The neuron detects words ending in “-bound,” i.e. terms describing substances or components bound to surfaces or membranes.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

REM

-0.07

Checkpoint

-0.07

ONES

-0.07

MAKE

-0.06

GHz

-0.06

 engaging

-0.06

arges

-0.06

Sampling

-0.06

umes

-0.06

JSON

-0.06

POSITIVE LOGITS

险

0.08

 Oval

0.06

_flg

0.06

ar

0.06

Tud

0.06

 usefulness

0.06

.psi

0.06

 disfr

0.06

 toplam

0.06

 الخاص

0.06

Activations Density 0.012%

bound

The neuron detects words ending in “-bound,” i.e. terms describing substances or components bound to surfaces or membranes.

No Comments

No Known Activations