Explanations

text snippets

np_max-act · gemini-2.0-flash

comparative and superlative forms of adjectives and verbs.

oai_token-act-pair · gpt-4o-mini Triggered by @xinyanhu8

The neuron fires on runs of underscore characters (i.e. the blank “____” tokens used as placeholders in fill-in-the-blank questions).

oai_token-act-pair · o4-mini Triggered by @xinyanhu8
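
As a rough illustration of the pattern the last explanation describes, the sketch below locates runs of underscores in a string. It works on characters, not model tokens, and it says nothing about how the feature itself is computed. The helper name `find_blank_runs` and the minimum run length of two are assumptions made for this example.

```python
import re

# Assumption: a "blank" is two or more consecutive underscores.
BLANK_RUN = re.compile(r"_{2,}")


def find_blank_runs(text):
    """Return (start, end) character spans for each run of underscores."""
    return [m.span() for m in BLANK_RUN.finditer(text)]


# A fill-in-the-blank style sentence with one placeholder.
print(find_blank_runs("The cat ____ on the mat."))
```

A character-level regex is only a proxy. A tokenizer may split a long blank into several tokens, so the spans found here need not line up with the positions where the feature activates.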

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Negative Logits

.Exception

-0.06

 proverb

-0.06

(ad

-0.06

ark

-0.06

่ท

-0.06

.Brand

-0.06

주소

-0.06

Vij

-0.06

PRETTY

-0.06

-Year

-0.06

Positive Logits

izin

0.07

asier

0.06

 çocu

0.06

 نتیجه

0.06

..."↵

0.06

YY

0.06

ну

0.06

�

0.06

_AX

0.06

 quant

0.06

Activations Density 0.005%
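
To give a sense of scale for the density figure, here is a minimal sketch that assumes density means the fraction of token positions where the feature's activation is greater than zero. That definition is an assumption and is not stated on this page. The function name `activation_density` and the toy data are hypothetical.

```python
def activation_density(activations):
    """Fraction of token positions with a positive activation.

    Assumption: "density" is the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


# Toy data: 5 active positions out of 100,000.
toy = [0.0] * 99_995 + [1.0] * 5
print(f"{activation_density(toy):.3%}")
```

Under that assumption, a density of 0.005% corresponds to about 5 active positions per 100,000 tokens.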

No Known Activations