INDEX

Explanations

:

np_max-act · gemini-2.0-flash

scripted dialogue turns marked by speaker labels and dialogue punctuation indicating conversational exchanges.

oai_token-act-pair · gpt-5 Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.appspot

-0.07

etically

-0.07

.Generation

-0.07

 glitch

-0.07

-seven

-0.06

ỏ

-0.06

ζ

-0.06

 yardımcı

-0.06

`]

-0.06

 ثاني

-0.06

POSITIVE LOGITS

rift

0.08

.getKey

0.07

Got

0.07

 They

0.07

 preco

0.07

lei

0.07

 {},↵

0.07

רכת

0.07

yog

0.07

 newborn

0.06

Activations Density 0.054%

:

scripted dialogue turns marked by speaker labels and dialogue punctuation indicating conversational exchanges.

No Comments

No Known Activations

:

scripted dialogue turns marked by speaker labels and dialogue punctuation indicating conversational exchanges.

No Comments

No Known Activations