INDEX

Explanations

badlogic game library

np_max-act · gemini-2.0-flash

tokens that mark conversation/control boundaries (e.g., special <|im_start|> / role/instruction markers)

oai_token-act-pair · gpt-5-mini Triggered by @vetterc0

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token    Logit
💮    -0.08
 **/↵    -0.07
)/    -0.07
 תנ    -0.07
(&___    -0.07
颡    -0.07
zbollah    -0.07
.setStyle    -0.07
 sama    -0.07

Positive Logits

Token    Logit
�    0.08
Bü    0.07
偶然    0.07
_miss    0.06
issors    0.06
历史    0.06
kho    0.06
postalcode    0.06
(numbers    0.06
Cor    0.06

Activations Density 0.093%
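
To put the density figure in perspective, the short sketch below converts it into an expected number of active tokens per context window. It assumes that "activations density" means the fraction of token positions on which the feature has a nonzero activation; the page does not define the term, so treat that reading as an assumption. The function names are illustrative only and do not belong to any library.

```python
# Assumption: density = fraction of token positions where the feature fires.
DENSITY_PERCENT = 0.093  # "Activations Density 0.093%" from this page
CONTEXT_SIZE = 1024      # "Context Size 1,024" from the configuration


def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of positions with a nonzero activation among n_tokens."""
    return density_percent / 100.0 * n_tokens


def tokens_per_activation(density_percent: float) -> float:
    """Average number of tokens seen per single activation."""
    return 100.0 / density_percent


if __name__ == "__main__":
    per_context = expected_active_tokens(DENSITY_PERCENT, CONTEXT_SIZE)
    spacing = tokens_per_activation(DENSITY_PERCENT)
    print(f"expected active tokens per context: {per_context:.2f}")
    print(f"roughly one activation every {spacing:.0f} tokens")
```

Under that assumption the feature would fire on a little under one token per 1,024-token context, or about once every 1,075 tokens.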

No Known Activations
