INDEX

Explanations

Advice/personal stories

np_max-act · gemini-2.0-flash

sentences or phrases that give advice, recommendations, or ask/pose questions addressed to the reader (second‑person directives and conditionals).

oai_token-act-pair · gpt-5-mini Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

メッセージ

-0.07

史上

-0.07

士兵

-0.07

 Willis

-0.07

desktop

-0.06

ทหาร

-0.06

 Hash

-0.06

qi

-0.06

貴

-0.06

 Thai

-0.06

POSITIVE LOGITS

，并

0.07

sla

0.07

faker

0.07

 nets

0.07

itledBorder

0.07

")==

0.07

漫

0.07

更好地

0.07

黢

0.07

<<<<<<<<

0.07

Activations Density 0.222%

Advice/personal stories

sentences or phrases that give advice, recommendations, or ask/pose questions addressed to the reader (second‑person directives and conditionals).

No Comments

No Known Activations

Advice/personal stories

sentences or phrases that give advice, recommendations, or ask/pose questions addressed to the reader (second‑person directives and conditionals).

No Comments

No Known Activations