Explanations

European parliament debates

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.19.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"磕"  -0.07
".unpack"  -0.07
" nestled"  -0.07
"写字楼"  -0.07
" start"  -0.07
" wearing"  -0.06
" valley"  -0.06
"컵"  -0.06
"𝖙"  -0.06
"回味"  -0.06

Positive Logits

" 현재"  0.07
" Increase"  0.07
"ທ"  0.07
"_PAR"  0.07
" Shift"  0.06
"Bei"  0.06
"Volt"  0.06
" Samurai"  0.06
" brawl"  0.06
"bara"  0.06

Activations Density: 0.009%
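
As a rough illustration of what a density of 0.009% means, the sketch below assumes that density is the fraction of token positions at which the feature's activation is above zero. That definition, the function name, and the synthetic data are assumptions for illustration; the page itself does not define the metric.

```python
import numpy as np

def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    acts = np.asarray(activations)
    return float((acts > threshold).mean())

# Synthetic example: the feature fires on 9 of 100,000 token positions.
acts = np.zeros(100_000)
acts[:9] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```

Under this assumed definition, a density of 0.009% corresponds to the feature firing on roughly 9 of every 100,000 tokens.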

No Known Activations
