Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
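
For scripting against these settings, the configuration can be captured in a small record. This is a minimal sketch, not the API of the dashboard or of any SAE library: the `SaeConfig` class and its field names are hypothetical, and reading the number in the hook name as the layer index is an assumption inferred from the `resid_post_layer_23` component of the identifier.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container for the values listed on the dashboard."""
    sae_id: str
    n_features: int
    dtype: str
    hook_name: str
    architecture: str
    context_size: int
    dataset: str

    def layer(self) -> int:
        # Assumes hook names follow the pattern "blocks.<layer>.<hook>".
        match = re.fullmatch(r"blocks\.(\d+)\.\w+", self.hook_name)
        if match is None:
            raise ValueError(f"unexpected hook name: {self.hook_name!r}")
        return int(match.group(1))


config = SaeConfig(
    sae_id="andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1",
    n_features=131_072,
    dtype="float32",
    hook_name="blocks.23.hook_resid_post",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

print(config.layer())  # 23
```

Freezing the dataclass keeps the record read-only, which suits values that are copied from a published configuration rather than computed.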

Negative Logits

 Computer     -0.07
quit          -0.07
auc           -0.07
 Compiler     -0.07
 compiling    -0.07
 veut         -0.07
 }}"></       -0.07
 Jacob        -0.07
 halluc       -0.07
 Simpsons     -0.07

Positive Logits

 passionately  0.08
SEARCH         0.07
gage           0.07
≴              0.06
 표현          0.06
∜              0.06
posta          0.06
ità            0.06
康养           0.06
perf           0.06

Activations Density 0.004%
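
To give a sense of scale for the density figure, the arithmetic below converts it into expected counts. It is a rough sketch that assumes "density" means the fraction of tokens on which the feature has a nonzero activation; the page does not define the term, and the variable names are illustrative.

```python
density_percent = 0.004   # from the dashboard
context_size = 1_024      # tokens per context, from the configuration

density = density_percent / 100          # fraction of tokens
tokens_per_activation = 1 / density      # average gap between activations
expected_per_context = density * context_size

print(f"about 1 activation per {tokens_per_activation:,.0f} tokens")
print(f"about {expected_per_context:.3f} activations per {context_size}-token context")
```

Under that assumption the feature fires roughly once every 25,000 tokens, or about 0.041 times per 1,024-token context.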

No Known Activations