INDEX

Explanations

Technology/internet/software

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

CCP

-0.07

مصر

-0.07

危

-0.07

_CLICKED

-0.07

 Dich

-0.07

 cliff

-0.07

 sâu

-0.07

 SECTION

-0.06

白白

-0.06

มน

-0.06

POSITIVE LOGITS

)+"

0.07

?>>

0.07

 StringTokenizer

0.07

SUV

0.07

[++

0.07

oooooooo

0.06

Observer

0.06

/domain

0.06

pretty

0.06

 לטובת

0.06

Activations Density 0.009%

Technology/internet/software

No Comments

No Known Activations

Technology/internet/software

No Comments

No Known Activations