Explanations

mathematical operations

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard): Various

Features: 131,072

Data Type: float32

Hook Name: blocks.19.hook_resid_post

Architecture: standard

Context Size: 1,024

Dataset: monology/pile-uncopyrighted

Negative Logits

"-leading"  -0.08
"между"  -0.08
" temps"  -0.08
"_middle"  -0.07
"起点"  -0.07
"PSI"  -0.07
" ürün"  -0.06
"можно"  -0.06
" xét"  -0.06
"את"  -0.06

Positive Logits

"กระจ"  0.08
" unzip"  0.07
"摛"  0.07
"哔"  0.07
"잘"  0.07
"隗"  0.07
"(String"  0.07
"FFFFFFFF"  0.06
"ialect"  0.06
"lz"  0.06
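
Token lists like the two above are commonly obtained by projecting a feature's decoder direction through the model's unembedding matrix and sorting the resulting per-token scores. Whether this dashboard computes them exactly this way is an assumption. The sketch below uses small random arrays in place of real weights, and the names `w_dec_feature`, `w_unembed`, and `top_logit_tokens` are hypothetical.

```python
import numpy as np

# Toy stand-ins for real weights (assumption: random data, tiny dimensions).
rng = np.random.default_rng(0)
d_model, vocab_size = 16, 50
w_dec_feature = rng.normal(size=d_model)            # hypothetical decoder row for one feature
w_unembed = rng.normal(size=(d_model, vocab_size))  # hypothetical unembedding matrix


def top_logit_tokens(direction, unembed, k=10):
    """Return the k most negative and k most positive (token_id, score) pairs."""
    effects = direction @ unembed   # one score per vocabulary token
    order = np.argsort(effects)     # ascending: most negative first
    negative = [(int(i), float(effects[i])) for i in order[:k]]
    positive = [(int(i), float(effects[i])) for i in order[::-1][:k]]
    return negative, positive


negative, positive = top_logit_tokens(w_dec_feature, w_unembed, k=10)
```

With a real model, the token ids would be mapped back to strings with the tokenizer, which is where entries such as " unzip" or "(String" would come from.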

Activations Density 0.047%
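
An activation density is usually read as the fraction of sampled token positions on which the feature fires at all; 0.047% corresponds to roughly one token in 2,100. A minimal sketch of that calculation follows, assuming "fires" means a strictly positive activation. The toy array and the `activation_density` helper are illustrative and not taken from the dashboard.

```python
import numpy as np


def activation_density(acts):
    """Fraction of token positions where the activation is strictly positive."""
    acts = np.asarray(acts)
    return float(np.count_nonzero(acts > 0) / acts.size)


# Toy data (assumption): 100,000 token positions, 47 of them active.
acts = np.zeros(100_000, dtype=np.float32)
acts[:47] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")
```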
