INDEX

Explanations

Interests and hobbies

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-yellow

-0.07

tg

-0.07

selectorMethod

-0.07

ක

-0.06

כשיו

-0.06

ใด

-0.06

_individual

-0.06

Preferred

-0.06

 Else

-0.06

 usado

-0.06

POSITIVE LOGITS

 eyebrows

0.07

.prob

0.07

美好的

0.07

unique

0.07

 Chronic

0.07

充分体现

0.07

vent

0.06

burg

0.06

 Поч

0.06

Activations Density 0.080%

Interests and hobbies

No Comments

No Known Activations

Interests and hobbies

No Comments

No Known Activations