INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 isChecked

-0.07

 colonies

-0.07

 professional

-0.07

道德

-0.07

يمن

-0.07

ø

-0.07

ropolitan

-0.07

fname

-0.07

 בעיר

-0.07

 Cuisine

-0.07

POSITIVE LOGITS

❅

0.08

Mix

0.08

 =>
↵

0.07

Hub

0.07

MIX

0.06

 units

0.06

unm

0.06

 teamed

0.06

 çer

0.06

 במהל

0.06

Activations Density 0.867%

No Comments

No Known Activations