Explanations

No Explanations Found

Configuration

andyrdt/saes-qwen2.5-7b-instruct/resid_post_layer_23/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

No Comments

Negative Logits

Token          Logit
"employed"     -0.07
" courtesy"    -0.07
" Scratch"     -0.07
"zoom"         -0.07
" מבחינת"      -0.06
".tie"         -0.06
"突如"         -0.06
"Input"        -0.06
"鳍"           -0.06
" correct"     -0.06

Positive Logits

Token          Logit
" много"       0.07
" banco"       0.07
"converter"    0.07
"sender"       0.07
" haute"       0.07
" وعد"         0.07
"hyper"        0.07
">`"           0.06
"_ud"          0.06
"Lê"           0.06

Activations Density: 0.046%
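
To give a rough sense of scale for the density figure, the sketch below converts it into an expected number of active tokens per context window. It assumes the density is the fraction of tokens on which the feature has a nonzero activation, and that the 1,024-token context size from the configuration is the relevant window. Neither assumption is stated on the page, and the function name is hypothetical.

```python
# Values copied from the dashboard.
ACTIVATION_DENSITY_PERCENT = 0.046
CONTEXT_SIZE = 1024


def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected count of tokens on which the feature fires.

    Assumes density is the per-token firing rate, given in percent.
    """
    return density_percent / 100.0 * n_tokens


# Expected firings within one context window of CONTEXT_SIZE tokens.
per_context = expected_active_tokens(ACTIVATION_DENSITY_PERCENT, CONTEXT_SIZE)

# Average spacing between firings, in tokens (reciprocal of the rate).
tokens_per_activation = 100.0 / ACTIVATION_DENSITY_PERCENT

print(f"{per_context:.3f} expected active tokens per {CONTEXT_SIZE}-token context")
print(f"about one activation every {tokens_per_activation:.0f} tokens")
```

Under these assumptions the feature fires on fewer than one token per context on average, which would make it a sparse feature.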

No Known Activations