INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

_dll

-0.07

(runtime

-0.07

 ofrec

-0.07

 domicile

-0.07

 Increase

-0.07

🍸

-0.07

 ladder

-0.07

史诗

-0.07

ライブ

-0.06

POSITIVE LOGITS

nested

0.08

 neighbor

0.08

 neighboring

0.07

-neck

0.07

 khác

0.07

ingroup

0.07

汉族

0.07

ʶ

0.07

 neighbors

0.07

Neg

0.07

Activations Density 0.007%

No Known Activations