INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

.Aggressive

-0.07

otch

-0.07

 obliv

-0.07

_SUFFIX

-0.07

 Geek

-0.07

.More

-0.07

 astr

-0.07

.ISupportInitialize

-0.07

计较

-0.07

ORIZONTAL

-0.07

POSITIVE LOGITS

 Validation

0.08

 prevalence

0.08

准确

0.07

赞同

0.07

志愿服务

0.07

Logo

0.07

궕

0.06

門

0.06

_extraction

0.06

Activations Density 0.012%

No Known Activations