INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

砌

-0.07

 track

-0.07

 וי

-0.07

dr

-0.07

ọc

-0.07

ustering

-0.07

oky

-0.07

ݥ

-0.07

----------------------------------------------------------------------

-0.07

POSITIVE LOGITS

Batman

0.07

PYTHON

0.07

_TLS

0.07

 авто

0.07

 но

0.07

Men

0.06

 Sabha

0.06

MISS

0.06

 Calculates

0.06

 бренд

0.06

Activations Density 0.023%

No Known Activations