INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

Nur

-0.08

ROS

-0.08

.BUTTON

-0.07

 Riyadh

-0.07

_And

-0.07

veh

-0.07

 ihtiy

-0.07

-No

-0.07

瑧

-0.07

.Attribute

-0.07

POSITIVE LOGITS

},

0.07

จำนวน

0.07

uar

0.07

ose

0.07

中小企业

0.07

منذ

0.07

 חמ

0.07

 scientists

0.06

 //////////////////////////////////////////////////////////////////////////

0.06

Activations Density 0.000%

No Known Activations