INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

-basket

-0.07

 Pokémon

-0.07

Austin

-0.07

asks

-0.06

وم

-0.06

 Billy

-0.06

 potassium

-0.06

 Bronx

-0.06

 LGBTQ

-0.06

POSITIVE LOGITS

 เพ

0.07

*.

0.07

'/

0.06

天

0.06

Jeb

0.06

 --------------------------------------------------------------------------------

0.06

:'',↵

0.06

(gc

0.06

.lu

0.06

lásil

0.06

Activations Density 0.134%

No Known Activations