INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

LOG

-0.08

 kidney

-0.07

灭亡

-0.07

萄

-0.07

 lying

-0.07

 Pokemon

-0.07

こともある

-0.07

ﺾ

-0.07

אחד

-0.07

 HttpStatusCode

-0.07

POSITIVE LOGITS

 infographic

0.08

symbol

0.08

ignore

0.08

 suggesting

0.08

菰

0.07

 symbol

0.07

scaled

0.07

(prefix

0.07

 attest

0.07

Activations Density 0.007%

No Known Activations