Explanations

No Explanations Found

Configuration

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.23.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
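
For scripting against this feature, the configuration values above could be held in a small record. This is a minimal sketch: the class `SaeConfig`, its field names, and the `hook_layer` helper are hypothetical and do not come from any library; only the values are copied from the listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    # Field names are illustrative assumptions; values come from the listing above.
    n_features: int
    dtype: str
    hook_name: str
    architecture: str
    context_size: int
    dataset: str

    def hook_layer(self) -> int:
        # Parse the block index out of a hook name such as "blocks.23.hook_resid_post".
        return int(self.hook_name.split(".")[1])


config = SaeConfig(
    n_features=131_072,
    dtype="float32",
    hook_name="blocks.23.hook_resid_post",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)
```

Calling `config.hook_layer()` recovers the block index from the hook name, which is convenient when grouping features by layer.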

Negative Logits

联手

-0.08

rm

-0.07

fn

-0.07

 resulted

-0.07

 bourbon

-0.07

SELECT

-0.06

 sürec

-0.06

sys

-0.06

");

-0.06

样子

-0.06

Positive Logits

 libr

0.07

 confidentiality

0.07

nets

0.07

 Trinity

0.07

ADOW

0.06

_VARS

0.06

itional

0.06

 credits

0.06

 الض

0.06
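
The two logit listings above alternate a token line with a value line. A minimal sketch of reading that layout into (token, value) pairs follows, assuming blank lines only separate entries and that leading whitespace belongs to the token. `parse_logit_pairs` is a hypothetical helper, not part of any library.

```python
def parse_logit_pairs(text: str) -> list[tuple[str, float]]:
    # Drop blank separator lines but keep leading spaces on the rest,
    # since " bourbon" and "bourbon" are different tokens.
    # Caveat: a token consisting only of whitespace would be dropped here.
    lines = [ln for ln in text.split("\n") if ln.strip()]
    if len(lines) % 2 != 0:
        raise ValueError("expected alternating token/value lines")
    return [(lines[i], float(lines[i + 1])) for i in range(0, len(lines), 2)]


# A short excerpt in the same layout as the Negative Logits listing.
sample = "rm\n\n-0.07\n\n bourbon\n\n-0.07\n\nSELECT\n\n-0.06\n"
pairs = parse_logit_pairs(sample)
```

The odd-length check matters for listings like these: an entry whose token line is missing would otherwise shift every later token onto the wrong value.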

Activations Density: 0.001%

No Known Activations
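
To put the density figure in perspective, the arithmetic below estimates how often the feature would fire per context window. It assumes that activation density means the fraction of tokens on which the feature is nonzero, which this page does not state explicitly; `expected_active_tokens` is a hypothetical helper.

```python
def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    # Assumption: density is the share of tokens with a nonzero activation.
    return density_percent / 100.0 * n_tokens


CONTEXT_SIZE = 1024      # context size from the configuration
DENSITY_PERCENT = 0.001  # activation density from the listing

# Expected number of active tokens in one full context window.
per_context = expected_active_tokens(DENSITY_PERCENT, CONTEXT_SIZE)

# Average number of full contexts needed to see one activation.
contexts_per_activation = 1 / per_context
```

Under that assumption the feature fires on roughly one token in 100,000, so on average close to a hundred full contexts pass between activations. That is at least consistent with the page showing no known activations.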