INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.23.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

below

-0.07

爬

-0.07

↵

-0.07

dat

-0.07

孩童

-0.07

 parted

-0.07

 total

-0.07

Categories

-0.07

cup

-0.07

POSITIVE LOGITS

[js

0.08

Instr

0.08

安全管理

0.08

 השא

0.07

Laf

0.07

 Renderer

0.07

פעל

0.07

もし

0.07

Propagation

0.07

_Renderer

0.07

Activations Density 0.006%

No Known Activations