INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_19/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.19.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

ังจาก

-0.06

 commodity

-0.06

нути

-0.06

ANA

-0.06

 También

-0.06

 movie

-0.06

 leakage

-0.06

請

-0.06

 Nuevo

-0.06

 ngon

-0.06

POSITIVE LOGITS

 /*#__

0.07

(',');↵

0.06

.teacher

0.06

.eulerAngles

0.06

 уст

0.06

(sn

0.06

.editor

0.06

铁

0.06

_clock

0.06

 supern

0.06

Activations Density 0.006%

No Comments

No Known Activations