INDEX

Explanations

No Explanations Found

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.15.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Vehicle

-0.07

_ter

-0.07

omer

-0.06

。」↵↵

-0.06

hetto

-0.06

.loaded

-0.06

」↵↵

-0.06

 cambio

-0.06

.NULL

-0.06

 cheaper

-0.06

POSITIVE LOGITS

 monstrous

0.06

 KIND

0.06

Comparator

0.06

جم

0.06

 Spinner

0.05

(render

0.05

�

0.05

 mathematical

0.05

 shielding

0.05

Activations Density 0.000%

No Comments

No Known Activations

No Comments

No Known Activations