INDEX

Explanations

No Explanations Found

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

"OPC"        -0.07
" teaches"   -0.06
"آ"          -0.06
" every"     -0.06
"axy"        -0.06
" dentist"   -0.06
" Practices" -0.06
" acción"    -0.06
" Barry"     -0.06
" surgeons"  -0.06

Positive Logits

"ــ"          0.07
" allerdings" 0.07
" także"      0.06
"ukkan"       0.06
"vd"          0.06
"bookmark"    0.06
"ker"         0.06
"iv"          0.06
"zdy"         0.06
"ducers"      0.06

Activations Density 0.000%

No Known Activations
