INDEX

Explanations

consensual

np_max-act · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.7.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

UserInfo

-0.07

.getCurrent

-0.07

 großen

-0.07

ittance

-0.07

znik

-0.06

 fatigue

-0.06

 resistance

-0.06

 Flores

-0.06

 binder

-0.06

 Baum

-0.06

POSITIVE LOGITS

ensual

0.10

 ����

0.07

 rozsah

0.07

 рос

0.06

_globals

0.06

 Sofa

0.06

 Louis

0.06

 بال

0.06

 painful

0.06

يمكن

0.06

Activations Density 0.003%

consensual

No Comments

No Known Activations

consensual

No Comments

No Known Activations