Explanations

Programming languages

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
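
The settings above can be gathered into a single record for scripting. The following is a minimal sketch, not an official loader: `SaeConfig` is a hypothetical dataclass invented for illustration, and the residual stream width of 4,096 for Llama 3.1 8B is an assumption that does not appear on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical container for the dashboard settings listed above."""
    sae_id: str
    n_features: int
    dtype: str
    hook_name: str
    architecture: str
    context_size: int
    dataset: str


config = SaeConfig(
    sae_id="andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1",
    n_features=131_072,
    dtype="float32",
    hook_name="blocks.7.hook_resid_post",
    architecture="standard",
    context_size=1_024,
    dataset="monology/pile-uncopyrighted",
)

# Assumption: hidden size of Llama 3.1 8B (not stated on this page).
D_MODEL = 4_096

# Ratio of dictionary size to residual stream width.
expansion_factor = config.n_features / D_MODEL

# Layer index parsed from a hook name of the form "blocks.<layer>.hook_resid_post".
layer = int(config.hook_name.split(".")[1])

print(f"layer={layer}, expansion_factor={expansion_factor:g}")
```

Under the stated width assumption, the dictionary is 32 times wider than the residual stream it reads from, and the parsed layer index agrees with the `resid_post_layer_7` part of the configuration ID.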

Negative Logits

 Pemb       -0.08
 Deaths     -0.07
 riot       -0.07
Ts          -0.07
 witty      -0.07
 folders    -0.07
 hack       -0.07
 indices    -0.07
eko         -0.06
etzt        -0.06

Positive Logits

(bot            0.07
SerializedName  0.06
 trouvé         0.06
Keeping         0.06
idepress        0.06
"+              0.06
 diferentes     0.06
_L              0.06
τευ             0.06
_perc           0.06

Activations Density: 0.086%
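
To give the density figure a concrete scale, here is a small arithmetic sketch. It assumes the density is the fraction of sampled tokens on which the feature has a nonzero activation (the page does not define it), and it uses the context size of 1,024 listed under Configuration.

```python
# Values copied from the page.
density_percent = 0.086
context_size = 1_024

# Assumption: density is the fraction of tokens with a nonzero activation.
density = density_percent / 100

# Expected number of activating tokens in one full-length context.
expected_active_tokens = density * context_size

# Average number of tokens between activations.
tokens_per_activation = 1 / density

print(f"{expected_active_tokens:.2f} active tokens per context")
print(f"one activation per ~{tokens_per_activation:.0f} tokens")
```

Under that reading, the feature fires on fewer than one token per 1,024-token context on average, which is consistent with a sparse, narrowly targeted feature.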

No Known Activations