Explanations

future tense/possibility

np_max-act · gemini-2.0-flash

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1
Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token         Logit
" these"      -0.08
" this"       -0.08
"اذا"         -0.07
" that"       -0.07
" nemoc"      -0.07
" THAT"       -0.07
"์ท"          -0.07
" fathers"    -0.07
"that"        -0.07
" These"      -0.07

Positive Logits

Token          Logit
" glEnable"    0.07
" такая"       0.06
" Spiritual"   0.06
" stroke"      0.06
"会"           0.06
" Portug"      0.06
"ox"           0.06
" Nearly"      0.06
"也是"         0.06
"EPS"          0.06

Activations Density 0.446%
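
As a rough illustration of how a density figure like the one above could be computed, the sketch below assumes that activation density means the fraction of token positions at which the feature's activation is above zero. That definition, the function name `activation_density`, and the toy data are assumptions for illustration, not something taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy data: 1,000 token positions, 4 of which activate the feature.
toy_activations = [0.0] * 996 + [0.3, 1.2, 0.7, 2.1]
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.446% would correspond to the feature firing on roughly 4 to 5 of every 1,000 token positions.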

No Known Activations
