Explanations

biographies of singers

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.27.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token             Logit
" фах"            -0.07
"igraphy"         -0.07
"ainless"         -0.07
"젤"              -0.06
"ього"            -0.06
"ano"             -0.06
"igraph"          -0.06
" najle"          -0.06
"едь"             -0.06
" calibration"    -0.06

Positive Logits

Token             Logit
".getMessage"     0.06
"Rachel"          0.06
"ECM"             0.06
".Un"             0.06
" streets"        0.06
" ресур"          0.06
"teachers"        0.06
" YYSTACK"        0.06
" був"            0.06
"선"              0.06

Activations Density: 0.018%

No Known Activations
