Explanations

rebellion

np_max-act-logits · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.27.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 зак	-0.07
_major	-0.06
 số	-0.06
 expressed	-0.06
 soaring	-0.06
管	-0.06
 přičemž	-0.06
 radiator	-0.06
fiber	-0.06
/spec	-0.06

Positive Logits

上传	0.06
ře	0.06
	work	0.06
.Connection	0.06
'];?>	0.06
 acqu	0.06
Content	0.06
ultipartFile	0.06
 Catholics	0.06
ออกแบบ	0.06

Activations Density 0.105%

rebellion

No Known Activations