INDEX

Explanations

computer code/system info

np_max-act-logits · gemini-2.0-flash

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_27/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.27.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

đ

-0.07

 kiểu

-0.07

Checkout

-0.06

bots

-0.06

.inst

-0.06

 कब

-0.06

-п

-0.06

 quoting

-0.06

 anyways

-0.06

ว

-0.06

POSITIVE LOGITS

erts

0.08

وزيع

0.06

++)

0.06

 žena

0.06

(errno

0.06

	freopen

0.06

 Федера

0.06

\[

0.06

 strncpy

0.06

역

0.06

Activations Density 0.287%

computer code/system info

No Comments

No Known Activations

computer code/system info

No Comments

No Known Activations