INDEX

Explanations

ThreadLocal

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_15/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.15.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

 councillor    -0.07
 bumps         -0.07
(iv            -0.07
 entire        -0.07
[h             -0.07
 facilitated   -0.07
_REQUIRE       -0.07
 wrongdoing    -0.06
Nature         -0.06
 outros        -0.06

Positive Logits

 사건    0.07
さい    0.06
Colorado    0.06
 companyId    0.06
 기업    0.06
 MethodInvocation    0.06
 Sosyal    0.06
말    0.06
شنامه    0.06
 červ    0.05

Activations Density: 0.005%

No Known Activations
