Explanations

social justice issues

np_max-act · gemini-2.0-flash

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_7/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.7.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted

Negative Logits

Token               Logit
"-Muslim"           -0.07
"wifi"              -0.07
" köln"             -0.07
".string"           -0.06
" culo"             -0.06
"(grammarAccess"    -0.06
" vieille"          -0.06
" 정책"             -0.06
" realms"           -0.06
" تی"               -0.06

Positive Logits

Token               Logit
" belir"            0.06
" inform"           0.06
" safely"           0.06
"CLL"               0.06
" barred"           0.06
"/check"            0.06
" fortune"          0.06
"_BAR"              0.05
" Judicial"         0.05
".Sign"             0.05

Activations Density: 0.314%

No Known Activations