INDEX

Explanations

mental health

np_max-act · gemini-2.0-flash

Text discussing mental health issues (especially depression and suicidal risk) and related help/resources.

oai_token-act-pair · gpt-5-mini Triggered by @vetterc0

New Auto-Interp

Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard)

Various

Features

131,072

Data Type

float32

Hook Name

blocks.11.hook_resid_post

Architecture

standard

Context Size

1,024

Dataset

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

interp

-0.07

usters

-0.07

IENT

-0.06

 opera

-0.06

.config

-0.06

 scripture

-0.06

Mixed

-0.06

chg

-0.06

 Educ

-0.06

 Mend

-0.06

POSITIVE LOGITS

官方

0.07

_IMAGE

0.07

xc

0.07

Om

0.06

 marvelous

0.06

<Location

0.06

 оскільки

0.06

.DateFormat

0.06

-basic

0.06

(savedInstanceState

0.06

Activations Density 0.172%

mental health

Text discussing mental health issues (especially depression and suicidal risk) and related help/resources.

No Comments

No Known Activations