Explanations

military service

np_max-act · gemini-2.0-flash

This neuron detects references to LGBTQ identities (gay, lesbian, bisexual, transgender) in the context of military service.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8

Configuration: andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted
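
The configuration above describes a sparse autoencoder (SAE) with 131,072 features reading the residual stream at blocks.11.hook_resid_post. As a rough illustration of what a "standard" architecture usually means in SAE tooling (a single ReLU encoder layer followed by a linear decoder, which is an assumption here since the page does not spell it out), the sketch below runs one forward pass on toy-sized weights. The function names, toy dimensions, and random weights are all hypothetical; a real run would load the trained weights at the model's residual width.

```python
import random


def matvec(matrix, vector):
    """Multiply a row-major matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def sae_forward(x, w_enc, b_enc, w_dec, b_dec):
    """One forward pass of a ReLU SAE (assumed 'standard' form).

    w_enc has n_features rows of length d_model;
    w_dec has d_model rows of length n_features.
    Returns (feature_activations, reconstruction).
    """
    # Subtract the decoder bias before encoding (a common convention).
    centered = [xi - bi for xi, bi in zip(x, b_dec)]
    pre_acts = [p + b for p, b in zip(matvec(w_enc, centered), b_enc)]
    # ReLU keeps only positive pre-activations, which yields sparsity.
    feats = [max(0.0, p) for p in pre_acts]
    recon = [r + b for r, b in zip(matvec(w_dec, feats), b_dec)]
    return feats, recon


# Toy sizes for illustration only; the dashboard's SAE has 131,072 features.
D_MODEL, N_FEATURES = 4, 8
rng = random.Random(0)
w_enc = [[rng.uniform(-1, 1) for _ in range(D_MODEL)] for _ in range(N_FEATURES)]
w_dec = [[rng.uniform(-1, 1) for _ in range(N_FEATURES)] for _ in range(D_MODEL)]
b_enc = [0.0] * N_FEATURES
b_dec = [0.0] * D_MODEL

x = [rng.uniform(-1, 1) for _ in range(D_MODEL)]  # stand-in residual vector
feats, recon = sae_forward(x, w_enc, b_enc, w_dec, b_dec)
print(len(feats), len(recon))
```

Each entry of `feats` corresponds to one SAE feature; a feature page like this one documents a single index in that vector.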

Negative Logits

Token | Logit
 nuevo | -0.08
즈 | -0.07
would | -0.07
 burge | -0.06
asmus | -0.06
ASF | -0.06
Would | -0.06
Stand | -0.06
.te | -0.06

Positive Logits

Token | Logit
 Differences | 0.07
[],↵ | 0.06
 тільки | 0.06
 []);↵ | 0.06
 drama | 0.06
<<" | 0.06
(handles | 0.06
 reckon | 0.06
 Figures | 0.06
 gifts | 0.06
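
Logit lists like the ones above are commonly produced by projecting a feature's decoder direction through the model's unembedding matrix, then reporting the most negative and most positive tokens. Whether this dashboard computes them exactly that way is an assumption; the sketch below shows the idea with a hypothetical four-token vocabulary and made-up weights.

```python
def logit_effects(direction, unembed, vocab, k=3):
    """Rank tokens by the dot product of a feature direction with each
    token's unembedding row.

    direction: list of d_model floats (the feature's decoder direction)
    unembed:   one row of d_model floats per vocabulary token
    Returns (most_negative, most_positive), each a list of (token, score).
    """
    scores = [sum(d * u for d, u in zip(direction, row)) for row in unembed]
    ranked = sorted(zip(vocab, scores), key=lambda pair: pair[1])
    most_negative = ranked[:k]
    most_positive = ranked[-k:][::-1]  # largest score first
    return most_negative, most_positive


# Hypothetical toy data, not taken from the dashboard.
vocab = ["a", "b", "c", "d"]
unembed = [[0.5, 0.0], [-0.2, 1.0], [0.1, 3.0], [-0.9, 0.0]]
direction = [1.0, 0.0]

negative, positive = logit_effects(direction, unembed, vocab, k=1)
print(negative, positive)
```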

Activations Density: 0.003%
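
An activation density of 0.003% is usually read as the fraction of sampled tokens on which the feature has a nonzero activation, which works out to roughly 3 tokens per 100,000. That reading is an assumption about how the dashboard defines density, and the helper names below are hypothetical.

```python
def activation_density(activations):
    """Fraction of positions where the feature activation is above zero."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > 0)
    return active / len(activations)


def expected_active_tokens(density_percent, n_tokens):
    """Convert a density given in percent into an expected token count."""
    return density_percent / 100.0 * n_tokens


# Toy activations: one active position out of four.
toy_activations = [0.0, 0.0, 1.5, 0.0]
print(activation_density(toy_activations))

# The dashboard's 0.003% applied to a hypothetical 100,000-token sample.
print(expected_active_tokens(0.003, 100_000))
```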
