
Explanations

Words ending in "us"

np_max-act · gemini-2.0-flash

The neuron activates on proper names (especially classical or mythological place/person names) ending in the “-ius” (or similar Latin/Greek) suffix.

oai_token-act-pair · o4-mini Triggered by @xinyanhu8


Configuration

andyrdt/saes-llama-3.1-8b-instruct/resid_post_layer_11/trainer_1

Dataset (Dashboard): Various
Features: 131,072
Data Type: float32
Hook Name: blocks.11.hook_resid_post
Architecture: standard
Context Size: 1,024
Dataset: monology/pile-uncopyrighted


Negative Logits

-0.07

’te

-0.07

'e

-0.07

Free

-0.07

DE

-0.07

en

-0.07

'E

-0.07

JE

-0.07

jící

-0.06

 Free

-0.06

Positive Logits

Token   Logit
us      0.16
ium     0.13
UM      0.13
um      0.12
US      0.12
ius     0.11
inus    0.11
enus    0.11
anium   0.11
acus    0.11
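The logit lists above can be read as the tokens this feature most promotes and most suppressed in the model's output. A minimal sketch of how such a ranking is commonly obtained, assuming the values come from projecting the feature's decoder direction through the model's unembedding matrix (the page itself does not state this). The function name `top_logit_tokens` and all toy values are hypothetical and are not taken from the model.

```python
import numpy as np

def top_logit_tokens(decoder_direction, unembed, vocab, k=2):
    """Return the k most promoted and k most suppressed tokens for one feature."""
    # Project the feature's decoder direction (d_model,) through the
    # unembedding matrix (d_model, vocab_size): one logit per vocabulary token.
    logits = decoder_direction @ unembed
    order = np.argsort(logits)  # ascending: most negative first
    positive = [(vocab[i], float(logits[i])) for i in order[::-1][:k]]
    negative = [(vocab[i], float(logits[i])) for i in order[:k]]
    return positive, negative

# Toy values, invented for illustration only (assumption, not model weights).
vocab = ["us", "ium", "en", "Free"]
unembed = np.array([[1.0, 0.8, -0.5, -0.4],
                    [0.0, 0.0, 0.0, 0.0]])
decoder_direction = np.array([0.16, 0.0])

positive, negative = top_logit_tokens(decoder_direction, unembed, vocab)
print("positive:", positive)
print("negative:", negative)
```

With the real model, `unembed` would have one column per vocabulary token and `decoder_direction` would be this feature's row of the SAE decoder; both are assumptions about the setup, not details given on the page.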

Activations Density 0.066%
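To put the density figure in context: assuming "Activations Density" means the fraction of tokens on which the feature has a nonzero activation (the page does not define it), 0.066% with a context size of 1,024 works out to roughly 0.68 active tokens per context on average. A minimal sketch of that arithmetic; the helper `activation_density` and the toy activations are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)

# Values reported on the page.
reported_density = 0.066 / 100   # 0.066%
context_size = 1024

# Expected number of active tokens in one full context (assumes the
# density definition stated above).
expected_active_tokens = reported_density * context_size

# Toy activations, invented for illustration: one active token out of four.
toy_density = activation_density([0.0, 0.0, 1.3, 0.0])

print(f"expected active tokens per context: {expected_active_tokens:.3f}")
print(f"toy density: {toy_density:.2%}")
```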
