INDEX

Explanations

contrast or opposition

The neuron flags common “function” words—small grammatical tokens like pronouns (I, my), conjunctions (and), modals/auxiliaries (will, might, has), and adverbs (just, along).

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

" only"             -1.16
" keeps"            -0.95
"so"                -0.93
" brings"           -0.91
"Carcinogenicity"   -0.90
"Mutagenicity"      -0.89
" ALWAYS"           -0.86
" ensures"          -0.86
" just"             -0.84
"Urls"              -0.84

Positive Logits

" แต่"               1.46
" даже"             1.45
" nhưng"            1.44
" nawet"            1.44
"but"               1.42
" zelfs"            1.28
" certamente"       1.25
" навіть"           1.24
" incluso"          1.24
" لكن"              1.23

Activations Density 0.042%
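
To put the density figure in perspective, the sketch below estimates how many token positions it corresponds to. It assumes that the density is the share of token positions with a nonzero activation and that it was measured over the dashboard prompt set listed under Configuration; neither assumption is stated on this page, and the function name is hypothetical.

```python
# Figures copied from this page.
NUM_PROMPTS = 24_576
TOKENS_PER_PROMPT = 128
ACTIVATION_DENSITY_PERCENT = 0.042


def estimated_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total token positions, estimated positions where the feature fires).

    Assumption: density is the percentage of token positions with a nonzero
    activation, computed over exactly this prompt set.
    """
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = total_tokens * density_percent / 100.0
    return total_tokens, active_tokens


total, active = estimated_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, ACTIVATION_DENSITY_PERCENT
)
print(f"total token positions: {total:,}")
print(f"estimated active positions: {active:,.0f}")
```

Under these assumptions the feature fires on roughly 1,300 of about 3.1 million token positions, which fits a sparse feature tied to a narrow set of contrastive words.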