Explanations

except as otherwise when

The neuron fires on words that signal exclusion or negation, such as “except,” “unless,” “not,” and “without,” which mark exception clauses or negative qualifiers.

Configuration

Prompts (Dashboard): 24,576 prompts, 128 tokens each

Dataset (Dashboard): monology/pile-uncopyrighted

Negative Logits

Token                  Logit
"dienne"               -1.03
"ounces"               -0.99
"très"                 -0.97
" vôtre"               -0.96
"licitations"          -0.96
" doivent"             -0.95
"ľov"                  -0.94
"oplasmic"             -0.94
" malheureusement"     -0.94
"OnAdd"                -0.93

Positive Logits

Token                  Logit
" prescribed"          1.16
" previously"          1.15
"if"                   1.13
" black"               1.10
" ausdrücklich"        1.08
" 可以"                1.03
" according"           1.02
" accepted"            1.02
" aquellos"            1.01
" traditionally"       1.00

Activations Density 0.051%
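
To give a sense of scale for the density figure, here is a minimal sketch that converts it into an approximate token count. It assumes, and this page does not state it, that "Activations Density" is the fraction of dashboard tokens on which the feature has a nonzero activation. The function name estimated_active_tokens is hypothetical and used only for illustration.

```python
# Assumption (not stated on the page): density = fraction of dashboard
# tokens with a nonzero activation for this feature.
NUM_PROMPTS = 24_576        # from the Configuration section
TOKENS_PER_PROMPT = 128     # from the Configuration section
DENSITY_PERCENT = 0.051     # from "Activations Density"


def estimated_active_tokens(num_prompts, tokens_per_prompt, density_percent):
    """Return (total tokens, approximate number of activating tokens)."""
    total_tokens = num_prompts * tokens_per_prompt
    active_tokens = round(total_tokens * density_percent / 100)
    return total_tokens, active_tokens


total_tokens, active_tokens = estimated_active_tokens(
    NUM_PROMPTS, TOKENS_PER_PROMPT, DENSITY_PERCENT
)
print(f"total tokens: {total_tokens:,}")
print(f"approx. activating tokens: {active_tokens:,}")
```

Under that assumption, the feature would be active on roughly 1,600 of about 3.1 million tokens. The displayed percentage is rounded, so the estimate is approximate.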