INDEX

Explanations

prefixes followed by specific words

The neuron reliably activates on proper names and other named-entity tokens—especially author or contributor names in bylines and citations.

New Auto-Interp

Configuration

Prompts (Dashboard)

392,802 prompts, 256 tokens each

Dataset (Dashboard)

monology/pile-uncopyrighted

Embeds

IFrame

Link

Not in Any Lists

Negative Logits

0.51

0.49

0.46

AND

0.46

 Messi

0.45

 Skywalker

0.45

 gaan

0.44

 Hadid

0.44

POSITIVE LOGITS

杍

0.41

躹

0.39

竝

0.38

нициа

0.38

indazol

0.38

急速

0.37

靔

0.35

夻

0.34

pulsewidth

0.34

कार्यक्रम

0.34

Activations Density 0.521%