INDEX
Explanations
Procrastination
This neuron detects the subword “procrast” (as in “procrastination”).
New Auto-Interp
Negative Logits
servicio
-0.07
朋
-0.06
독
-0.06
nack
-0.06
.locals
-0.06
detection
-0.06
metabolism
-0.06
TZ
-0.06
descent
-0.06
GitHub
-0.06
POSITIVE LOGITS
crast
0.10
procrast
0.10
fin
0.06
.↵↵↵
0.06
xhr
0.06
olut
0.06
").↵↵
0.06
ertia
0.06
(Constants
0.06
Kasich
0.06
Activations Density 0.002%