INDEX
Explanations
This neuron detects bibliographic citation markers and reference labels in academic text.
Negative Logits
scape       -0.06
wanted      -0.06
nab         -0.06
urinary     -0.06
termin      -0.06
strap       -0.06
shady       -0.06
sinful      -0.06
ulus        -0.06
ranked      -0.06
Positive Logits
See           0.07
MethodBeat    0.06
READ          0.06
Від           0.06
.defer        0.06
ö             0.06
больш         0.06
^{°}          0.06
LAS           0.06
.ReadToEnd    0.06
Activations Density 0.001%