INDEX
Explanations
fragment
This neuron detects occurrences of the word “fragment” (in any form or morphological variant, e.g., Fragment, fragments, fragmentation).
New Auto-Interp
Negative Logits
initely
-0.07
ouis
-0.07
(on
-0.07
ponential
-0.07
Lee
-0.07
こそ
-0.07
oi
-0.06
Louise
-0.06
Cincinnati
-0.06
Russell
-0.06
POSITIVE LOGITS
fragments
0.10
frag
0.10
Frag
0.09
fragment
0.09
frag
0.09
fragmentation
0.08
fragile
0.08
fragmented
0.08
Fragment
0.08
/fr
0.08
Activations Density 0.009%