INDEX
Explanations
references to clothing or items that are worn
New Auto-Interp
Negative Logits
(',')-0.84
Cousins
-0.77
hasMoreElements
-0.73
Plummer
-0.73
decisivo
-0.72
Callum
-0.70
Slu
-0.70
Kristine
-0.68
来不及
-0.68
pergillus
-0.68
POSITIVE LOGITS
Wear
1.36
wear
1.32
Wear
1.24
worn
1.23
wear
1.19
wears
1.19
WEAR
1.17
Worn
1.09
wore
1.06
Wearing
1.02
Activations Density 0.028%