INDEX
Explanations
instances of the word "Has" followed by numbers
occurrences of the phrase "Has" indicating possession, action, or existence
Negative Logits
Token       Logit
Mercury     -0.71
takedown    -0.68
eering      -0.68
primates    -0.64
Tropical    -0.63
æĢ          -0.62
lining      -0.60
andering    -0.60
aneous      -0.60
pole        -0.60
Positive Logits
Token       Logit
kell        1.16
linger      1.00
Been        0.95
bro         0.91
lem         0.90
fred        0.90
been        0.89
endor       0.88
lington     0.87
merga       0.86
Activations Density 0.007%