INDEX
Explanations
the presence of the word "have" indicating actions or conditions
New Auto-Interp
Negative Logits
s
-0.66
lots
-0.63
needs
-0.63
Capp
-0.62
cks
-0.62
mits
-0.61
sax
-0.58
needs
-0.57
Moc
-0.57
dil
-0.57
POSITIVE LOGITS
themselves
0.73
photobucket
0.69
protoimpl
0.68
ourselves
0.65
Rave
0.65
verifyException
0.65
Twe
0.65
peak
0.64
otheby
0.64
Vendo
0.64
Activations Density 0.056%