INDEX
Explanations
words indicating a call to action or suggestion to the reader
occurrences of the word "have" and related phrases
New Auto-Interp
Negative Logits
anism
-0.73
appell
-0.66
ô
-0.66
cules
-0.65
matters
-0.65
Islamic
-0.64
upon
-0.64
organs
-0.64
objectionable
-0.64
irregularities
-0.63
POSITIVE LOGITS
couple
1.18
few
1.14
bunch
1.11
lot
0.97
handful
0.96
dozen
0.91
uras
0.89
friend
0.87
bit
0.83
glimpse
0.82
Activations Density 0.345%