INDEX
Explanations
specific mentions of people and actions related to incidents or events
occurrences of the word "the."
New Auto-Interp
Negative Logits
ç¥ŀ
-0.77
joice
-0.73
Gohan
-0.71
luster
-0.70
ccoli
-0.69
Enjoy
-0.68
Buddhism
-0.67
Krugman
-0.67
topia
-0.67
Luffy
-0.67
POSITIVE LOGITS
latter
1.20
same
1.15
heaviest
1.00
FBI
0.99
defendant
0.99
complainant
0.98
alleged
0.97
aforementioned
0.97
incident
0.96
affidavit
0.95
Activations Density 1.348%