INDEX
Explanations
newsworthy locations or events
punctuation marks, specifically parentheses and periods
New Auto-Interp
Negative Logits
fen
-0.64
antha
-0.62
untouched
-0.62
oooooooooooooooo
-0.61
azon
-0.61
overw
-0.60
yrus
-0.59
fruit
-0.58
blo
-0.58
stabil
-0.57
POSITIVE LOGITS
--
0.82
âĢķ
0.73
Investigators
0.73
Retrieved
0.72
®
0.71
Tonight
0.71
>>
0.71
–
0.69
esm
0.69
---
0.67
Activations Density 0.037%