INDEX
Explanations
phrases indicating observations or examinations
personal pronouns and references to viewers or audiences
New Auto-Interp
Negative Logits
Virtue
-0.69
Vengeance
-0.63
delaying
-0.62
Kills
-0.62
extortion
-0.61
sacrific
-0.61
ransom
-0.59
publicity
-0.59
onstage
-0.58
yss
-0.58
POSITIVE LOGITS
uggest
0.99
discern
0.91
understand
0.85
realise
0.84
noticed
0.82
inguishable
0.81
realize
0.79
glimpse
0.79
RESULTS
0.79
aeus
0.77
Activations Density 0.348%