INDEX
Explanations
words and phrases indicating emotional responses or expressions
New Auto-Interp
Negative Logits
ClientSize
-0.64
frameCount
-0.54
Liar
-0.53
SYNOPSIS
-0.51
cticut
-0.50
bli
-0.49
seg
-0.49
槛
-0.49
Lie
-0.48
fart
-0.48
POSITIVE LOGITS
Displays
0.75
displays
0.75
exhibiting
0.74
íncia
0.74
Personensuche
0.73
displays
0.72
noinspection
0.72
displaying
0.71
Displays
0.70
الدراسه
0.70
Activations Density 0.179%