INDEX
Explanations
sentences indicating negative events or situations
sentence endings that convey impactful or conclusive statements
Negative Logits
',"            -0.61
lled           -0.59
Thumbnail      -0.58
inguishable    -0.58
'."            -0.54
ucer           -0.53
untarily       -0.53
Cup            -0.52
Instance       -0.51
Mobil          -0.49
Positive Logits
↵Âł        1.18
Âł         1.13
Âł Âł      1.09
³³         1.09
³³         1.05
americ     0.94
Secondly   0.92
Âł         0.88
tumblr     0.84
↵↵         0.83
Activations Density 0.493%