INDEX
Explanations
instances of intimate and potentially non-consensual interactions
Following prepositions and locations
social locations and activities
New Auto-Interp
Negative Logits
TokenNameDOT
-0.43
reported
-0.36
Davidson
-0.35
DebuggerNonUser
-0.35
igel
-0.34
holtz
-0.34
hlen
-0.34
Matta
-0.34
larger
-0.33
nahilalakip
-0.33
POSITIVE LOGITS
yntaxException
0.67
Мексичка
0.50
Meksiku
0.49
ſelves
0.47
oprecipitation
0.47
zove
0.46
дописавши
0.45
BagLayout
0.45
AndroidJUnit
0.44
anthene
0.43
Activations Density 0.283%