INDEX
Explanations
expressions of agreement and support in discussions
New Auto-Interp
Negative Logits
actéristique
-0.63
drawSprites
-0.60
olesale
-0.59
InputBorder
-0.58
"?>
-0.58
fhort
-0.57
fhew
-0.56
dafx
-0.55
ndose
-0.55
motherfucker
-0.55
POSITIVE LOGITS
@
0.78
OP
0.73
regarding
0.71
above
0.69
earlier
0.69
Regarding
0.63
ovan
0.62
выше
0.62
earlier
0.61
regarding
0.61
Activations Density 0.274%