Negative Logits
Expr        0.42
العشر       0.40
glue        0.40
izon        0.40
glue        0.39
gl          0.39
WebDriver   0.39
Hub         0.38
Reich       0.38
Hardwick    0.38
Positive Logits
assumptive  0.45
jantung     0.44
ใจ          0.44
void        0.42
اط          0.42
paroles     0.41
vate        0.40
серде       0.40
heart       0.39
corazón     0.39
Activations Density: 0.000%