Explanations
phrases indicating the absence of something or a lack of evidence
Negative Logits
manufact    -0.36
Arbeit      -0.33
pack        -0.32
les         -0.32
router      -0.32
couz        -0.31
marquer     -0.31
7           -0.30
épo         -0.30
se          -0.30
Positive Logits
OGND    0.89
قایناقلار    0.87
цездатний    0.84
GEBURTSDATUM    0.76
saraba    0.75
$_"    0.72
tartalomajánló    0.71
ंदीखरीदारी    0.71
rungsseite    0.71
$_(    0.69
Activations Density: 0.151%