Explanations
phrases emphasizing totality or completeness
Negative Logits
Kontakte        -0.68
Lieferumfang    -0.68
ولد             -0.66
nameLabel       -0.66
Datuak          -0.66
Missile         -0.65
Snakes          -0.64
missile         -0.64
piety           -0.63
Arque           -0.63
Positive Logits
Completely      1.65
completely      1.62
Completely      1.58
totally         1.52
Totally         1.50
Totally         1.46
completely      1.38
totally         1.30
entirely        1.26
completamente   1.18
Activations Density: 0.123%