Explanations
connections in scientific or technical explanations
Negative Logits
surla            -0.64
estekak          -0.61
transfieras      -0.57
Administrativna  -0.52
expandindo       -0.52
esternos         -0.47
strix            -0.46
հղումներ         -0.46
دانشنامهٔ        -0.44
Privacidade      -0.44
Positive Logits
indeed           3.42
indeed           2.94
Indeed           2.78
Indeed           2.64
inderdaad        2.56
effectivement    2.25
infatti          2.16
確かに           2.06
действительно    2.03
的确             1.91
Activations Density 0.352%
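
The density figure can be read as the share of sampled token positions on which the feature fires. The page does not state the exact definition, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the zero threshold, and the toy data are all hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active; the zero threshold is illustrative, not confirmed.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 3 of 1000 token positions.
toy = [0.0] * 997 + [1.2, 0.4, 2.9]
print(f"{activation_density(toy):.3%}")  # prints 0.300%
```

Under this reading, a density of 0.352% would correspond to the feature being active on roughly 35 of every 10,000 sampled tokens.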