INDEX
Explanations
articles or determiners in various forms
New Auto-Interp
Negative Logits
poussière
-0.77
fumée
-0.77
espagne
-0.71
Gild
-0.69
armée
-0.69
charité
-0.69
nationaux
-0.68
économie
-0.68
survie
-0.67
ásban
-0.66
POSITIVE LOGITS
a
1.63
une
1.24
una
1.24
eine
1.23
Eine
1.20
einer
1.19
μια
1.18
an
1.15
uma
1.13
một
1.13
Activations Density 0.017%