Explanations
characters with diacritics or accents
Negative Logits
********          -0.66
****************  -0.63
XXXXXXXX          -0.62
<strong>          -0.59
*****             -0.58
المعيارى          -0.58
chanti            -0.57
asztal            -0.56
almaz             -0.55
↵↵                -0.55
Positive Logits
Erdoğan   1.02
Citroën   0.97
Beyoncé   0.95
Nestlé    0.95
Renée     0.94
étit      0.94
Yucatán   0.93
]='\      0.92
Condé     0.92
Crème     0.91
Activations Density: 0.384%