INDEX
Explanations
proper nouns, including names and places
New Auto-Interp
Negative Logits
PreferredItem
-0.94
Personensuche
-0.92
ValueStyle
-0.89
BIBSYS
-0.88
:✨
-0.80
ieteur
-0.79
ويكيپيديا
-0.77
Familienname
-0.75
absl
-0.74
Примітки
-0.74
POSITIVE LOGITS
Monti
0.88
Janis
0.81
Honig
0.75
Kon
0.72
Richt
0.71
Camb
0.70
Кон
0.69
Kon
0.68
()].
0.68
Camb
0.67
Activations Density 2.745%