INDEX
Explanations
geographical locations and proper nouns related to places and entities
New Auto-Interp
Negative Logits
agoza
-0.63
paksa
-0.60
Trier
-0.58
PARIS
-0.58
Maine
-0.56
españa
-0.56
المصري
-0.55
a
-0.55
Honolulu
-0.54
APORE
-0.54
POSITIVE LOGITS
Савезне
0.87
Anſ
0.79
Jefus
0.76
Efq
0.75
Quelles
0.73
Politique
0.70
Beſ
0.69
ſever
0.68
Conſ
0.68
swatches
0.68
Activations Density 0.466%