INDEX
Explanations
words related to countries, particularly focusing on Croatia
mentions of Croatia and related geographical references
Negative Logits
Token       Logit
lying       -0.81
ellation    -0.78
ENTS        -0.74
atives      -0.71
INGS        -0.69
ledged      -0.69
glim        -0.68
reads       -0.67
ively       -0.67
ENT         -0.67
Positive Logits
Token            Logit
oslov            0.87
Croatia          0.86
Äį               0.76
Herz             0.75
cius             0.74
Philippe         0.74
eri              0.68
uth              0.66
Confederation    0.65
IRC              0.65
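The token Äį in the positive logits looks like the printable rendering that byte-level BPE tokenizers use for raw bytes. The sketch below assumes the model's tokenizer follows the GPT-2 byte-to-character table, which the source does not state; the function names are hypothetical. Under that assumption, the token maps back to the two UTF-8 bytes of a single letter.

```python
def bytes_to_unicode():
    """Rebuild the byte -> printable-character table of GPT-2 style byte-level BPE.

    Assumption: the dashboard's tokenizer uses this table; the source does not say so.
    """
    # Bytes that are already printable map to themselves.
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Invert the table so displayed characters can be turned back into bytes.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Convert a displayed byte-level token back into the text it encodes."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("\u00c4\u012f"))  # the token shown as Äį; prints č
```

If the assumption holds, the token is the letter č, which is common in Croatian and fits the explanations given for this feature. Plain ASCII tokens such as Croatia decode to themselves.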
Activations Density: 0.022%
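As a rough illustration of the scale of that figure, the sketch below assumes density means the fraction of tokens on which the feature's activation is above zero. That definition is an assumption, and both the function name and the activation values are hypothetical rather than taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold.

    Assumption: "density" means the share of tokens with a nonzero activation.
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 tokens, of which 22 activate the feature.
acts = [0.0] * 100_000
for i in range(22):
    acts[i * 4_000] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.022%
```

Read this way, a density of 0.022% corresponds to about 22 active tokens per 100,000, which is consistent with a feature that fires on a narrow topic.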