INDEX
Explanations
mentions of specific countries, particularly focusing on Romania
references to Romania and related concepts
New Auto-Interp
Negative Logits
oos
-0.79
=-=-=-=-
-0.77
creen
-0.76
uries
-0.74
lihood
-0.73
vind
-0.73
ITNESS
-0.73
emark
-0.72
merce
-0.72
edience
-0.72
POSITIVE LOGITS
arest
0.94
Romanian
0.90
Romania
0.87
Buch
0.83
Monteneg
0.81
srfAttach
0.75
Reign
0.70
senal
0.70
Catholicism
0.69
Falls
0.68
Activations Density 0.014%