INDEX
Explanations
mentions of the word "Mongolia" or related terms
references to the Mongolian culture, people, or location
New Auto-Interp
Negative Logits
cher
-0.74
rez
-0.74
Bet
-0.73
arget
-0.71
elle
-0.68
eries
-0.67
ere
-0.67
ulsion
-0.67
Sirius
-0.66
Rem
-0.65
POSITIVE LOGITS
Mong
3.44
Mongol
2.83
Mongolia
2.43
mong
1.85
ONG
1.60
jong
1.19
Dmitry
1.14
Cambod
1.09
ongs
1.06
osaurus
1.05
Activations Density 0.038%