INDEX
Explanations
mentions of different countries, particularly Japan
mentions of Japanese in various contexts
New Auto-Interp
Negative Logits
mble
-0.94
byss
-0.88
olkien
-0.84
xual
-0.81
phe
-0.81
arth
-0.80
achine
-0.75
ructure
-0.74
liest
-0.73
arter
-0.73
POSITIVE LOGITS
nationals
1.10
diplomats
1.05
embassy
1.00
consulate
0.99
officials
0.97
embassies
0.96
authorities
0.92
Embassy
0.90
hostages
0.86
counterparts
0.86
Activations Density 0.097%