Explanations
mentions of various countries and political entities
references to specific countries and their political or military involvement
Negative Logits
Token           Logit
Els             -0.64
eternity        -0.62
ãĥł             -0.61
tnc             -0.60
ixt             -0.60
ãĤ¦ãĤ¹          -0.59
âĶĢâĶĢâĶĢâĶĢ    -0.58
Translation     -0.58
ãĥĥãĥī          -0.57
Zip             -0.57
Positive Logits
Token           Logit
denies          1.25
accuses         1.22
meanwhile       1.18
reacted         1.16
insists         1.14
spokesman       1.13
contends        1.12
responded       1.12
opposes         1.09
appealed        1.07
Activations Density: 0.340%
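
The density figure is not defined in this listing. A common reading, assumed here, is that it is the share of sampled token positions on which the feature's activation is above zero. The sketch below shows that calculation on made-up data; the function name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and do not come from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    # Count positions where the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1,000 token positions, 3 of which activate the feature.
sample = [0.0] * 997 + [1.2, 0.4, 2.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.300%"
```

Under this reading, a density of 0.340% would mean the feature fires on roughly 34 of every 10,000 sampled tokens.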