INDEX
Explanations
references or mentions of the United Nations (UN) in the text
references to the United Nations (UN)
Negative Logits
thora         -0.82
Camb          -0.73
Ö¼            -0.73
hetti         -0.72
士            -0.70
stals         -0.68
Mechdragon    -0.67
Elect         -0.66
ãĤ¼           -0.66
Demons        -0.66
Positive Logits
SC            0.97
KNOWN         0.95
UN            0.95
ilateral      0.85
ICE           0.84
DEC           0.82
UN            0.82
IF            0.81
namese        0.80
envoy         0.80
Activations Density 0.007%