INDEX
Explanations
references to political figures and their interactions in a diplomatic context
New Auto-Interp
Negative Logits
Ö¼
-0.66
phased
-0.61
ILCS
-0.61
Dangerous
-0.60
ucket
-0.58
Wr
-0.57
]=
-0.56
proportion
-0.55
commodity
-0.55
subscrib
-0.54
POSITIVE LOGITS
representatives
0.90
armac
0.78
ilaterally
0.76
regarding
0.75
strate
0.75
backstage
0.74
counterparts
0.73
Listen
0.73
discussing
0.72
reet
0.72
Activations Density 0.242%