INDEX
Explanations
references to government officials and their positions, particularly foreign ministers
New Auto-Interp
Head Attr Weights
0:0.13
1:0.04
2:0.03
3:0.05
4:0.03
5:0.39
6:0.07
7:0.02
8:0.09
9:0.05
10:0.02
11:0.02
Negative Logits
Haunted
-2.61
robbers
-2.33
Clockwork
-2.33
Corner
-2.25
brick
-2.25
CODE
-2.24
Warehouse
-2.19
Eater
-2.19
Campus
-2.18
Street
-2.15
POSITIVE LOGITS
iane
2.55
anyahu
2.54
VERTIS
2.35
ince
2.33
France
2.30
Lavrov
2.30
diplomacy
2.28
Jinping
2.28
uri
2.19
plom
2.17
Activations Density 0.029%