INDEX
Explanations
references to international relations and geopolitical entities
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.05
4:0.04
5:0.04
6:0.41
7:0.05
8:0.04
9:0.04
10:0.12
11:0.06
Negative Logits
ACTIONS
-1.54
lifes
-1.40
Sack
-1.23
tast
-1.23
Roll
-1.21
candle
-1.21
esville
-1.21
Packs
-1.16
downward
-1.16
ween
-1.16
POSITIVE LOGITS
Magikarp
1.47
Clar
1.32
Vaj
1.30
Neh
1.28
consulate
1.26
Lia
1.25
サーティワン
1.24
imaru
1.23
フォ
1.23
kindred
1.22
Activations Density 0.018%