INDEX
Explanations
references to political parties and sides in a conflict
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.14
3:0.05
4:0.08
5:0.10
6:0.14
7:0.05
8:0.06
9:0.04
10:0.15
11:0.09
Negative Logits
inventoryQuantity
-1.85
ビ
-1.62
Pand
-1.55
Prometheus
-1.43
Reincarn
-1.42
lar
-1.39
Offline
-1.37
Tears
-1.36
Prophe
-1.35
Grave
-1.34
POSITIVE LOGITS
alike
1.81
looph
1.64
astern
1.61
sides
1.51
uthor
1.46
iola
1.46
penchant
1.44
ellig
1.43
istries
1.43
sembly
1.42
Activations Density 0.001%