INDEX
Explanations
phrases related to politics, government, and policies
mentions of the word "enemy" and its related context
New Auto-Interp
Head Attr Weights
0:0.04
1:0.03
2:0.15
3:0.04
4:0.36
5:0.03
6:0.04
7:0.02
8:0.07
9:0.10
10:0.04
11:0.02
Negative Logits
throp
-1.52
anecd
-1.49
Guard
-1.45
abo
-1.44
aqu
-1.39
Zip
-1.38
rek
-1.37
Stock
-1.36
quad
-1.36
rh
-1.36
POSITIVE LOGITS
ドラ
1.53
Confeder
1.42
Mormonism
1.36
Brotherhood
1.35
Axis
1.34
terness
1.31
ドラゴン
1.29
Hearts
1.27
Ferdinand
1.25
prejudice
1.24
Activations Density 0.008%