Explanations
mentions of communication tools and surveillance methods
Head Attr Weights
0:0.02
1:0.02
2:0.07
3:0.05
4:0.04
5:0.03
6:0.40
7:0.04
8:0.02
9:0.03
10:0.11
11:0.11
Negative Logits
trump          -1.38
Pradesh        -1.35
displayText    -1.29
hindsight      -1.29
Lauder         -1.27
者             -1.26
departures     -1.25
vable          -1.22
ngth           -1.21
oway           -1.17
Positive Logits
runs           1.36
icc            1.30
parts          1.22
cereal         1.22
theless        1.21
tablets        1.20
Downloadha     1.14
γ              1.13
whatever       1.11
issues         1.11
Activations Density 0.014%