INDEX
Explanations
issues related to emails, accountability, or communications in various contexts
New Auto-Interp
Head Attr Weights
0:0.04
1:0.10
2:0.10
3:0.04
4:0.03
5:0.05
6:0.09
7:0.07
8:0.11
9:0.11
10:0.10
11:0.09
Negative Logits
Roots
-0.85
Maritime
-0.84
odore
-0.83
earchers
-0.81
Throne
-0.77
Madness
-0.76
Tropical
-0.76
distinction
-0.75
temptation
-0.74
Sunder
-0.74
POSITIVE LOGITS
pi
0.99
rus
0.84
'.
0.84
iannopoulos
0.80
heres
0.79
%.
0.78
widget
0.78
ASA
0.77
UKIP
0.77
XXXX
0.75
Activations Density 1.632%