INDEX
Explanations
discussions around decision making and opinion sharing in various contexts
New Auto-Interp
Head Attr Weights
0:0.07
1:0.03
2:0.04
3:0.34
4:0.13
5:0.05
6:0.03
7:0.08
8:0.07
9:0.02
10:0.07
11:0.03
Negative Logits
perfected
-2.45
ielding
-2.25
favoured
-2.17
'),
-2.17
oufl
-2.07
clad
-2.04
disguised
-2.03
discarded
-2.02
arers
-2.02
fitted
-2.02
POSITIVE LOGITS
Interview
2.80
[+
2.78
SPORTS
2.66
Aren
2.64
CBS
2.59
JC
2.58
WOM
2.56
:
2.53
AMY
2.50
?:
2.49
Activations Density 0.204%