INDEX
Explanations
references to personal relationships or friendships
New Auto-Interp
Head Attr Weights
0:0.09
1:0.06
2:0.08
3:0.07
4:0.09
5:0.07
6:0.08
7:0.08
8:0.07
9:0.08
10:0.09
11:0.09
Negative Logits
habi
-2.96
spotting
-2.57
wil
-2.48
attribut
-2.43
attacking
-2.39
casting
-2.36
attackers
-2.35
abi
-2.30
emort
-2.29
poaching
-2.27
POSITIVE LOGITS
Greenland
2.52
Debbie
2.52
OPLE
2.50
Gree
2.45
Benefit
2.44
Sylvia
2.43
Montana
2.43
Peggy
2.40
Lois
2.38
FDR
2.37
Activations Density 0.000%