INDEX
Explanations
mentions of geographical locations or landmarks
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.06
3:0.05
4:0.04
5:0.03
6:0.39
7:0.11
8:0.03
9:0.05
10:0.06
11:0.07
Negative Logits
irregularities
-1.28
autos
-1.17
billboards
-1.16
stitching
-1.14
GOODMAN
-1.14
ultrasound
-1.13
violations
-1.13
paran
-1.11
ATM
-1.11
advertisements
-1.09
POSITIVE LOGITS
Revenge
1.40
essen
1.38
]}
1.36
Recover
1.35
Ruin
1.33
Romance
1.32
oris
1.24
Despair
1.24
orgetown
1.22
riet
1.21
Activations Density 0.002%