INDEX
Explanations
references to "New York."
New Auto-Interp
Head Attr Weights
0:0.12
1:0.02
2:0.04
3:0.14
4:0.05
5:0.05
6:0.17
7:0.05
8:0.03
9:0.24
10:0.03
11:0.03
Negative Logits
MU
-2.89
Au
-2.86
Skinner
-2.83
Deer
-2.81
McH
-2.81
Martial
-2.78
Stall
-2.78
Caval
-2.74
MU
-2.73
caval
-2.71
POSITIVE LOGITS
Times
3.84
NYT
3.60
ny
3.33
yss
3.27
times
3.26
Times
3.20
TIM
3.03
ias
3.03
Tas
3.02
tm
2.88
Activations Density 0.002%