INDEX
Explanations
references to figures and visual data representations
New Auto-Interp
Head Attr Weights
0:0.07
1:0.11
2:0.05
3:0.04
4:0.02
5:0.27
6:0.04
7:0.05
8:0.05
9:0.11
10:0.07
11:0.05
Negative Logits
orkshire
-1.91
nesday
-1.79
yip
-1.66
unicip
-1.64
vironment
-1.61
mber
-1.61
racuse
-1.60
uador
-1.59
nz
-1.58
hya
-1.55
POSITIVE LOGITS
Ez
1.33
conc
1.32
truths
1.27
summar
1.25
innocent
1.23
reminders
1.21
confirm
1.18
Obj
1.17
Brennan
1.17
certainty
1.16
Activations Density 0.003%