INDEX
Explanations
statements or phrases that indicate conclusions
New Auto-Interp
Head Attr Weights
0:0.05
1:0.02
2:0.11
3:0.05
4:0.10
5:0.09
6:0.04
7:0.03
8:0.10
9:0.24
10:0.07
11:0.04
Negative Logits
ustomed
-1.30
cel
-1.27
unts
-1.21
Joined
-1.20
apon
-1.17
broom
-1.11
bill
-1.07
lav
-1.07
abroad
-1.06
brushes
-1.06
POSITIVE LOGITS
��
1.46
���
1.42
Forth
1.30
captcha
1.27
explan
1.24
��
1.24
causation
1.23
moot
1.23
disapp
1.21
Preferences
1.21
Activations Density 0.014%