INDEX
Explanations
numerical values associated with scores or performances
New Auto-Interp
Head Attr Weights
0:0.13
1:0.09
2:0.10
3:0.06
4:0.04
5:0.10
6:0.04
7:0.01
8:0.12
9:0.07
10:0.05
11:0.12
Negative Logits
orney
-1.60
ographies
-1.59
Seym
-1.57
®
-1.54
Authors
-1.46
Writing
-1.45
distinguishing
-1.43
orget
-1.41
SOFTWARE
-1.40
eaturing
-1.39
POSITIVE LOGITS
alore
1.63
elope
1.40
altern
1.37
mates
1.34
Phys
1.33
move
1.32
carc
1.32
morrow
1.31
Talk
1.29
Bed
1.29
Activations Density 0.002%