INDEX
Explanations
numeric measurements
sentences that express measurements or specifications
New Auto-Interp
Negative Logits
advis
-0.78
worsh
-0.65
chained
-0.63
maiden
-0.63
performer
-0.61
trouble
-0.61
particip
-0.60
enrollment
-0.60
flooding
-0.60
purse
-0.60
POSITIVE LOGITS
5
1.44
75
1.32
0
1.31
25
1.27
8
1.23
66666666
1.22
875
1.19
000000
1.17
6
1.16
3333
1.15
Activations Density 0.085%