INDEX
Explanations
numerical references such as quantities or steps in a process
New Auto-Interp
Negative Logits
actionDate
-0.65
sacked
-0.64
Jagu
-0.63
swearing
-0.61
rolet
-0.60
Fas
-0.58
Exodus
-0.56
slogan
-0.56
Sacrament
-0.56
folly
-0.56
POSITIVE LOGITS
nd
1.20
½
1.11
rd
1.10
WD
1.05
200
0.96
ÃĹ
0.94
ÏĢ
0.91
yrs
0.90
%-
0.89
âĺħ
0.89
Activations Density 0.140%