INDEX
Explanations
sequences of numbers with different format characters in between
numerical values or references to statistics
New Auto-Interp
Negative Logits
teasp
-0.74
vertisement
-0.69
BuyableInstoreAndOnline
-0.67
corrid
-0.66
comprom
-0.65
PDATE
-0.63
Leban
-0.62
referen
-0.61
carbohyd
-0.61
gdala
-0.59
POSITIVE LOGITS
wm
1.01
df
0.86
e
0.85
chev
0.83
ffe
0.81
eus
0.78
eff
0.76
201
0.76
bg
0.75
rez
0.74
Activations Density 0.102%