INDEX
Explanations
number ranges
numerical ranges or age groups specified in the text
Negative Logits
redes   -0.70
yet     -0.61
Polic   -0.60
NRS     -0.58
erman   -0.58
Adds    -0.57
ŃĶ      -0.57
AFTA    -0.56
ãĥķãĤ©  -0.56
Judge   -0.52
Positive Logits
])                       0.66
vag                      0.61
flush                    0.60
sexes                    0.59
BuyableInstoreAndOnline  0.58
ust                      0.58
wealth                   0.57
Í                        0.56
akespeare                0.56
ties                     0.56
Activations Density: 0.075%