INDEX
Explanations
quantitative expressions of quantity or scale
phrases that include fractional quantities or measurements
Negative Logits
appropri    -0.66
alloc       -0.66
bearer      -0.64
propri      -0.64
owicz       -0.63
PLA         -0.61
ADA         -0.60
amac        -0.60
uncond      -0.59
intrins     -0.59
Positive Logits
dozen       0.96
dozen       0.85
tails       0.84
hops        0.78
peeled      0.74
ousand      0.72
oil         0.71
agos        0.70
nir         0.69
assed       0.69
Activations Density 0.070%