INDEX
Explanations
phrases related to cleanliness and purity
phrases indicating purity or the absence of undesirable substances
New Auto-Interp
Negative Logits
successfully
-0.79
isSpecialOrderable
-0.76
20439
-0.75
Alive
-0.75
Enabled
-0.75
reinstated
-0.73
YES
-0.72
ESE
-0.71
ttp
-0.71
Excellent
-0.71
POSITIVE LOGITS
fuss
1.10
intrusive
0.99
coercion
0.97
distractions
0.96
prejudice
0.95
distortion
0.94
gimm
0.94
wasteful
0.93
compromises
0.93
interference
0.93
Activations Density 0.347%