INDEX
Explanations
mentions of physical states, ailments, or conditions with an emphasis on 'mild' severity
descriptors indicating a mild intensity or degree
New Auto-Interp
Negative Logits
funding
-0.71
iston
-0.69
hub
-0.68
stack
-0.67
etheus
-0.65
Observatory
-0.65
owicz
-0.64
paralle
-0.64
collided
-0.64
-0.63
POSITIVE LOGITS
mild
3.00
pleasant
1.96
gentle
1.91
Mild
1.74
benign
1.65
mildly
1.59
harmless
1.58
pleasant
1.51
mell
1.37
soothing
1.34
Activations Density 0.025%