INDEX
Explanations
words related to physical sensations or conditions, particularly negative ones
words and roots related to actions or states of being
Negative Logits
YN          -0.79
ADRA        -0.74
aver        -0.72
ainer       -0.69
clerosis    -0.68
ynthesis    -0.67
chwitz      -0.67
ajor        -0.67
uther       -0.67
ovember     -0.66
Positive Logits
ness        1.37
ly          1.27
nesses      1.26
est         1.08
enough      1.02
liness      0.95
glers       0.92
itude       0.92
hearted     0.90
humour      0.89
Activations Density 0.220%