INDEX
Explanations
words related to health, physical condition, or general well-being
New Auto-Interp
Negative Logits
ifles
-0.71
incorrectly
-0.71
unknow
-0.69
Hate
-0.66
invented
-0.63
forcibly
-0.62
compuls
-0.62
Dictionary
-0.62
agine
-0.61
futile
-0.61
POSITIVE LOGITS
margins
0.93
footing
0.86
bye
0.86
parity
0.85
enough
0.85
progress
0.83
performer
0.82
turnout
0.78
financially
0.77
performers
0.76
Activations Density 0.271%