INDEX
    Explanations

    words related to health, physical condition, or general well-being

    Negative Logits
    Token             Logit
    "ifles"           -0.71
    " incorrectly"    -0.71
    " unknow"         -0.69
    " Hate"           -0.66
    " invented"       -0.63
    " forcibly"       -0.62
    " compuls"        -0.62
    " Dictionary"     -0.62
    "agine"           -0.61
    " futile"         -0.61
    Positive Logits
    Token             Logit
    " margins"        0.93
    " footing"        0.86
    "bye"             0.86
    " parity"         0.85
    "enough"          0.85
    " progress"       0.83
    " performer"      0.82
    " turnout"        0.78
    " financially"    0.77
    " performers"     0.76
    Activation Density: 0.271%

    No Known Activations