INDEX
    Explanations

    words related to health and medical conditions

    New Auto-Interp
    Negative Logits
     Flavoring
    -0.88
    mosp
    -0.73
    ochond
    -0.66
    rador
    -0.60
    ortment
    -0.60
    ools
    -0.59
     bow
    -0.58
    icrobial
    -0.57
    deen
    -0.57
    arlane
    -0.55
    POSITIVE LOGITS
     enough
    0.99
     nonetheless
    0.79
     compared
    0.77
    ;
    0.76
    .
    0.74
    enough
    0.72
    :[
    0.69
     insofar
    0.69
    .<
    0.68
     nevertheless
    0.68
    Act Density 0.224%

    No Known Activations