INDEX
    Explanations

    terms indicating high efficacy or strength, specifically relating to health benefits or natural substances

    New Auto-Interp
    Negative Logits
    undi
    -0.15
    etten
    -0.15
    riott
    -0.15
    ãĥIJãĥ¼
    -0.15
    -me
    -0.14
    墨
    -0.14
    Jet
    -0.14
    оба
    -0.14
    jet
    -0.14
    untime
    -0.14
    POSITIVE LOGITS
    rego
    0.14
    aches
    0.14
    589
    0.14
    igon
    0.14
    roy
    0.14
    ACHI
    0.14
    otor
    0.14
    udden
    0.14
    ardi
    0.14
    eps
    0.13
    Act Density 0.004%

    No Known Activations