INDEX
    Explanations

    phrases indicating a need for attention to health-related issues

    Preceding "fact", "evidence", or "rarity"

    New Auto-Interp
    Negative Logits
     devriez
    -0.51
    最有
    -0.48
    ರೆ
    -0.48
    RegressionTest
    -0.48
     personenbez
    -0.47
    真正的
    -0.47
     diretamente
    -0.45
    的最
    -0.45
     attacc
    -0.45
    @
    -0.44
    POSITIVE LOGITS
     fact
    1.49
     Tatsache
    1.22
     lack
    1.13
    fact
    1.04
     sheer
    1.01
     fakt
    0.95
     ease
    0.88
     tremendous
    0.87
     inability
    0.87
     feit
    0.87
    Act Density 0.789%

    No Known Activations