INDEX
    Explanations

    words related to claims or allegations

    words indicating claims or descriptions that may lack credibility or certainty

    New Auto-Interp
    Negative Logits
    izoph
    -0.82
     reinforcement
    -0.62
    thro
    -0.61
     Leilan
    -0.60
     IPM
    -0.60
    ersen
    -0.59
     Option
    -0.58
     electr
    -0.58
     Trials
    -0.57
     Syndicate
    -0.57
    POSITIVE LOGITS
    ãĥ¼ãĥĨãĤ£
    0.93
    OPLE
    0.89
    mble
    0.77
    aily
    0.76
    ELF
    0.75
     Parenthood
    0.70
    querque
    0.68
     innocuous
    0.67
    ational
    0.66
    ãĥŁ
    0.66
    Act Density 0.029%

    No Known Activations