INDEX
    Explanations

    words related to allegations and claims of wrongdoing

    New Auto-Interp
    Negative Logits
     lately
    -0.19
    isko
    -0.16
     recently
    -0.15
    reek
    -0.15
    emme
    -0.15
     since
    -0.15
     Happ
    -0.15
    _recent
    -0.14
    -addon
    -0.14
     Hass
    -0.14
    POSITIVE LOGITS
     hadn
    0.18
     telah
    0.17
     has
    0.17
     hav
    0.17
     have
    0.17
     had
    0.16
     haber
    0.15
    mÄ±ÅŁtır
    0.15
     ÙĤد
    0.15
     hath
    0.15
    Act Density 0.167%

    No Known Activations