INDEX
    Explanations

    expressions related to failure or negative outcomes

    instances of the word "fo" and other related terms suggestive of failure or administrative actions

    New Auto-Interp
    Negative Logits
     Lumpur
    -0.66
     Sector
    -0.66
    uyomi
    -0.66
     Poster
    -0.62
     Norn
    -0.61
    Downloadha
    -0.60
     warr
    -0.59
    owship
    -0.59
    é¾įå¥ij士
    -0.59
     XY
    -0.59
    POSITIVE LOGITS
    eton
    0.82
    eto
    0.81
    emen
    0.81
    isted
    0.78
    oche
    0.77
    oing
    0.77
    ogs
    0.77
    umbers
    0.76
    cludes
    0.75
    amy
    0.75
    Act Density 0.011%

    No Known Activations