INDEX
    Explanations

    words that indicate doubt or uncertainty

    negative contractions indicating unfulfilled actions or states

    Negative Logits
    CVE            -0.69
    Reviewer       -0.63
     behavi        -0.61
    è»             -0.60
     SetTextColor  -0.60
     Polaris       -0.58
    CRIP           -0.58
    士             -0.58
     Pike          -0.57
    çĦ             -0.57
    Positive Logits
    aken           1.00
    alion          0.99
    ween           0.97
    reprene        0.95
    ournament      0.94
    akers          0.92
    rees           0.91
    otally         0.89
    itles          0.88
    enegger        0.87
    Activation Density: 0.013%

    No Known Activations