INDEX
    Explanations

    patterns or symbols, particularly unusual characters or formatting in the text

    New Auto-Interp
    Negative Logits
    inyin
    -0.13
    wendung
    -0.13
     Inquiry
    -0.13
     ÄijÃłi
    -0.13
     zpráva
    -0.13
    çĴ
    -0.13
    alarm
    -0.12
     bÄĥng
    -0.12
     invasive
    -0.12
    indr
    -0.12
    POSITIVE LOGITS
     voting
    0.41
     votes
    0.40
     vote
    0.39
     Voting
    0.38
     Votes
    0.34
     voter
    0.34
     Vote
    0.33
     voted
    0.33
    votes
    0.33
     voters
    0.32
    Act Density 0.005%

    No Known Activations