INDEX
    Explanations

    references to gameplay mechanics and instructions in gaming contexts

    New Auto-Interp
    Negative Logits
    chter
    -0.14
    emean
    -0.14
    olly
    -0.14
     Armstrong
    -0.14
    emailer
    -0.14
    ombok
    -0.13
    ruba
    -0.13
    obl
    -0.13
    ıydı
    -0.13
    eree
    -0.13
    POSITIVE LOGITS
    613
    0.15
     this
    0.14
    655
    0.13
     Asc
    0.13
    egasus
    0.13
    614
    0.13
    ilos
    0.13
    è¿Ļç§į
    0.13
    ilha
    0.12
    éĤª
    0.12
    Act Density 8.003%

    No Known Activations