INDEX
    Explanations

    references to governmental or organizational authority

    Negative Logits
    Token         Logit
    "LEM"         -0.16
    "wner"        -0.15
    ".cljs"       -0.15
    "positor"     -0.14
    "ẫ"           -0.14
    "ufe"         -0.13
    "ãĥijãĥ³"      -0.13
    "klä"         -0.13
    "voie"        -0.13
    "Łèĥ½"        -0.13
    Positive Logits
    Token         Logit
    "eldon"       0.15
    "rieve"       0.15
    "thest"       0.14
    "usto"        0.13
    " дво"        0.13
    "998"         0.13
    " fur"        0.13
    " hobbies"    0.13
    "995"         0.13
    "pra"         0.13
    Activation Density: 0.109%

    No Known Activations