INDEX
    Explanations

    punctuation marks and question marks

    New Auto-Interp
    Negative Logits
    tvguidetime
    -0.97
     surla
    -0.73
    findpost
    -0.72
     '{@
    -0.70
     linkovi
    -0.65
    #+#
    -0.62
    -0.62
     Athen
    -0.61
     AssemblyVersion
    -0.58
     بيها
    -0.58
    POSITIVE LOGITS
    pium
    0.59
    fino
    0.58
    ėk
    0.57
    يادة
    0.57
    ();)
    0.55
    tled
    0.54
    élev
    0.54
     بلکه
    0.53
    COLOG
    0.53
    cjon
    0.52
    Act Density 0.378%

    No Known Activations