INDEX
    Explanations

    instances of punctuation used in citations and references

    New Auto-Interp
    Negative Logits
    acz
    -0.17
    jspx
    -0.16
    ipp
    -0.16
    çĽ
    -0.15
     sey
    -0.15
    agua
    -0.15
    μοÏħ
    -0.14
    ÂŃi
    -0.14
    valuator
    -0.14
    deniz
    -0.14
    POSITIVE LOGITS
    ZO
    0.18
     Kons
    0.17
    jer
    0.16
    ocab
    0.16
    266
    0.16
    asher
    0.15
    ummer
    0.15
    ogl
    0.15
     tw
    0.15
    ä½Ļ
    0.15
    Act Density 0.002%

    No Known Activations