INDEX
    Explanations

    key actions or relationships that involve comparison and citation

    Negative Logits
    oi        -0.18
    eller     -0.16
    ivi       -0.15
    oc        -0.15
    "."       -0.14
     Fallon   -0.14
    î         -0.14
    ry        -0.14
     Walsh    -0.14
    pr        -0.14
    Positive Logits
    ì§Ħ       0.17
    enger     0.17
     Všech    0.16
    idlo      0.15
    oran      0.15
     fin      0.15
    rve       0.15
    (ARG      0.14
    xaf       0.14
    adero     0.14
    Activation Density 0.019%

    No Known Activations