INDEX
    Explanations

    markers or indicators of lists and enumerations

    New Auto-Interp
    Negative Logits
    pend
    -0.15
    ulus
    -0.15
    ahoma
    -0.15
    ain
    -0.14
    ostel
    -0.14
    irt
    -0.14
    isin
    -0.14
    vail
    -0.14
    elor
    -0.14
    íģ
    -0.13
    POSITIVE LOGITS
    ownt
    0.17
    istrovstvÃŃ
    0.15
    bove
    0.14
    incident
    0.14
    .gdx
    0.14
    earn
    0.14
     oldukları
    0.14
     meny
    0.13
    IIIK
    0.13
    824
    0.13
    Act Density 0.014%

    No Known Activations