INDEX
    Explanations

    front end positions or references in various contexts

    New Auto-Interp
    Negative Logits
    anooga
    -0.15
    ANN
    -0.14
    agua
    -0.14
    ziej
    -0.14
    udev
    -0.14
    rita
    -0.14
    thane
    -0.13
    pone
    -0.13
     Mim
    -0.13
    èĮĤ
    -0.13
    POSITIVE LOGITS
    sted
    0.17
    sold
    0.16
    yle
    0.15
    άνει
    0.15
    IALIZ
    0.14
    YLE
    0.14
    ylim
    0.14
    amen
    0.14
    yles
    0.14
    aran
    0.14
    Act Density 0.028%

    No Known Activations