INDEX
    Explanations

    numerical data and percentages

    New Auto-Interp
    Negative Logits
    abant
    -0.15
    Verdana
    -0.14
    ondere
    -0.13
    ĥĿ
    -0.13
     Tanz
    -0.13
    itung
    -0.13
    nda
    -0.13
    buz
    -0.13
    otr
    -0.13
    au
    -0.12
    POSITIVE LOGITS
    ensen
    0.16
    ény
    0.15
    oint
    0.14
    nier
    0.14
    angelo
    0.14
    linger
    0.14
    constant
    0.14
    æģĴ
    0.14
    jer
    0.13
    amar
    0.13
    Act Density 0.005%

    No Known Activations