INDEX
    Explanations

    titles and references to various books and authors

    New Auto-Interp
    Negative Logits
    raud
    -0.16
    олоÑģ
    -0.16
    akk
    -0.14
    arez
    -0.14
    áš
    -0.13
    emma
    -0.13
     subj
    -0.13
     reg
    -0.13
     Lal
    -0.13
     Rough
    -0.13
    POSITIVE LOGITS
    benh
    0.16
    ego
    0.16
    Æ°á»Łng
    0.15
    iek
    0.15
    ONTAL
    0.15
    دÙĪØ§Ø¬
    0.14
    edn
    0.14
    eko
    0.14
    ILING
    0.14
    aber
    0.14
    Act Density 0.055%

    No Known Activations