INDEX
    Explanations

    titles of books and scholarly works

    New Auto-Interp
    Negative Logits
    :
    -0.14
    æ§ĺ
    -0.14
    atty
    -0.14
    ipi
    -0.14
    .reactivex
    -0.14
    ł
    -0.14
    łí
    -0.13
     Burgess
    -0.13
    ollapsed
    -0.13
     bins
    -0.13
    POSITIVE LOGITS
     sát
    0.17
     chatte
    0.15
    iling
    0.15
    оÑģÑĥд
    0.15
    &W
    0.14
    inkle
    0.14
    RIPT
    0.14
    بÙĪØ§Ø³Ø·Ø©
    0.14
    'gc
    0.14
    ضاÙĨ
    0.14
    Act Density 0.043%

    No Known Activations