INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ก็
    -1.21
    gehen
    -1.18
     realmente
    -1.17
    ticularly
    -1.16
     ats
    -1.13
     gustar
    -1.13
     ſou
    -1.13
    และ
    -1.12
     kompet
    -1.11
    geladen
    -1.11
    POSITIVE LOGITS
     only
    2.09
     by
    1.80
    Only
    1.60
     лишь
    1.57
     for
    1.55
     merely
    1.55
     few
    1.51
     before
    1.45
     Only
    1.45
     ONLY
    1.39
    Act Density 0.028%

    No Known Activations