INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     λε
    -0.08
     própria
    -0.07
    -0.07
    يسة
    -0.07
    -0.07
     ανα
    -0.07
    εν
    -0.07
     lone
    -0.07
     chim
    -0.07
     limiter
    -0.07
    POSITIVE LOGITS
    Interested
    0.09
     interesados
    0.08
    _windows
    0.08
    tych
    0.08
     वालों
    0.08
    *innen
    0.08
     cared
    0.08
     interesado
    0.08
    introduced
    0.08
     attending
    0.08
    Act Density 0.008%

    No Known Activations