INDEX
    Explanations

    calculating sums

    Negative Logits
    Token          Logit
    "øst"          -0.08
    (blank)        -0.07
    " biom"        -0.07
    " Head"        -0.07
    " Lock"        -0.07
    " Hör"         -0.07
    "пол"          -0.07
    "عين"          -0.07
    " sensible"    -0.07
    " fidèle"      -0.07

    Positive Logits
    Token          Logit
    " govern"      0.08
    "ected"        0.08
    " જાય"         0.08
    " ناش"         0.08
    " যায়"         0.07
    (blank)        0.07
    "Src"          0.07
    " معها"        0.07
    "_it"          0.07
    " معه"         0.07
    Activation Density: 0.013%

    No Known Activations