INDEX
    Explanations

    Bringing things from outside

    New Auto-Interp
    Negative Logits
     forming
    -0.09
    ൈന
    -0.08
    ार
    -0.07
    推进
    -0.07
    ование
    -0.07
     staking
    -0.07
    ાઇન
    -0.07
    فاء
    -0.07
     both
    -0.07
     redund
    -0.07
    POSITIVE LOGITS
    回来
    0.12
    0.11
     freshly
    0.11
     regreso
    0.11
     hasil
    0.10
     возвращ
    0.10
     berasal
    0.10
     afkomstig
    0.10
     frisch
    0.10
     pós
    0.09
    Act Density 0.104%

    No Known Activations