INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    に戻
    0.42
     plonge
    0.36
    0.35
     windmills
    0.35
     pozi
    0.34
     retourner
    0.34
     ফিরিয়ে
    0.33
    <unused85>
    0.33
     возвра
    0.32
    0.32
    POSITIVE LOGITS
    home
    0.48
     домой
    0.40
     home
    0.40
     arrivare
    0.39
     llegar
    0.38
    Home
    0.36
     hjem
    0.36
     llegaron
    0.35
     llegue
    0.35
     llegado
    0.35
    Act Density 0.000%

    No Known Activations