INDEX
    Explanations

    capital city, seat, mind, best group

    Negative Logits
    "nij"  0.53
    "行う"  0.53
    "मधील"  0.50
    " induced"  0.50
    " Einrichtung"  0.50
    " inductive"  0.47
    " چون"  0.46
    "impossible"  0.46
    "how"  0.45
    " assass"  0.45
    Positive Logits
    "out"  0.46
    " ook"  0.43
    " temperament"  0.42
    "Out"  0.41
    " "  0.41
    "បន្ថែម"  0.40
    "sized"  0.40
    " महंत"  0.40
    "ément"  0.40
    "に合わせて"  0.40
    Activation Density: 0.000%

    No Known Activations