    NEGATIVE LOGITS
    " disequ"  0.51
    "規則"  0.45
    " 규칙"  0.44
    " racional"  0.43
    " axioms"  0.43
    " рациона"  0.43
    "moid"  0.42
    " astrophys"  0.42
    "િમ"  0.42
    " বিধি"  0.42
    POSITIVE LOGITS
    " translation"  1.28
    " bilingual"  1.26
    " multilingual"  1.26
    " Languages"  1.24
    " Translation"  1.23
    " language"  1.23
    " Language"  1.23
    " languages"  1.20
    " Bilingual"  1.17
    " translations"  1.15
    Activation Density: 0.322%

    No Known Activations