INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    originally
    0.41
    全新
    0.40
    ਵਾ
    0.38
     சமீப
    0.38
    지금
    0.38
     únicamente
    0.37
     собственные
    0.37
    =['
    0.37
    들은
    0.37
    丝毫
    0.36
    POSITIVE LOGITS
    前者
    0.60
     aforesaid
    0.57
     above
    0.55
     previous
    0.54
     Previous
    0.53
     aforementioned
    0.52
     предыду
    0.51
     foregoing
    0.50
     उपरोक्त
    0.47
     Above
    0.45
    Act Density 0.205%

    No Known Activations