INDEX
    Explanations

    traditional

    New Auto-Interp
    Negative Logits
     Этот
    -0.09
     degrade
    -0.08
     റെ
    -0.08
     Нед
    -0.08
    _runs
    -0.08
     അഭിപ്രായ
    -0.08
    endoza
    -0.08
     постоянно
    -0.08
    ekiso
    -0.08
     Poems
    -0.08
    POSITIVE LOGITS
    -fashioned
    0.12
     traditional
    0.11
    传统
    0.11
    traditional
    0.10
     tradition
    0.09
     tradicionais
    0.09
     tradi
    0.09
     clásica
    0.09
     traditionnelle
    0.09
     tradicional
    0.09
    Act Density 0.032%

    No Known Activations