INDEX
    Explanations

    mentions of languages

    New Auto-Interp
    Negative Logits
    xus
    -0.92
    lier
    -0.91
    igham
    -0.87
    urion
    -0.87
    vre
    -0.85
    rons
    -0.80
    ldon
    -0.79
    llan
    -0.78
    rences
    -0.78
    apego
    -0.77
    POSITIVE LOGITS
     translation
    1.06
     pronunciation
    1.03
     diction
    0.97
     language
    0.97
     translations
    0.90
     languages
    0.90
     transl
    0.89
     Nadu
    0.88
     accents
    0.87
     transcription
    0.87
    Act Density 0.085%

    No Known Activations