INDEX
    Explanations

    first breakdown explanation

    New Auto-Interp
    Negative Logits
    くれた
    0.82
     till
    0.78
    rierung
    0.75
     Till
    0.72
    ងារ
    0.72
     கொண்டிருக்கும்
    0.70
     renfer
    0.69
    czeń
    0.66
     llegó
    0.66
     trasferimento
    0.65
    POSITIVE LOGITS
    First
    0.78
    first
    0.78
     first
    0.77
     ابتدا
    0.75
     First
    0.71
    sect
    0.68
    Firstly
    0.67
    0.64
     Firstly
    0.64
    0.64
    Act Density 0.179%

    No Known Activations