INDEX
    Explanations

    references to language learning and bilingualism

    New Auto-Interp
    Negative Logits
    oa
    -0.16
    ymes
    -0.14
    asma
    -0.14
    oxy
    -0.14
    oval
    -0.14
     Levine
    -0.14
    ãĥ¼ãĥĹ
    -0.14
    loyment
    -0.14
    ansson
    -0.14
    ife
    -0.14
    POSITIVE LOGITS
    emiz
    0.16
     Barbar
    0.16
    _singleton
    0.16
    enÃŃ
    0.15
    ahr
    0.15
    atural
    0.14
    बल
    0.14
    rowable
    0.13
    abra
    0.13
    _PAIR
    0.13
    Act Density 0.461%

    No Known Activations