INDEX
    Explanations

    terms related to global concepts and competition

    Negative Logits
    ikel      -0.09
    idel      -0.07
    cken      -0.07
    sse       -0.07
    ORY       -0.07
    seau      -0.07
    ابت       -0.07
    esis      -0.07
    awai      -0.07
    leri      -0.06
    Positive Logits
    /global   0.12
    /local    0.12
     warming  0.12
    ToLocal   0.12
    /world    0.11
    -wide     0.10
    isation   0.10
    ized      0.10
    izing     0.10
    -local    0.09
    Act Density 0.023%

    No Known Activations