    Explanations

    references to "new" entities, such as updates or changes in roles, positions, or locations

    Negative Logits
    NavController    -0.15
    IGIN    -0.15
     поÑĪ    -0.15
     ÎļοÏħ    -0.14
    upertino    -0.14
    tae    -0.14
    snap    -0.14
    hma    -0.14
     rencont    -0.13
    اسÙĬ    -0.13
    Positive Logits
    iche    0.20
    -found    0.18
    æł·åŃIJ    0.17
    swire    0.16
    roz    0.16
    opers    0.16
    çĵ    0.15
    uper    0.15
    erve    0.15
    iger    0.15
    Activation Density: 0.062%

    No Known Activations