INDEX
    Explanations

    references to years and dates related to individuals or events

    New Auto-Interp
    Negative Logits
    gettext
    -0.16
    cta
    -0.15
    elter
    -0.15
    hpp
    -0.14
    SWG
    -0.14
    ÑĪев
    -0.14
    ISMATCH
    -0.14
    owie
    -0.13
    éry
    -0.13
    ér
    -0.13
    POSITIVE LOGITS
    /
    0.17
    AD
    0.16
    ÃĹ
    0.15
    Ø¡
    0.15
     CE
    0.15
    pong
    0.15
     was
    0.15
    ênh
    0.15
    ëħĦ
    0.14
    CE
    0.14
    Act Density 0.014%

    No Known Activations