INDEX
    Explanations

    temporal references related to past events

    New Auto-Interp
    Negative Logits
    DMIN
    -0.17
    enge
    -0.16
    ackson
    -0.15
    utral
    -0.14
    ICES
    -0.14
    uo
    -0.14
    zbek
    -0.13
    lero
    -0.13
     witnesses
    -0.13
    [Index
    -0.13
    POSITIVE LOGITS
    ody
    0.24
    544
    0.17
    stell
    0.16
    akk
    0.16
    oger
    0.15
    iat
    0.15
    ëģ¼
    0.15
     ÑĢай
    0.15
     INDIRECT
    0.14
    odi
    0.14
    Act Density 0.004%

    No Known Activations