INDEX
    Explanations

    references to historical events and changes in society

    New Auto-Interp
    Negative Logits
     Yesterday
    -0.17
     recently
    -0.16
    Yesterday
    -0.14
    eldom
    -0.14
    utures
    -0.14
     Recently
    -0.14
     minul
    -0.14
    æĺ¨
    -0.13
    rix
    -0.13
    oned
    -0.13
    POSITIVE LOGITS
     until
    0.27
    until
    0.23
     gradually
    0.23
     during
    0.22
    Until
    0.22
     Until
    0.21
     Beginning
    0.21
     beginning
    0.20
     Grad
    0.20
    Beginning
    0.20
    Act Density 0.208%

    No Known Activations