INDEX
    Explanations

    dates and specific temporal markers in the text

    New Auto-Interp
    Negative Logits
    uve
    -0.16
    onis
    -0.16
    ģm
    -0.15
    bnb
    -0.15
    riend
    -0.15
    iverz
    -0.15
    uyen
    -0.14
    ignon
    -0.14
    -prepend
    -0.14
     READY
    -0.14
    POSITIVE LOGITS
    _flutter
    0.15
    ippo
    0.14
     rele
    0.14
    elter
    0.14
    esar
    0.14
     Kahn
    0.14
    .named
    0.14
     Sund
    0.14
    enu
    0.14
     honesty
    0.13
    Act Density 0.047%

    No Known Activations