INDEX
    Explanations

    references to New Year's celebrations and related events

    New Auto-Interp
    Negative Logits
    erness
    -0.16
    ler
    -0.15
    odo
    -0.15
    ITES
    -0.14
     dele
    -0.14
    ernes
    -0.14
    arkin
    -0.14
    .reporting
    -0.14
    oux
    -0.13
    umo
    -0.13
    POSITIVE LOGITS
     Eve
    0.21
    (New
    0.15
     eve
    0.15
    .NEW
    0.15
    /New
    0.14
    Resolve
    0.14
     resolutions
    0.14
    raud
    0.13
    .NewLine
    0.13
    atif
    0.13
    Act Density 0.013%

    No Known Activations