INDEX
    Explanations

    references to the New Year and its associated traditions or resolutions

    New Auto-Interp
    Negative Logits
    irt
    -0.17
    CanBe
    -0.16
     pat
    -0.15
    okin
    -0.15
    ajs
    -0.14
    lef
    -0.14
    jem
    -0.14
    à¥Ģय
    -0.14
    ais
    -0.14
     Late
    -0.13
    POSITIVE LOGITS
    اÙĨÙĩ
    0.16
    arie
    0.15
    \Collections
    0.14
    ç«ĭãģ¦
    0.14
    uzzi
    0.14
    ming
    0.14
    æŃ´
    0.13
    abies
    0.13
    æ¶²
    0.13
     Boxes
    0.13
    Act Density 0.020%

    No Known Activations