INDEX
    Explanations

    temporal phrases indicating specific times or dates

    New Auto-Interp
    Negative Logits
    nodoc
    -0.17
    indsight
    -0.17
     بخش
    -0.15
    .Networking
    -0.15
    buch
    -0.14
    xeb
    -0.14
    EXPR
    -0.14
    ÑĨей
    -0.13
    راد
    -0.13
    bestos
    -0.13
    POSITIVE LOGITS
    889
    0.15
    xfff
    0.15
     Gott
    0.14
    icy
    0.14
    agues
    0.14
    ays
    0.13
    .dw
    0.13
     Dw
    0.13
     McCl
    0.13
    chia
    0.13
    Act Density 0.012%

    No Known Activations