INDEX
    Explanations

    temporal references like dates, days of the week, and time-related words

    temporal references such as time-related words and phrases

    New Auto-Interp
    Negative Logits
    CRE
    -0.71
    toc
    -0.65
    ategy
    -0.60
    abase
    -0.60
    grave
    -0.58
    ĪĴ
    -0.58
     Tide
    -0.58
    andestine
    -0.58
    TY
    -0.57
     nonex
    -0.56
    POSITIVE LOGITS
    è£ħ
    0.80
    lier
    0.76
    grade
    0.69
    iven
    0.66
     Airways
    0.62
     rumours
    0.61
    ãĥŁ
    0.60
    nis
    0.60
    UFC
    0.59
     thicker
    0.59
    Act Density 0.408%

    No Known Activations