INDEX
    Explanations

    dates and temporal expressions

    NEGATIVE LOGITS
    Token       Logit
    "198"       -0.22
    "199"       -0.21
    "196"       -0.19
    " COVID"    -0.19
    "195"       -0.18
    "197"       -0.18
    "COVID"     -0.17
    "۱۹۹"       -0.17
    "192"       -0.17
    " Covid"    -0.17
    POSITIVE LOGITS
    Token       Logit
    "201"       0.33
    "۲۰۱"       0.22
    " Obama"    0.22
    "Obama"     0.21
    "012"       0.20
    " Twelve"   0.19
    "013"       0.18
    "13"        0.18
    " Barack"   0.18
    "12"        0.17
    Act Density 0.088%

    No Known Activations