INDEX
    Explanations

    references to specific years

    Negative Logits
    Token      Logit
    "969"      -0.18
    "COVID"    -0.18
    " COVID"   -0.17
    "ima"      -0.16
    "569"      -0.16
    "itary"    -0.15
    " Covid"   -0.15
    "59"       -0.15
    "03"       -0.15
    "04"       -0.15
    Positive Logits
    Token       Logit
    " Sevent"   0.20
    " 七"       0.19
    "Eight"     0.18
    "۷"         0.18
    "eight"     0.17
    "७"         0.17
    " Eight"    0.17
    " sevent"   0.17
    "七"        0.17
    "7"         0.17
    Act Density 0.048%

    No Known Activations