INDEX
    Explanations

    frequent mentions of COVID-19

    New Auto-Interp
    Negative Logits
    lopen
    -0.17
     Covid
    -0.15
     åİ
    -0.14
     COVID
    -0.14
    bbe
    -0.14
    Defaults
    -0.14
    éł¼
    -0.14
    Overrides
    -0.13
    гÑĥ
    -0.13
    aves
    -0.13
    POSITIVE LOGITS
    19
    0.40
    -
    0.39
    019
    0.31
    ãĥ¼
    0.24
     nineteen
    0.21
    gnore
    0.20
    18
    0.19
    Û±Û¹
    0.19
    iloc
    0.19
    -âĢIJ
    0.18
    Act Density 0.009%

    No Known Activations