INDEX
Explanations
mentions of COVID-19 and its impacts in various contexts
New Auto-Interp
Negative Logits
COVID
-0.25
Covid
-0.25
covid
-0.22
COVID
-0.20
Coronavirus
-0.18
coronavirus
-0.16
tin
-0.15
owing
-0.15
lags
-0.14
ovid
-0.14
POSITIVE LOGITS
19
0.23
-related
0.22
related
0.22
ymes
0.18
-response
0.18
019
0.17
cases
0.17
-era
0.17
gnore
0.16
related
0.16
Activations Density 0.027%