Explanations
Mentions of COVID-19 and its variants, along with related public health discussions
Negative Logits
INTR          -0.17
usercontent   -0.15
blade         -0.15
ÙĪÙĩ          -0.15
ãĥªãĥ³ãĤ°     -0.15
εÏĨ           -0.14
Unhandled     -0.14
cene          -0.14
æĺ            -0.13
didFinish     -0.13
Positive Logits
cases         0.28
Cases         0.27
mask          0.26
case          0.23
masks         0.23
cases         0.23
Cases         0.22
Masks         0.22
booster       0.22
masking       0.22
Activations Density 0.089%