INDEX
Explanations
mentions of the COVID-19 pandemic
New Auto-Interp
Negative Logits
ᅠ
-0.55
ρους
-0.54
fromLTRB
-0.48
насељу
-0.48
orienta
-0.47
haj
-0.47
tain
-0.46
SO
-0.46
ModelExpression
-0.45
tragen
-0.45
POSITIVE LOGITS
írus
1.27
coronavirus
1.26
IRUS
1.18
coronavirus
1.12
Coronavirus
1.11
Coronavirus
1.06
COVID
0.99
Covid
0.96
pandemic
0.89
Covid
0.87
Activations Density 0.058%