INDEX
Explanations
frequent mentions of COVID-19
New Auto-Interp
Negative Logits
lopen
-0.17
Covid
-0.15
åİ
-0.14
COVID
-0.14
bbe
-0.14
Defaults
-0.14
éł¼
-0.14
Overrides
-0.13
гÑĥ
-0.13
aves
-0.13
POSITIVE LOGITS
19
0.40
-
0.39
019
0.31
ãĥ¼
0.24
nineteen
0.21
gnore
0.20
18
0.19
Û±Û¹
0.19
iloc
0.19
-âĢIJ
0.18
Activations Density 0.009%