Explanations
references to viruses and viral outbreaks
Negative Logits
Token       Logit
onne        -0.16
vit         -0.15
staging     -0.15
pret        -0.15
Churchill   -0.14
&view       -0.14
Dirt        -0.14
ova         -0.14
osen        -0.14
Pret        -0.14
Positive Logits
Token       Logit
ARS         0.23
coron       0.22
Middle      0.21
bet         0.20
pneumonia   0.20
civ         0.20
-Co         0.19
pang        0.18
ACE         0.18
Coron       0.18
Activations Density 0.017%