INDEX
Explanations
references to disease diagnosis and potential exposure to infectious viruses
New Auto-Interp
Negative Logits
verity
-0.17
olik
-0.16
isas
-0.15
deaux
-0.15
observer
-0.14
IRST
-0.14
ectl
-0.14
NotExist
-0.14
inium
-0.14
-Ñħ
-0.14
POSITIVE LOGITS
Exposure
0.25
exposure
0.24
contact
0.24
close
0.23
contacts
0.22
close
0.22
contact
0.21
exposed
0.21
Contact
0.20
конÑĤак
0.20
Activations Density 0.028%