INDEX
Explanations
instances of words related to infections or diseases
terminology related to infections and infectious diseases
Negative Logits
compr      -0.78
kee        -0.71
rend       -0.67
lawy       -0.67
umbn       -0.64
YC         -0.63
ibur       -0.63
MAG        -0.63
tallest    -0.62
OPA        -0.62
Positive Logits
infected   0.93
infect     0.92
ious       0.89
infect     0.87
infection  0.85
outbreak   0.83
iosis      0.83
bacteria   0.83
inoc       0.83
etts       0.82
Activations Density 0.044%