INDEX
Explanations
mentions of medical research topics and their implications
New Auto-Interp
Negative Logits
TestBed
-0.40
oyuncak
-0.38
embaraz
-0.38
carreira
-0.37
FlatAppearance
-0.36
curiosidad
-0.36
UseVisualStyle
-0.36
murni
-0.36
ModelExpression
-0.34
bề
-0.34
POSITIVE LOGITS
COVID
0.91
Covid
0.84
COVID
0.82
Covid
0.80
covid
0.79
coronavirus
0.73
Coronavirus
0.69
covid
0.69
Coronavirus
0.66
pandemic
0.64
Activations Density 0.205%