INDEX
Explanations
mentions of treatments for various medical conditions
New Auto-Interp
Negative Logits
Vide
-0.58
Aerospace
-0.57
arrass
-0.56
gow
-0.55
inet
-0.54
asketball
-0.53
inx
-0.52
ova
-0.51
bidding
-0.51
hover
-0.51
POSITIVE LOGITS
ises
0.93
ments
0.83
illnesses
0.80
ailments
0.76
ise
0.76
diseases
0.76
ties
0.73
patients
0.69
illness
0.68
ts
0.68
Activations Density 8.637%