INDEX
Explanations
words related to medical conditions and treatments
words associated with illness or health-related terminology
Negative Logits
maker          -0.75
interpre       -0.72
appropriated   -0.71
cessation      -0.70
interpreter    -0.69
revenues       -0.68
river          -0.67
RAG            -0.66
creditor       -0.66
peak           -0.66
Positive Logits
coli           1.17
illus          1.02
Magikarp       0.98
sie            0.96
ococ           0.94
mone           0.93
amins          0.91
_(             0.82
Õ              0.82
irus           0.81
Activations Density 0.023%