Explanations
instances of illness or health-related issues
Negative Logits
Token        Logit
apes         -0.54
osphere      -0.53
homic        -0.50
pearl        -0.49
freude       -0.48
มาะ          -0.48
MetaObject   -0.48
homicidio    -0.48
sampah       -0.48
olymers      -0.48
Positive Logits
Token        Logit
health       0.99
diagnosed    0.95
suffering    0.94
health       0.90
diagnosis    0.88
symptoms     0.87
illnesses    0.84
diagnosis    0.82
suffer       0.82
suffers      0.82
Activations Density: 0.621%