INDEX
Explanations
words related to physical health and well-being
references to health-related topics and issues
New Auto-Interp
Negative Logits
Helpful
-0.86
coni
-0.78
Reloaded
-0.71
Hundred
-0.70
âĸ¬
-0.67
Darkness
-0.63
xual
-0.63
Duo
-0.63
ãĥ£
-0.62
Downing
-0.61
POSITIVE LOGITS
care
1.11
care
1.09
iest
0.98
span
0.97
ily
0.95
Care
0.91
aceutical
0.90
professionals
0.88
Care
0.87
isot
0.86
Activations Density 0.031%