INDEX
Explanations
mentions of medical care or attention to health-related matters
references to various types of care
New Auto-Interp
Negative Logits
ãĥĥãĥī
-0.81
kered
-0.80
ãĥ³ãĤ¸
-0.79
onite
-0.68
bluff
-0.67
âĸ¬
-0.65
akedown
-0.63
repr
-0.62
awe
-0.61
Mines
-0.61
POSITIVE LOGITS
taker
1.53
giving
1.23
lessly
1.07
tta
1.04
taking
1.03
fully
0.93
lessness
0.91
lington
0.88
free
0.86
maid
0.86
Activations Density 0.025%