INDEX
Explanations
various health-related terms and phrases, including medical conditions and treatments
aspects related to activities, practices, and engagement in various tasks
New Auto-Interp
Negative Logits
censored
-0.96
eleph
-0.94
seized
-0.89
oun
-0.89
unspecified
-0.87
exiled
-0.81
outgoing
-0.81
committed
-0.80
earthqu
-0.80
contested
-0.78
POSITIVE LOGITS
Avoid
1.78
Tips
1.77
Types
1.73
Learn
1.72
Tip
1.71
Sometimes
1.71
Use
1.70
Understanding
1.66
Remember
1.66
Know
1.65
Activations Density 0.347%