INDEX
Explanations
words related to health conditions such as arthritis, joint pain, and fatigue
the pronoun "he" in various contexts
New Auto-Interp
Negative Logits
hips
-0.91
eleph
-0.70
GEAR
-0.69
Labrador
-0.61
domestically
-0.61
ãĥ¯
-0.59
rador
-0.59
enthal
-0.57
internationally
-0.57
linking
-0.56
POSITIVE LOGITS
isure
1.10
ller
0.99
lling
0.96
ALTH
0.95
atre
0.92
aven
0.91
aton
0.90
dule
0.89
arth
0.89
ather
0.88
Activations Density 0.043%