INDEX
Explanations
words related to physical body parts, specifically focusing on the neck area
references to neck injuries or conditions
New Auto-Interp
Negative Logits
vation
-0.71
ãĥ¼ãĥ³
-0.68
ãĥĩ
-0.68
sunshine
-0.66
PLIED
-0.65
ðĿ
-0.64
intervening
-0.63
âĺħâĺħ
-0.63
izoph
-0.62
abeth
-0.61
POSITIVE LOGITS
lace
1.29
neck
0.98
erd
0.93
beard
0.89
Guitar
0.84
igans
0.83
sweater
0.82
bones
0.81
er
0.80
bone
0.80
Activations Density 0.010%