INDEX
Explanations
phrases related to specific body parts or objects worn on the body
colloquial expressions and metaphors related to challenges and societal issues
Negative Logits
ーティ          -0.83
arov            -0.77
lav             -0.71
asus            -0.69
Gam             -0.69
Sah             -0.68
ケ              -0.68
horm            -0.66
ヴァ            -0.65
rig             -0.64
Positive Logits
financially     0.88
enance          0.79
offensively     0.77
academ          0.73
economically    0.70
ideologically   0.69
regarding       0.68
politically     0.68
™               0.67
mentality       0.65
Activations Density 0.451%