INDEX
Explanations
phrases related to using tools or performing actions
New Auto-Interp
Negative Logits
Career
-0.91
Politics
-0.88
Events
-0.85
Carib
-0.78
isSpecialOrderable
-0.78
Communities
-0.77
ylum
-0.76
Sov
-0.74
Ukrain
-0.74
Equity
-0.74
POSITIVE LOGITS
gently
1.33
shove
1.30
submer
1.27
slit
1.22
scrape
1.21
chop
1.20
peel
1.20
spray
1.20
remove
1.19
sprinkle
1.18
Activations Density 0.419%