INDEX
Explanations
phrases related to direct competition or comparison
phrases expressing competition or match-style events
New Auto-Interp
Negative Logits
nces
-0.66
recip
-0.64
Byzantine
-0.63
ilial
-0.62
nan
-0.62
icity
-0.62
cial
-0.61
conn
-0.61
tsky
-0.60
DOM
-0.59
POSITIVE LOGITS
quarters
0.85
shoulders
0.78
quartered
0.73
heels
0.67
tails
0.65
Exit
0.65
scissors
0.62
arted
0.60
ache
0.58
shove
0.58
Activations Density 0.083%