INDEX
Explanations
phrases related to competition or performance
variations of the words "spend," "scored," and "run."
New Auto-Interp
Negative Logits
ancial
-0.74
agall
-0.72
agher
-0.69
ulhu
-0.69
Seym
-0.68
aghan
-0.67
farious
-0.66
unal
-0.63
iard
-0.62
Cyan
-0.62
POSITIVE LOGITS
xual
1.04
dden
0.74
oeuv
0.74
imental
0.71
mitt
0.70
cers
0.70
uates
0.70
igated
0.68
urations
0.67
mosp
0.65
Activations Density 0.057%