INDEX
Explanations
phrases related to winning and success
New Auto-Interp
Negative Logits
Factor
-0.68
senal
-0.66
ria
-0.59
periphery
-0.58
bian
-0.58
gypt
-0.57
matured
-0.56
uras
-0.56
plun
-0.56
rete
-0.56
POSITIVE LOGITS
't
1.07
now
1.06
nings
1.04
throp
0.99
cest
0.91
ced
0.88
cing
0.84
ipeg
0.83
ests
0.82
hardt
0.81
Activations Density 0.395%