Explanations
phrases related to failure or loss
references to the concept of defeat
Negative Logits
Token   Logit
liner   -0.87
atern   -0.74
pores   -0.73
olen    -0.70
azer    -0.64
OPER    -0.64
acca    -0.63
negie   -0.63
ORPG    -0.62
overe   -0.62
Positive Logits
Token    Logit
defeat   1.08
defeats  1.01
pport    0.87
ptives   0.84
nces     0.82
lessly   0.81
Defeat   0.80
enance   0.76
ingly    0.74
ery      0.72
Activations Density: 0.013%