INDEX
Explanations
winning-related words and phrases
mentions of winning or success
New Auto-Interp
Negative Logits
underground
-0.78
arom
-0.76
involuntary
-0.72
serpent
-0.69
acid
-0.68
infusion
-0.67
feral
-0.65
contempl
-0.64
antim
-0.63
masked
-0.63
POSITIVE LOGITS
Win
4.00
win
2.15
WIN
2.13
Win
1.67
WIN
1.57
Winning
1.50
winning
1.46
Winner
1.45
winner
1.35
win
1.26
Activations Density 0.013%