INDEX
Explanations
words associated with victories and wins in competitive contexts
New Auto-Interp
Negative Logits
useCallback
-0.78
]';
-0.77
]").
-0.71
,$_
-0.71
"}")
-0.70
Референце
-0.70
]').
-0.69
}();
-0.68
Yourself
-0.66
geest
-0.64
POSITIVE LOGITS
win
1.92
win
1.85
Win
1.82
Win
1.80
WIN
1.73
WIN
1.59
Wins
1.59
wins
1.55
Wins
1.53
wins
1.40
Activations Density 0.039%