INDEX
Explanations
references to betting or gambling terms
"bet" or its variants
bet, Beta, or β
New Auto-Interp
Negative Logits
rentina
-0.57
UniformLocation
-0.51
кру
-0.51
mika
-0.47
unately
-0.46
jkl
-0.44
自
-0.44
mousse
-0.43
PRINCIP
-0.43
concor
-0.43
POSITIVE LOGITS
BET
0.87
bets
0.84
bet
0.77
RegressionTest
0.76
Betten
0.73
Bets
0.71
Bet
0.70
bet
0.69
Bets
0.69
""],
0.69
Activations Density 0.117%