INDEX
Explanations
occurrences of the word "ban" followed by a number indicating the strength of the match
discussions about bans or prohibitions in various contexts
New Auto-Interp
Negative Logits
Generations
-0.81
LV
-0.70
ORY
-0.69
rious
-0.68
Io
-0.66
Sea
-0.66
ACTED
-0.65
Barg
-0.65
everal
-0.63
Fault
-0.62
POSITIVE LOGITS
hammer
1.15
zai
1.07
ishment
1.06
quet
0.91
bans
0.89
ish
0.86
ban
0.84
jo
0.82
banning
0.82
etooth
0.81
Activations Density 0.013%