Explanations
instances of the word "bullying."
instances of the word "bully" and its variations, focusing on themes of bullying in various contexts
Negative Logits
cise        -0.75
ateur       -0.75
aeda        -0.73
Donation    -0.73
cession     -0.72
ournal      -0.72
icrobial    -0.71
uncture     -0.70
uve         -0.69
arbon       -0.69
Positive Logits
bullies      1.13
bullying     0.99
bullied      0.93
bully        0.93
pul          0.86
behav        0.85
ãħĭ          0.80
iors         0.72
kids         0.71
girls        0.68
Activations Density 0.014%