INDEX
Explanations
terms related to bullying or being bullied
terms related to bullying and its effects
New Auto-Interp
Negative Logits
Donation
-0.77
aeda
-0.76
ittance
-0.74
arbon
-0.72
ossier
-0.72
vid
-0.70
ixed
-0.70
cession
-0.70
ixture
-0.69
ournal
-0.69
POSITIVE LOGITS
bullies
1.24
bullying
1.19
bully
1.08
bullied
1.08
behav
0.85
ãħĭ
0.82
pul
0.80
kids
0.77
slurs
0.71
HAEL
0.69
Activations Density 0.012%