INDEX
Explanations
references to bullying and feelings of being bullied
New Auto-Interp
Negative Logits
bab
-0.15
erialized
-0.15
BuilderFactory
-0.14
é¢
-0.13
ucs
-0.13
UTO
-0.13
.ManyToMany
-0.13
rray
-0.13
icopt
-0.13
.Dataset
-0.13
POSITIVE LOGITS
bullying
0.28
bully
0.25
bullied
0.24
ostr
0.21
bull
0.20
bul
0.20
school
0.20
clique
0.20
popularity
0.20
peer
0.20
Activations Density 0.113%