INDEX
NEGATIVE LOGITS
Token        Logit
unchecked    -0.09
整           -0.08
/*           -0.08
sanitario    -0.08
adept        -0.08
/*           -0.08
Provider     -0.08
Buffer       -0.07
impair       -0.07
laz          -0.07
POSITIVE LOGITS
Token            Logit
controversial    0.12
controversy      0.11
controversies    0.10
extremist        0.09
polém            0.09
contro           0.09
provocative      0.09
hateful          0.09
онлайн           0.09
विवाद            0.09
Activations Density: 0.054%