INDEX
Negative Logits
LGBTQ          0.51
Indigenous     0.44
clueless       0.44
disappointing  0.44
いろんな       0.42
🫤             0.42
الناس          0.42
संवै           0.41
السّ           0.41
려는           0.40
Positive Logits
actions     0.80
pursuits    0.69
endeavors   0.68
activities  0.68
usages      0.66
areas       0.64
tasks       0.63
acciones    0.63
affairs     0.61
actions     0.61
Activations Density: 0.002%