INDEX
Negative Logits
Doch
0.78
மனைவி
0.77
Soup
0.74
berbagi
0.72
asil
0.72
аб
0.71
creatic
0.71
javac
0.70
WithDictionary
0.70
Pooling
0.70
POSITIVE LOGITS
queer
0.88
gay
0.82
gay
0.69
gays
0.66
LGBT
0.65
ní
0.64
homosexual
0.63
transgender
0.63
LGBTQ
0.63
human
0.61
Activations Density 0.019%