INDEX
Explanations
references to political and social events involving LGBTQ+ rights and issues
New Auto-Interp
Negative Logits
endoza
-0.17
925
-0.15
pent
-0.15
amento
-0.15
425
-0.14
šov
-0.14
ванов
-0.14
tro
-0.14
omm
-0.14
leo
-0.13
POSITIVE LOGITS
INTR
0.16
'=>"
0.15
utor
0.14
'=>['
0.14
är
0.14
abbr
0.14
ingroup
0.14
Ñıл
0.14
Winn
0.14
Sin
0.14
Activations Density 0.217%