INDEX
Explanations
references to the rainbow theme, particularly in the context of LGBTQ+ pride
New Auto-Interp
Negative Logits
áno
-0.16
PHY
-0.16
oga
-0.16
TON
-0.16
noch
-0.16
ager
-0.15
bir
-0.15
ton
-0.15
\<^
-0.15
ilon
-0.15
POSITIVE LOGITS
-striped
0.17
ÏĢη
0.15
iasi
0.15
COPE
0.14
ëĵľë¦¬
0.14
.want
0.14
ãĤ¤ãĥ³ãĥĪ
0.14
gın
0.14
ãĥªãĤ«
0.14
olik
0.14
Activations Density 0.021%