INDEX
Explanations
words associated with societal issues and controversies surrounding LGBTQ+ topics
New Auto-Interp
Negative Logits
-0.56
(
-0.53
↵↵
-0.51
↵
-0.44
Джерела
-0.42
nueces
-0.41
Välislingid
-0.41
.
-0.39
<sup>
-0.38
//
-0.38
POSITIVE LOGITS
kasarigan
1.28
متعلقه
1.28
DebuggerNonUser
1.27
IsContent
1.24
myſelf
1.11
Билгалдахарш
1.05
itſelf
1.03
himſelf
0.98
فريبيس
0.97
betweenstory
0.95
Activations Density 2.976%