INDEX
Explanations
references to political decisions and actions regarding LGBTQ+ issues
New Auto-Interp
Negative Logits
ï¼´
-0.15
stalk
-0.15
æŃ
-0.14
ÏīÏĤ
-0.14
salopes
-0.14
áme
-0.13
ÙĪØº
-0.13
bbe
-0.13
âĢIJ
-0.13
declspec
-0.13
POSITIVE LOGITS
Wednesday
0.20
nearly
0.20
Thursday
0.20
âģ
0.20
Tuesday
0.19
roughly
0.18
âĢķ
0.18
—
0.18
âģ
0.18
0.17
Activations Density 0.518%