INDEX
Explanations
mentions of LGBT Pride events and related figures, particularly Taylor Swift
Negative Logits
Token       Logit
umer        -0.16
ritz        -0.16
zin         -0.15
raj         -0.15
ooth        -0.15
collapse    -0.15
wasted      -0.14
Collapse    -0.14
世          -0.14
autoload    -0.13
Positive Logits
Token         Logit
Reputation    0.25
Tay           0.22
Folk          0.21
aylor         0.21
Taylor        0.20
tay           0.20
Taylor        0.19
Swift         0.19
Shake         0.18
Swift         0.18
Activations Density 0.007%
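
The density figure above can be read as the share of tokens on which this feature fires. The sketch below is a minimal illustration under that assumption: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard or from any particular library.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    is active (activation strictly greater than the threshold).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 7 of 100,000 tokens.
sample = [0.0] * 99_993 + [1.5] * 7
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.007%
```

Under this reading, a density of 0.007% means the feature is active on roughly 7 tokens in every 100,000, which fits a feature tied to a narrow topic.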