INDEX
Explanations
phrases related to news stories covering the LGBT community globally
references to global events or topics related to the LGBT community
New Auto-Interp
Negative Logits
nowhere
-0.61
bottleneck
-0.59
earlier
-0.56
harsher
-0.55
dime
-0.53
worse
-0.53
bigger
-0.53
withdrawals
-0.53
later
-0.53
IPO
-0.52
POSITIVE LOGITS
ãĥĦ
0.70
¥
0.70
rave
0.69
eem
0.69
¯¯¯¯¯¯¯¯
0.68
oise
0.67
ractor
0.66
assador
0.65
lete
0.64
ï
0.64
Activations Density 0.590%