INDEX
Explanations
phrases relating to social or political movements or campaigns
references to personal or collective identity and advocacy
New Auto-Interp
Negative Logits
>[
-0.74
veins
-0.72
peculiar
-0.71
apologies
-0.67
uitous
-0.67
folklore
-0.64
umerable
-0.64
chem
-0.63
ĸļ
-0.63
olver
-0.63
POSITIVE LOGITS
Consent
0.91
Yourself
0.91
Own
0.89
Tomorrow
0.89
Funding
0.88
Slowly
0.86
Loud
0.82
Terrorism
0.82
ebook
0.82
Faster
0.81
Activations Density 0.318%