INDEX
Explanations
information related to credit, acknowledgments, and mention
promotional content for products or events
New Auto-Interp
Negative Logits
intermedi
-0.77
indiscrim
-0.71
discredited
-0.71
milit
-0.70
unamb
-0.70
nonexistent
-0.70
llah
-0.69
sharply
-0.69
perceived
-0.68
exacerb
-0.68
POSITIVE LOGITS
Updates
1.08
Theme
1.06
Features
1.05
Releases
1.04
Credits
1.02
Announce
1.00
Reviews
0.99
Featured
0.98
Contents
0.98
Newsletter
0.97
Activations Density 0.739%