INDEX
Explanations
phrases related to political news or events
brackets and punctuation
Negative Logits
gems            -0.67
fandom          -0.66
ties            -0.63
assemblies      -0.62
unaccompanied   -0.61
papers          -0.60
stood           -0.60
disciplines     -0.58
joints          -0.58
setting         -0.58
Positive Logits
WATCHED         1.05
Continued       1.03
Expand          0.88
Photos          0.85
ccording        0.85
Read            0.83
Meanwhile       0.79
Related         0.78
Attempts        0.77
Advertisement   0.77
Activations Density 0.194%