INDEX
Explanations
mentions of media organizations and news-related terms
words and abbreviations relevant to media organizations and journalistic sources
New Auto-Interp
Negative Logits
)",
-0.61
Valve
-0.58
Magikarp
-0.55
franchise
-0.54
setting
-0.53
fame
-0.53
decomp
-0.53
dressing
-0.53
Borderlands
-0.52
colleg
-0.51
POSITIVE LOGITS
WATCHED
1.01
guiActiveUn
0.87
UNCLASSIFIED
0.77
NEWS
0.75
PHOTO
0.70
<|endoftext|>
0.70
UFF
0.70
Shutterstock
0.69
Photograph
0.68
SHARES
0.67
Activations Density 0.515%