INDEX
Explanations
words related to news stories and current events, possibly focused on politics, sports, and health issues
New Auto-Interp
Negative Logits
ovember
-0.57
eco
-0.57
yang
-0.57
urized
-0.56
Footnote
-0.54
wild
-0.53
tracing
-0.53
storm
-0.53
sucker
-0.53
harmon
-0.53
POSITIVE LOGITS
cgi
0.78
interstitial
0.76
subscribing
0.63
php
0.63
âĿ
0.62
unctions
0.61
à¤
0.60
oa
0.59
Mu
0.59
iframe
0.57
Activations Density 0.022%