INDEX
Explanations
mentions of specific people, sports teams, and events within news articles
New Auto-Interp
Negative Logits
ãģĻ
-0.65
$.
-0.63
thereof
-0.57
..."
-0.56
guiActiveUnfocused
-0.55
Redd
-0.53
ãĤ¤ãĥĪ
-0.52
Eva
-0.52
}.
-0.52
requisite
-0.52
POSITIVE LOGITS
meanwhile
1.10
reacted
0.88
responded
0.83
fared
0.82
declined
0.79
spokesman
0.75
reportedly
0.74
also
0.74
awoke
0.74
grew
0.74
Activations Density 0.887%