INDEX
Explanations
mentions of historical events or occurrences that happened in the past
phrases that indicate new or significant entities
New Auto-Interp
Negative Logits
imar
-0.75
Favorite
-0.75
Arcade
-0.73
ãĥĬ
-0.72
inently
-0.72
anism
-0.71
views
-0.71
AIDS
-0.71
frames
-0.70
Avoid
-0.70
POSITIVE LOGITS
colleague
1.06
handful
1.05
gunman
0.97
reporter
0.97
group
0.96
delegation
0.96
majority
0.95
consortium
0.95
spate
0.94
flurry
0.93
Activations Density 0.177%