INDEX
Explanations
instances where an article or piece of text is complete
strong indicators of urgency or significant events
Negative Logits
tradem     -0.79
amnesty    -0.75
reper      -0.74
thodox     -0.74
undai      -0.74
exting     -0.73
eleph      -0.70
ilty       -0.70
acquies    -0.69
destro     -0.69
Positive Logits
CLOSE          1.15
WASHINGTON     1.07
Trivia         1.06
Abstract       1.05
Still          1.05
Description    1.04
Overview       1.03
Story          1.02
Untitled       1.01
Breaking       1.01
Activations Density 0.116%