INDEX
Explanations
phrases related to political figures and official positions
content related to significant public figures and events
New Auto-Interp
Negative Logits
iating
-0.57
interven
-0.55
theirs
-0.54
reper
-0.49
foremost
-0.49
iates
-0.48
hoard
-0.48
ilty
-0.47
iated
-0.47
ownership
-0.47
POSITIVE LOGITS
WASHINGTON
1.02
Untitled
0.99
Abstract
0.98
Description
0.91
Welcome
0.88
CLOSE
0.88
CTV
0.87
SAN
0.85
Overview
0.84
SCP
0.82
Activations Density 0.131%