INDEX
Explanations
phrases related to politics and political figures
references to notable individuals and their achievements or contributions
New Auto-Interp
Negative Logits
.",
-0.59
Decay
-0.58
ravings
-0.54
resy
-0.54
<-
-0.52
izoph
-0.52
:,
-0.51
Prelude
-0.50
rex
-0.50
ubi
-0.50
POSITIVE LOGITS
})
0.75
)}
0.70
)—
0.68
)|
0.66
)]
0.60
)
0.59
)</
0.59
*)
0.59
interstitial
0.55
gram
0.55
Activations Density 2.456%