INDEX
Explanations
news or information related to current events and politics
phrases related to consequential actions or events
Negative Logits
Token         Logit
iably         -0.84
oresc         -0.82
fortunately   -0.80
entimes       -0.80
omever        -0.80
roximately    -0.79
inarily       -0.78
atu           -0.78
ÙĴ            -0.77
ogether       -0.76
Positive Logits
Token         Logit
probe         1.00
feds          0.97
Kavanaugh     0.89
'             0.88
NYT           0.85
rift          0.84
pope          0.81
showdown      0.81
Latest        0.81
aftermath     0.81
Activations Density: 0.520%