INDEX
Explanations
phrases related to various current events and news topics
phrases related to political events and their implications
New Auto-Interp
Negative Logits
.]
-0.75
.).
-0.74
'.
-0.71
'."
-0.70
'.
-0.68
.'"
-0.66
".
-0.65
].
-0.64
!".
-0.64
!.
-0.64
POSITIVE LOGITS
âĢ
1.79
âĢ
1.46
âĢł
1.34
ãĢ
1.29
â
1.13
âľ
1.08
âĹ
1.07
*,
1.06
âĶ
1.05
âĶ
1.05
Activations Density 1.272%