INDEX
Explanations
phrases related to economic and political commentary, government actions, and societal issues
New Auto-Interp
Negative Logits
dimension
-0.59
ãĥĪ
-0.58
ãĥĻ
-0.57
\":
-0.55
å°Ĩ
-0.55
ewitness
-0.54
ãĥŀ
-0.52
ãģĹ
-0.51
itely
-0.50
pires
-0.49
POSITIVE LOGITS
;
1.12
whereas
1.01
because
1.01
.;
0.99
though
0.98
although
0.95
but
0.93
however
0.88
unless
0.83
.
0.82
Activations Density 1.433%