INDEX
Explanations
phrases related to political and financial discussions
New Auto-Interp
Negative Logits
enthal
-0.68
LF
-0.62
OND
-0.57
SPA
-0.57
CLOSE
-0.56
oir
-0.56
(>
-0.55
wan
-0.55
(-
-0.55
Actor
-0.55
POSITIVE LOGITS
into
0.80
because
0.67
since
0.66
depending
0.66
than
0.65
when
0.64
inducing
0.64
according
0.63
bet
0.62
to
0.62
Activations Density 1.951%