INDEX
Explanations
terms related to political leadership and majorities
Negative Logits
thouse       -0.16
oto          -0.15
ogl          -0.15
Manning      -0.15
olumn        -0.15
oulouse      -0.15
mann         -0.15
ÙİØ£         -0.15
ording       -0.15
sup          -0.14
Positive Logits
azzi         0.16
idir         0.15
anytime      0.14
heck         0.14
Carp         0.14
_compiler    0.14
stract       0.14
èĻ           0.14
ires         0.14
ier          0.14
Activations Density 0.029%