INDEX
Explanations
various stances and opinions regarding political and social issues, particularly those related to legislation
New Auto-Interp
Negative Logits
vow
-0.16
cek
-0.16
BOOLE
-0.15
adge
-0.14
ALCHEMY
-0.14
Reviewer
-0.13
má
-0.13
rana
-0.13
sein
-0.13
jadx
-0.13
POSITIVE LOGITS
idea
0.23
allowing
0.21
efforts
0.19
continued
0.19
expansion
0.19
proposals
0.19
extending
0.18
continuation
0.18
Idea
0.18
/op
0.18
Activations Density 0.251%