Explanations
phrases related to political and government discussions
Negative Logits
olves            -0.74
aukee            -0.64
SourceFile       -0.63
nexus            -0.63
igers            -0.61
eers             -0.60
zing             -0.59
覚醒             -0.58
forest           -0.57
crop             -0.56
Positive Logits
alas             1.06
unlike           0.99
although         0.98
contrary         0.97
despite          0.97
according        0.92
somew            0.89
barring          0.88
notwithstanding  0.86
interestingly    0.85
Activations Density 0.125%