INDEX
Explanations
sentences that contain significant legal or political commentary
New Auto-Interp
Negative Logits
rell
-0.20
мов
-0.15
/Dk
-0.14
Heller
-0.14
stants
-0.14
缤
-0.14
UTH
-0.14
cond
-0.13
eth
-0.13
cente
-0.13
POSITIVE LOGITS
odyn
0.15
kim
0.15
kı
0.14
IonicModule
0.14
ãĤ¸
0.14
zug
0.14
wij
0.14
iba
0.14
cj
0.13
imiento
0.13
Activations Density 0.518%