INDEX
Explanations
phrases related to political hypocrisy and criticism
New Auto-Interp
Negative Logits
efined
-0.16
Progressive
-0.15
ongs
-0.14
uli
-0.14
omencl
-0.14
progressive
-0.14
AGO
-0.14
uguay
-0.13
Ãľ
-0.13
Scrolls
-0.13
POSITIVE LOGITS
catid
0.16
fflush
0.15
YYS
0.15
encount
0.15
licos
0.14
BITTE
0.14
Farrell
0.14
.palette
0.14
iphy
0.14
kontakte
0.13
Activations Density 0.001%