INDEX
Explanations
ideas related to politics and governance
Negative Logits
Miko      -0.82
ĪĴ        -0.78
²¾        -0.78
»Ĵ        -0.72
Huss      -0.70
xual      -0.68
vis       -0.67
vae       -0.66
proble    -0.63
xus       -0.63
Positive Logits
owship        0.76
ercise        0.76
roam          0.76
resp          0.72
PLAY          0.72
ktop          0.72
expression    0.70
flo           0.69
Initialized   0.66
heit          0.66
Activations Density: 0.081%