Explanations
concepts related to governance and societal structures
Negative Logits
/Core    -0.17
/not     -0.16
inton    -0.15
irus     -0.15
/we      -0.15
recur    -0.15
.mvp     -0.15
aller    -0.15
ALAR     -0.14
ness     -0.14
Positive Logits
/legal       0.21
-cultural    0.19
上的         0.19
açı          0.19
-economic    0.19
/ge          0.17
/ec          0.17
적           0.16
/math        0.16
/pol         0.16
Activations Density 0.261%