INDEX
Explanations
references to government structure and political systems
New Auto-Interp
Negative Logits
çĤİ
-0.16
lander
-0.16
uyu
-0.15
osite
-0.14
åħ¹
-0.14
ÄĽn
-0.14
↵↵
-0.14
.Operator
-0.13
еÑĤа
-0.13
Äįel
-0.13
POSITIVE LOGITS
quiz
0.17
auses
0.16
Quiz
0.16
UNIT
0.16
bserv
0.15
pol
0.14
ãĤīãģĽ
0.14
bir
0.14
bab
0.14
Unit
0.13
Activations Density 0.174%