INDEX
Explanations
concepts related to elections and political accountability
Negative Logits
Token     Logit
y         -1.11
י         -1.05
ی         -0.98
er        -0.82
ised      -0.80
ième      -0.80
۰         -0.79
sweise    -0.73
o         -0.69
ه         -0.69
Positive Logits
Token     Logit
<bos>     1.12
papers    0.57
roms      0.56
vény      0.55
cedar     0.55
pośred    0.54
mens      0.53
peas      0.53
cob       0.52
хьтан     0.52
Activations Density: 1.255%