INDEX
Explanations
references to local governance or community-related topics
New Auto-Interp
Negative Logits
гÑĢа
-0.17
imb
-0.15
å¹ķ
-0.15
ogh
-0.15
iest
-0.14
/todo
-0.14
backs
-0.13
Ùĥس
-0.13
ier
-0.13
è¢ĸ
-0.13
POSITIVE LOGITS
ised
0.31
vore
0.26
/global
0.26
-global
0.25
/local
0.24
isation
0.23
izing
0.23
izable
0.22
ities
0.22
ized
0.21
Activations Density 0.034%