INDEX
Explanations
instances of community engagement and responses to local issues
New Auto-Interp
Negative Logits
æĺ¯ä¸Ģ个
-0.22
æīĢæľī
-0.21
çļĦä¸Ģ个
-0.20
ä¸ī个
-0.16
모ëĵł
-0.16
æĺ¯ä¸ª
-0.16
itself
-0.16
ä»»ä½ķ
-0.15
everybody
-0.15
má»įi
-0.15
POSITIVE LOGITS
either
0.46
either
0.37
Either
0.36
themselves
0.35
Either
0.33
EITHER
0.32
либо
0.29
or
0.28
various
0.26
varying
0.25
Activations Density 1.191%