INDEX
Explanations
words related to participation and participatory concepts
New Auto-Interp
Negative Logits
inged
-0.18
halt
-0.17
ingers
-0.16
ings
-0.15
hiba
-0.15
liž
-0.15
.yahoo
-0.14
inger
-0.14
INGER
-0.14
zeÅĪ
-0.14
POSITIVE LOGITS
ip
0.32
ipation
0.28
ip
0.27
IP
0.27
Ip
0.26
ipe
0.24
ipa
0.24
ipp
0.24
Ip
0.24
-ip
0.23
Activations Density 0.007%