INDEX
Explanations
phrases related to participation and involvement in events
Negative Logits
اسÙĬ    -0.17
loh    -0.15
Choice    -0.15
apon    -0.15
ukan    -0.14
504    -0.14
eries    -0.14
pagen    -0.14
ellen    -0.14
orie    -0.13
Positive Logits
resh    0.16
dl    0.15
ÃĿ    0.14
Ø·ÙĨ    0.14
help    0.14
utsch    0.14
stood    0.14
cret    0.14
ients    0.13
ingroup    0.13
Activations Density 0.087%