INDEX
Explanations
expressions of commitment and participation in activities or groups
New Auto-Interp
Negative Logits
åīĽ
-0.16
vang
-0.14
stripe
-0.13
ock
-0.13
Else
-0.13
elsewhere
-0.13
nid
-0.12
ç¤
-0.12
ITHER
-0.12
ัย
-0.12
POSITIVE LOGITS
part
0.94
part
0.64
parte
0.61
ÑĩаÑģÑĤÑĮ
0.57
apart
0.52
_part
0.51
-part
0.50
Part
0.50
.part
0.50
PART
0.49
Activations Density 0.241%