INDEX
Explanations
relationships and dynamics between individuals and their social contexts
Negative Logits
เท
-0.14
.$$
-0.13
pinch
-0.13
apon
-0.13
phin
-0.13
');?>
-0.12
aki
-0.12
opian
-0.12
iram
-0.12
ons
-0.12
Positive Logits
）は
0.21
"is
0.19
人は
0.17
子は
0.16
)는
0.16
)
0.15
)은
0.14
것은
0.14
“
0.14
obl
0.14
Activations Density 1.509%