INDEX
Explanations
elements related to sharing and communal experiences
Negative Logits
Token       Logit
ãģĬãĤĬ      -0.17
ape         -0.16
ritz        -0.15
evin        -0.15
sic         -0.15
iams        -0.14
sume        -0.14
lum         -0.14
egral       -0.14
yle         -0.14
Positive Logits
Token               Logit
cro                 0.26
responsibility      0.23
custody             0.21
crop                0.18
openly              0.17
Responsibility      0.17
knowledge           0.17
/common             0.17
responsibilities    0.16
cust                0.16
Activations Density: 0.059%