Explanations
concepts related to family dynamics and interpersonal relationships
Negative Logits
ÙĪÙĬÙĦ        -0.15
stoi          -0.15
receiving     -0.14
åıĹåΰ        -0.14
Receive       -0.14
osto          -0.14
blast         -0.14
esson         -0.14
rice          -0.14
cose          -0.14
Positive Logits
contribution  0.19
contribute    0.18
cause         0.16
capable       0.16
æķĪ           0.15
contrib       0.15
Actions       0.15
contrib       0.15
<quote        0.15
EPS           0.15
Activations Density 0.038%