INDEX
Explanations
instances of empathy and charitable actions toward children in need
New Auto-Interp
Negative Logits
еко
-0.16
ien
-0.15
Caucus
-0.15
icios
-0.15
Culture
-0.14
elier
-0.14
mando
-0.14
olum
-0.14
venta
-0.14
odu
-0.14
POSITIVE LOGITS
Gran
0.17
commitment
0.15
alsa
0.15
>[]
0.15
kar
0.15
adopted
0.15
лаÑĪ
0.15
direct
0.14
ulls
0.14
ulk
0.14
Activations Density 0.194%