INDEX
Explanations
concepts related to altruism and community support
Negative Logits
internetowa   -0.66
estekak       -0.61
تضيفلها       -0.61
copg          -0.58
legitimacy    -0.58
שוליים        -0.55
جغرافيا       -0.55
audiovisuel   -0.54
kháu          -0.54
Ārējās        -0.53
Positive Logits
caring        0.71
solidarité    0.66
duty          0.62
rescuing      0.60
support       0.59
caring        0.59
charité       0.58
Pflicht       0.57
rescue        0.57
dévou         0.57
Activations Density: 0.354%