INDEX
Explanations
care followed by giving or receiving
New Auto-Interp
Negative Logits
الحياة
0.50
டிய
0.43
ス
0.42
lymphatiques
0.42
ក្ល
0.41
সুখ
0.41
등학교
0.40
জৈ
0.40
كلة
0.40
ுள்ளனர்
0.40
POSITIVE LOGITS
giver
0.80
Care
0.80
care
0.78
Care
0.75
giving
0.70
care
0.69
taker
0.66
en
0.63
taker
0.58
CARE
0.57
Activations Density 0.011%