INDEX
Explanations
expressions of empathy and kindness towards children and families
New Auto-Interp
Negative Logits
ute
-0.15
UTE
-0.15
ocz
-0.15
ů
-0.14
olini
-0.14
adlo
-0.14
rey
-0.14
reste
-0.14
auga
-0.14
marginal
-0.13
POSITIVE LOGITS
aven
0.18
piler
0.15
immer
0.15
chio
0.15
767
0.14
odge
0.14
508
0.14
Enumeration
0.14
abile
0.14
Cav
0.14
Activations Density 0.063%