INDEX
Explanations
expressions highlighting appreciation and recognition of caregivers
New Auto-Interp
Negative Logits
â
-0.42
â
-0.35
ÃIJ
-0.34
Â
-0.30
Ãİ
-0.29
ÃIJ
-0.29
ÂĢ
-0.27
Âħ
-0.27
Âĸ
-0.26
Ãİ
-0.24
POSITIVE LOGITS
0.87
0.84
0.73
0.67
↵↵
0.67
0.52

0.36
âĢĮ
0.32

0.31
$$
0.30
Activations Density 0.121%