INDEX
Explanations
expressions of gratitude and emotional connections
New Auto-Interp
Negative Logits
_AA
-0.14
incy
-0.14
.idea
-0.14
Tubes
-0.14
طة
-0.13
AGMA
-0.13
á»ĭch
-0.13
aden
-0.13
Ø·ÙĦب
-0.13
YPE
-0.12
POSITIVE LOGITS
heart
0.95
hearts
0.88
Heart
0.76
heart
0.76
-heart
0.74
Heart
0.71
Hearts
0.69
å¿ĥ
0.61
coraz
0.59
ÑģеÑĢд
0.59
Activations Density 0.176%