INDEX
Explanations
instances of gratitude or expressions of thanks
New Auto-Interp
Negative Logits
ynn
-0.18
Jad
-0.17
елик
-0.16
ded
-0.16
addock
-0.15
coat
-0.15
Ded
-0.15
äre
-0.15
_factory
-0.15
Dün
-0.14
POSITIVE LOGITS
omik
0.15
нÑĥл
0.15
ono
0.14
HEME
0.14
enha
0.14
iyim
0.14
رÙģØª
0.14
(;;
0.14
amo
0.14
ãĤ¹ãĤ¿ãĥ¼
0.13
Activations Density 0.037%