INDEX
Explanations
expressions of gratitude
New Auto-Interp
Negative Logits
يتيمه
-0.81
s
-0.76
prolly
-0.70
Camus
-0.69
hoga
-0.68
حياته
-0.68
حياتها
-0.67
juz
-0.67
Hues
-0.67
mers
-0.64
POSITIVE LOGITS
Thank
1.61
thank
1.60
Thank
1.52
thank
1.42
THANK
1.31
THANK
1.23
kyou
1.12
thanking
0.98
Thankyou
0.96
thanked
0.94
Activations Density 0.048%