INDEX
Explanations
emoji sequences often representing emotions or reactions
special characters and symbols, particularly those commonly used in social media contexts
Negative Logits
Manit       -0.70
Gillespie   -0.70
Abyssal     -0.68
Sakuya      -0.67
Sidd        -0.67
Takeru      -0.66
wagen       -0.65
Reloaded    -0.64
Bain        -0.64
recorder    -0.63
Positive Logits
¥ŀ          1.29
Ĵ           1.28
Ķ           1.25
ĵ           1.24
Į           1.21
ĻĤ          1.18
į           1.18
ĵĺ          1.16
İ           1.15
ķ           1.08
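
The positive-logit tokens above look like byte-level BPE pieces rather than readable text. The sketch below shows one way to map them back to raw bytes. It assumes the model's tokenizer uses the GPT-2 byte-to-unicode table, which the dashboard does not state. The names `bytes_to_unicode`, `UNICODE_TO_BYTE`, `token_to_bytes`, and `TOKENS` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable keep their own code point.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to an unused code point starting at 256.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token):
    """Convert a displayed token string back to the raw bytes it stands for."""
    return bytes(UNICODE_TO_BYTE[ch] for ch in token)


# Top positive-logit tokens copied from the list above.
TOKENS = ["¥ŀ", "Ĵ", "Ķ", "ĵ", "Į", "ĻĤ", "į", "ĵĺ", "İ", "ķ"]

for tok in TOKENS:
    print(repr(tok), token_to_bytes(tok).hex(" "))
```

Under that assumption, every byte recovered from these tokens falls in the range 0x80 to 0xBF, the UTF-8 continuation-byte range. That fits the explanations given for this feature: emoji and special symbols are encoded as multi-byte UTF-8 sequences, so the feature would be promoting the trailing bytes of such characters.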
Activations Density 0.007%