INDEX
Explanations
emojis and special characters that express emotions, money, and food
New Auto-Interp
Negative Logits
Ã
-0.54
"
-0.52
'
-0.49
L
-0.48
Unter
-0.47
Ans
-0.46
\"
-0.46
typelib
-0.43
áu
-0.43
â
-0.43
POSITIVE LOGITS
0.94
Cæsar
0.82
Савезне
0.81
bivolt
0.79
heureuse
0.76
itſelf
0.76
Jefus
0.76
Monfieur
0.76
Theſe
0.75
elemField
0.73
Activations Density 0.121%