INDEX
Explanations
expressions of excitement or enthusiasm
Negative Logits
=            -0.62
â            -0.49
Â            -0.45
Awesome      -0.42
ベーション    -0.41
awesome      -0.40
būs          -0.40
Â            -0.40
Chú          -0.38
geografic    -0.38
Positive Logits
🥺           0.84
,,,          0.81
abt          0.80
,,,,         0.77
✨           0.74
bc           0.71
🥰           0.70
,,           0.70
👀           0.69
😔           0.67
Activations Density 0.149%
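
The density figure can be read as the share of sampled tokens on which this feature fires at all. The sketch below illustrates that reading, assuming "active" means an activation strictly above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: a token counts as "active" when the feature's activation
    on it exceeds the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 3 of 2000 tokens.
sample = [0.0] * 1997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.149% would mean the feature is active on roughly 15 of every 10,000 sampled tokens.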