INDEX
Explanations
expressions of admiration or appreciation for artwork or design
Negative Logits
à¸ģà¸ķ      -0.15
Thomson   -0.14
Ľå»º      -0.14
playbook  -0.14
Jennings  -0.13
nfl       -0.13
putas     -0.13
game      -0.13
Buffett   -0.13
ðŁ        -0.13
Positive Logits
XD   0.25
^^   0.24
xD   0.24
^    0.23
.^   0.23
XD   0.22
._   0.22
~↵   0.22
^.   0.21
~    0.20
Activations Density: 0.079%