INDEX
Explanations
language related to performance and entertainment, especially in magic or comedy contexts
New Auto-Interp
Negative Logits
ellt
-0.17
563
-0.16
710
-0.15
pit
-0.15
Erg
-0.14
κÏĮ
-0.14
çķ«
-0.14
idal
-0.14
forgiven
-0.14
æı®
-0.14
POSITIVE LOGITS
card
0.27
magic
0.25
magician
0.24
Magic
0.24
.magic
0.23
MAGIC
0.23
mag
0.22
_magic
0.21
Magic
0.21
magic
0.21
Activations Density 0.015%