INDEX
Explanations
words related to fantasy or fantastical elements
New Auto-Interp
Negative Logits
#af
-0.16
prü
-0.16
peer
-0.15
ingo
-0.15
flater
-0.15
itters
-0.15
ATUS
-0.15
ture
-0.15
eyer
-0.14
idable
-0.14
POSITIVE LOGITS
astically
0.29
asia
0.29
asy
0.26
asma
0.24
ast
0.23
ôme
0.23
ASY
0.21
asm
0.20
AST
0.20
asty
0.20
Activations Density 0.007%