INDEX
    Explanations

    words related to fantasy or fantastical elements

    New Auto-Interp
    Negative Logits
    #af
    -0.16
    prü
    -0.16
    peer
    -0.15
    ingo
    -0.15
    flater
    -0.15
    itters
    -0.15
    ATUS
    -0.15
    ture
    -0.15
    eyer
    -0.14
    idable
    -0.14
    POSITIVE LOGITS
    astically
    0.29
    asia
    0.29
    asy
    0.26
    asma
    0.24
    ast
    0.23
    ôme
    0.23
    ASY
    0.21
    asm
    0.20
    AST
    0.20
    asty
    0.20
    Act Density 0.007%

    No Known Activations