INDEX
    Explanations

    mentions of items or objects being described as "fancy"

    references to the word "fancy."

    New Auto-Interp
    Negative Logits
    etts
    -0.78
    upon
    -0.77
    sen
    -0.75
    scl
    -0.72
    essee
    -0.70
    arenthood
    -0.68
    iland
    -0.67
    ————————————————
    -0.66
    IRO
    -0.66
    amaru
    -0.65
    POSITIVE LOGITS
     fancy
    1.23
    pants
    0.95
     fanc
    0.77
     notions
    0.74
    tail
    0.73
     fries
    0.71
     nifty
    0.70
     dress
    0.70
    rous
    0.70
    vier
    0.68
    Act Density 0.014%

    No Known Activations