INDEX
    Explanations

    references to the word "fluff"

    Negative Logits
    "CHR"                 -0.76
    "chrome"              -0.65
    "Hunt"                -0.65
    " lapse"              -0.64
    "xual"                -0.64
    " Cambod"             -0.62
    "metic"               -0.62
    " erg"                -0.61
    " freeze"             -0.61
    "³³³³³³³³³³³³³³³³"    -0.60

    Positive Logits
    "ington"               1.13
    "ield"                 1.04
    "IELD"                 0.99
    "iculty"               0.91
    "uff"                  0.89
    "aneous"               0.84
    "alo"                  0.82
    "ixel"                 0.82
    "ieri"                 0.82
    "ucket"                0.81

    Activation Density: 0.015%

    No Known Activations