INDEX
    Explanations

    the word "normal" or variations of it

    references to the concept of "normal."

    New Auto-Interp
    Negative Logits
    iosyncr
    -0.77
    hani
    -0.76
    haw
    -0.72
    Lens
    -0.70
    hop
    -0.70
    NRS
    -0.69
    Sov
    -0.69
    Spot
    -0.68
    artisan
    -0.68
    otle
    -0.64
    POSITIVE LOGITS
    ization
    1.18
    isation
    1.18
    cy
    1.16
    izes
    1.16
    ised
    1.12
    izing
    1.09
    ises
    1.05
    ize
    1.05
    izers
    1.01
    ized
    0.99
    Act Density 0.024%

    No Known Activations