INDEX
    Explanations

    terms related to physical and mental well-being

    terms related to health, wellness, and structure

    Negative Logits
    Token           Logit
    " Converted"    -0.64
    " typo"         -0.63
    " numerical"    -0.63
    " legal"        -0.61
    " Happ"         -0.59
    " Flavoring"    -0.59
    " stray"        -0.59
    " outgoing"     -0.58
    " alcoholic"    -0.58
    " keyword"      -0.58
    Positive Logits
    Token        Logit
    "sheets"     1.09
    "ings"       1.08
    "pieces"     1.07
    "otype"      1.05
    "aments"     1.04
    "piece"      1.03
    "ologies"    1.02
    "coat"       1.01
    "ages"       1.01
    "ups"        1.00
    Activation Density: 0.769%

    No Known Activations