INDEX
    Explanations

    phrases about common human experiences or traits with a focus on uniqueness

    phrases indicating possession or existence related to individuals

    Negative Logits
    inth          -0.82
    Newsletter    -0.70
    ound          -0.69
    etting        -0.69
    live          -0.68
    edia          -0.68
    QUI           -0.68
    ãĤº           -0.66
    sole          -0.66
    cise          -0.65
    Positive Logits
     flaws         0.97
     biases        0.88
     quirks        0.87
     weaknesses    0.84
     faults        0.83
     differing     0.83
     varying       0.83
     precon        0.82
     strengths     0.82
     undergone     0.78
    Activation Density: 0.183%

    No Known Activations