INDEX
    Explanations

    phrases related to personal accounts or stories

    references to personal experiences

    Negative Logits
    Token           Logit
    "inately"       -0.72
    "nda"           -0.70
    "efficiency"    -0.68
    "leaf"          -0.68
    "laws"          -0.67
    " nod"          -0.67
    "vous"          -0.66
    "pillar"        -0.65
    "corn"          -0.65
    "gem"           -0.64

    Positive Logits
    Token           Logit
    " experiences"  0.99
    " experien"     0.95
    " firsthand"    0.95
    " Exper"        0.87
    " experience"   0.86
    " Experience"   0.82
    "ually"         0.82
    "iences"        0.80
    "ional"         0.75
    "Shape"         0.72

    Activation Density: 0.036%

    No Known Activations