INDEX
    Explanations

    phrases indicating going through a challenging or exhaustive experience

    references to experiences of going through challenges or processes

    Negative Logits
    NPR         -0.67
    iPhone      -0.66
    nai         -0.65
     Nurse      -0.64
     Shine      -0.64
    irlfriend   -0.62
    POSE        -0.61
    Percent     -0.61
     AAP        -0.61
    ufact       -0.61
    Positive Logits
     maze       1.03
     labyrinth  1.01
     hoops      0.93
     motions    0.91
     veins      0.90
     ranks      0.81
     stages     0.81
     loops      0.80
     hurdles    0.80
     corridors  0.78
    Activation Density: 0.276%

    No Known Activations