INDEX
    Explanations

    text that discusses personal experiences and narratives

    New Auto-Interp
    Negative Logits
    eldom
    -0.07
    arg
    -0.07
     Hans
    -0.06
    asu
    -0.06
    çĶ
    -0.06
     GDK
    -0.06
    args
    -0.06
    AREN
    -0.06
    .impl
    -0.06
    idot
    -0.06
    POSITIVE LOGITS
     interviewer
    0.09
     interview
    0.09
     describe
    0.08
     describes
    0.08
     entrev
    0.07
     Interview
    0.07
     remin
    0.07
    è«ĩ
    0.07
     descriptions
    0.07
    Interview
    0.07
    Act Density 0.009%

    No Known Activations