INDEX
    Explanations

    features related to narrative storytelling or impactful character development

    New Auto-Interp
    Negative Logits
    crast
    -0.16
    uyu
    -0.15
    hd
    -0.15
    IX
    -0.15
    VERR
    -0.15
    OKIE
    -0.14
    кав
    -0.14
    oot
    -0.14
    arsers
    -0.13
    Vs
    -0.13
    POSITIVE LOGITS
     and
    0.17
    allel
    0.16
     coll
    0.16
    oret
    0.15
     progress
    0.15
     align
    0.15
     gag
    0.15
     
    0.14
     or
    0.14
    agu
    0.14
    Act Density 0.251%

    No Known Activations