INDEX
    Explanations

    interactions between characters and their actions in a narrative context

    New Auto-Interp
    Negative Logits
    asaki
    -0.16
     pari
    -0.16
    vary
    -0.15
    pez
    -0.15
    chet
    -0.15
    elon
    -0.14
    imson
    -0.14
    ostel
    -0.14
    acz
    -0.14
     Guerrero
    -0.14
    POSITIVE LOGITS
     Orc
    0.15
    anh
    0.15
     dou
    0.14
    icorn
    0.14
     Dou
    0.14
    OWL
    0.14
    celed
    0.14
    -aos
    0.14
    ldb
    0.14
    гл
    0.14
    Act Density 0.351%

    No Known Activations