INDEX
    Explanations

    actions and interactions between characters in the narrative

    Negative Logits
    "raj"          -0.15
    "ling"         -0.15
    "itag"         -0.14
    " ifdef"       -0.14
    "ynom"         -0.14
    " sentiment"   -0.13
    "ynet"         -0.13
    "ynch"         -0.13
    "aina"         -0.13
    "Callable"     -0.13
    Positive Logits
    "hetto"        0.17
    "echa"         0.16
    "egrator"      0.15
    "avaÅŁ"        0.15
    ".rs"          0.15
    "ieri"         0.15
    "ugins"        0.14
    "kili"         0.14
    "lsi"          0.14
    "obby"         0.14
    Activation Density: 0.700%

    No Known Activations