INDEX
    Explanations

    mentions of characters or people in a text

    references to characters in narratives

    New Auto-Interp
    Negative Logits
    VERTIS
    -0.72
    rup
    -0.72
    ת
    -0.72
    yg
    -0.70
    ntil
    -0.67
    LOCK
    -0.66
    Effective
    -0.66
    condition
    -0.64
    galitarian
    -0.63
     Tant
    -0.63
    POSITIVE LOGITS
    acters
    1.52
    istically
    1.17
    istics
    1.07
     arcs
    0.86
    izations
    0.86
     characters
    0.83
     portraits
    0.82
     assassinate
    0.82
    isations
    0.78
     Characters
    0.78
    Act Density 0.046%

    No Known Activations