INDEX
    Explanations

    dialogue between characters

    expressions of dialogue and character interactions

    Negative Logits
    uably
    -0.70
     "#
    -0.64
     batters
    -0.63
     WATCHED
    -0.62
     NYT
    -0.61
    ardi
    -0.61
    NBA
    -0.59
    NFL
    -0.59
    CNN
    -0.59
    endi
    -0.59
    Positive Logits
    -"
    1.61
    …"
    1.49
    ..."
    1.41
    —"
    1.35
     …"
    1.24
    …."
    1.22
    !?"
    1.16
    」
    1.12
    ?"
    1.12
    !"
    1.09
    Act Density 0.559%

    No Known Activations