INDEX
    Explanations

    references to emotional relationships and significant events in character interactions

    New Auto-Interp
    Negative Logits
    ValueStyle
    -0.81
    AndEndTag
    -0.72
    ंदीखरीदारी
    -0.68
     autorytatywna
    -0.61
     kasarigan
    -0.60
     estekak
    -0.58
    BibitemOpen
    -0.55
    Bored
    -0.52
     utilising
    -0.52
    :✨
    -0.52
    POSITIVE LOGITS
     accident
    0.38
     gossip
    0.37
     mist
    0.37
     plot
    0.36
     plotting
    0.33
     teleno
    0.33
    setcounter
    0.32
     dress
    0.32
     gossi
    0.31
     Beauchamp
    0.31
    Act Density 0.101%

    No Known Activations