INDEX
    Explanations

    instances of conflict or tension between characters

    New Auto-Interp
    Negative Logits
     ujednoznacz
    -0.73
    حياته
    -0.68
     références
    -0.67
     itſelf
    -0.67
    下载附件
    -0.67
     postIndex
    -0.65
    OOTDTY
    -0.65
    Cordialement
    -0.64
     autorité
    -0.64
    حياتها
    -0.62
    POSITIVE LOGITS
    tagext
    0.46
     timp
    0.46
     дописавши
    0.45
    何を
    0.44
     night
    0.42
     tadi
    0.42
     hired
    0.42
     she
    0.41
     time
    0.41
    ↵↵
    0.40
    Act Density 0.101%

    No Known Activations