INDEX
    Explanations

    references to interpersonal relationships and interactions between characters

    New Auto-Interp
    Negative Logits
     gleichen
    -0.37
    pośred
    -0.36
     naselje
    -0.36
     takiej
    -0.34
    今回は
    -0.33
    Portale
    -0.33
     takiego
    -0.32
    ,
    -0.31
    此事
    -0.31
     similar
    -0.30
    POSITIVE LOGITS
     Majefty
    0.77
    randomUUID
    0.74
    ſelf
    0.72
     houſe
    0.72
     pleaſure
    0.69
     typelib
    0.61
    weakSelf
    0.60
     deſſen
    0.60
     חיצוניים
    0.59
     盗撮
    0.59
    Act Density 0.536%

    No Known Activations