INDEX
    Explanations

    phrases indicative of interpersonal relationships and dialogue

    pronouns after punctuation

    New Auto-Interp
    Negative Logits
     которое
    -0.60
     яке
    -0.53
    urtstag
    -0.51
    trone
    -0.50
    YOND
    -0.49
     hivyo
    -0.49
    PONENTS
    -0.49
    いくつか
    -0.49
     thừa
    -0.48
    것은
    -0.47
    POSITIVE LOGITS
     who
    1.02
     whom
    0.99
    此人
    0.95
    他是
    0.93
    whom
    0.91
    who
    0.90
     he
    0.89
     she
    0.88
     shes
    0.87
     him
    0.84
    Act Density 0.356%

    No Known Activations