INDEX
    Explanations

    references to dialogues and conversations within the text

    New Auto-Interp
    Negative Logits
    :+:
    -0.80
    *{\
    -0.77
     Bezirks
    -0.76
     newOwner
    -0.74
    ########.
    -0.73
     XCTest
    -0.72
    #+#
    -0.72
     تضيفلها
    -0.72
    存于互联网档案馆
    -0.69
    لاثة
    -0.68
    POSITIVE LOGITS
     dialog
    1.94
     Dialog
    1.80
    dialog
    1.70
     DIALOG
    1.68
     MatDialog
    1.60
    DIALOG
    1.60
    Dialog
    1.55
     dialogue
    1.46
     dialogues
    1.36
     Dialogue
    1.36
    Act Density 0.041%

    No Known Activations