INDEX
    Explanations

    instances of the word "but" indicating contrast or contradiction

    New Auto-Interp
    Negative Logits
    bane
    -0.17
    nier
    -0.15
    agna
    -0.15
    EFF
    -0.14
    774
    -0.14
    éĶĭ
    -0.14
    åłĤ
    -0.14
    soft
    -0.13
    eso
    -0.13
    наÑĢ
    -0.13
    POSITIVE LOGITS
    $MESS
    0.15
    Untitled
    0.14
    Ñľ
    0.14
     Hollywood
    0.14
    cmpeq
    0.14
    ToOne
    0.14
    ckill
    0.14
    andr
    0.14
    ToFit
    0.14
    ForRow
    0.14
    Act Density 0.185%

    No Known Activations