INDEX
    Explanations

    numbers at the end of sentences

    Negative Logits
    Token        Logit
    " Schw"      -0.67
    " yawn"      -0.65
    "ienne"      -0.63
    " bond"      -0.63
    " chant"     -0.60
    " Amen"      -0.60
    " chants"    -0.60
    " closer"    -0.59
    " trough"    -0.59
    " buzz"      -0.58
    Positive Logits
    Token        Logit
    "64"         3.46
    "32"         2.18
    "66"         2.03
    "63"         1.98
    "65"         1.97
    "68"         1.91
    "62"         1.78
    "67"         1.72
    "69"         1.72
    "84"         1.72
    Activation Density: 0.020%

    No Known Activations