INDEX
    Explanations

    frequent mentions of the word "lots"

    New Auto-Interp
    Negative Logits
     cref
    -0.60
     fap
    -0.59
     patties
    -0.54
    io
    -0.50
    Jah
    -0.49
     tout
    -0.49
    inchilla
    -0.49
    ք
    -0.49
    theit
    -0.48
     út
    -0.48
    POSITIVE LOGITS
     lots
    1.47
    Lots
    1.44
     Lots
    1.40
     LOTS
    1.16
    lots
    1.12
     myſelf
    0.78
    ConstraintMaker
    0.78
    BufferException
    0.78
     loads
    0.76
    ^(@)
    0.75
    Act Density 0.035%

    No Known Activations