INDEX
    Explanations

    the name "Ett" with varying activation values

    instances of the character string "tt" in various contexts

    Negative Logits
     dispers        -0.68
     sweeping       -0.67
     disperse       -0.65
     infiltrated    -0.62
     responsible    -0.61
     disbanded      -0.59
     solvent        -0.59
     conference     -0.58
     occupy         -0.57
     circulating    -0.57
    Positive Logits
    tt      4.43
    tta     1.91
    tto     1.90
    ttle    1.89
    tti     1.74
    TT      1.58
    ttes    1.56
    tty     1.54
    tten    1.43
    ts      1.38
    Activation Density: 0.011%

    No Known Activations