INDEX
    Explanations

    phrases indicating one example or part of several within a larger context

    instances of phrases emphasizing the notion of being just one among many, often in a context of comparison or exemplification

    New Auto-Interp
    Negative Logits
    same
    -0.72
    iano
    -0.65
    regn
    -0.65
    nob
    -0.63
    raped
    -0.62
    owers
    -0.60
     disbanded
    -0.60
    then
    -0.60
    sleep
    -0.58
    acha
    -0.58
    POSITIVE LOGITS
     examples
    0.99
     scratching
    0.98
     iceberg
    0.94
     sampling
    0.90
     anecdotal
    0.87
     sympt
    0.86
     example
    0.81
     sample
    0.79
     symptom
    0.79
     illustration
    0.78
    Act Density 0.115%

    No Known Activations