INDEX
    Explanations

    phrases indicating examples or samples of various items or concepts

    instances of the word "such" followed by explanations or examples

    Negative Logits
    Token              Logit
    "ertodd"           -0.76
    "zl"               -0.72
    "olate"            -0.71
    "itudinal"         -0.71
    "ipedia"           -0.70
    "rition"           -0.67
    "oil"              -0.67
    "hene"             -0.66
    " Drum"            -0.66
    "kick"             -0.66
    Positive Logits
    Token              Logit
    "ties"             0.78
    " minded"          0.68
    " consequential"   0.66
    "cond"             0.66
    "should"           0.62
    " aggreg"          0.60
    " abundantly"      0.60
    "things"           0.60
    "minded"           0.59
    " constituted"     0.59
    Activation Density: 0.050%

    No Known Activations