INDEX
    Explanations

    positive adjectives followed by nouns

    expressions emphasizing the word "such" in various contexts

    Negative Logits
    Token       Logit
    "ertodd"    -0.82
    "kick"      -0.69
    "ARM"       -0.69
    "ntil"      -0.68
    "TeX"       -0.68
    " Murd"     -0.67
    "ysc"       -0.64
    "·"         -0.62
    " Drum"     -0.62
    "iliate"    -0.62
    Positive Logits
    Token               Logit
    "ties"              0.72
    "ities"             0.70
    " abundantly"       0.65
    " awful"            0.65
    " consequential"    0.64
    "thin"              0.62
    " specificity"      0.61
    "minded"            0.60
    "vered"             0.60
    " sums"             0.60
    Activation Density: 0.047%

    No Known Activations