INDEX
    Explanations

    the word "are" in sentences

    the phrase "We are" used in various contexts

    Negative Logits
    Token         Logit
    "rouse"       -0.76
    " mater"      -0.68
    "leck"        -0.65
    "osate"       -0.64
    "pedia"       -0.64
    "Rank"        -0.64
    "ossom"       -0.63
    "rarily"      -0.63
    " entails"    -0.63
    "Shape"       -0.62
    Positive Logits
    Token         Logit
    " glad"       0.94
    " hereby"     0.93
    " supposed"   0.91
    " obligated"  0.91
    " thankful"   0.91
    " gonna"      0.89
    " fortunate"  0.89
    " proud"      0.88
    " not"        0.87
    " aware"      0.87
    Activation Density: 0.143%

    No Known Activations