INDEX
    Explanations

    phrases containing the expression "kind of"

    phrases that convey a sense of qualification or similarity

    New Auto-Interp
    Negative Logits
    ared
    -0.78
    vale
    -0.71
    ovember
    -0.70
     Yard
    -0.70
    heid
    -0.69
    listed
    -0.67
    arie
    -0.66
    ajor
    -0.66
    DS
    -0.66
    brush
    -0.65
    POSITIVE LOGITS
     luck
    0.86
     weird
    0.75
     nonsense
    0.70
     things
    0.69
     relig
    0.65
     humility
    0.65
     stuff
    0.65
     incomprehensible
    0.64
     fun
    0.62
     thing
    0.62
    Act Density 0.150%

    No Known Activations