INDEX
    Explanations

    careful actions or behaviors

    instances of the word "careful."

    New Auto-Interp
    Negative Logits
    anon
    -0.77
    Phones
    -0.71
    Wars
    -0.71
    Pear
    -0.71
    NZ
    -0.70
    flat
    -0.70
    ifles
    -0.68
    AF
    -0.67
    CBC
    -0.67
     Ethiopia
    -0.67
    POSITIVE LOGITS
     careful
    1.08
     scrutiny
    0.96
    autions
    0.88
     cautious
    0.84
     precautions
    0.82
     calibr
    0.80
     heed
    0.79
     attentive
    0.79
     precaution
    0.78
     stewards
    0.77
    Act Density 0.006%

    No Known Activations