INDEX
    Explanations

    words related to encouragement and support

    Negative Logits
    Token        Logit
    'zio'        -0.73
    'id'         -0.72
    'n'          -0.71
    '("")]'      -0.68
    'as'         -0.67
    't'          -0.67
    'lan'        -0.65
    'i'          -0.64
    ' half'      -0.64
    'fi'         -0.63
    Positive Logits
    Token               Logit
    ' encouraged'       1.53
    ' encourages'       1.46
    'couraged'          1.45
    ' Encourage'        1.43
    'Encourage'         1.41
    ' encourage'        1.40
    ' encouragement'    1.36
    ' encor'            1.27
    'encouragement'     1.25
    ' encourag'         1.23
    Activation Density: 0.135%

    No Known Activations