    Explanations

    expressions of laziness and unwillingness to take action

    Negative Logits
    Ifc               -0.53
    SBATCH            -0.52
    +#+#              -0.46
     kasarigan        -0.45
    WithIOException   -0.44
    __);              -0.44
    ICOLON            -0.43
     يتيمه            -0.43
    Personensuche     -0.42
     AppColors        -0.42
    Positive Logits
     reluctance       0.49
     lazy             0.47
    lazy              0.46
     reluctant        0.46
                      0.45
    Lazy              0.44
     Lazy             0.44
     unwilling        0.42
    relu              0.42
     unwillingness    0.41
    Act Density 0.107%

    No Known Activations