INDEX
    Explanations

    statements evaluating the truthfulness of conditions or expressions, particularly those involving boolean values

    Negative Logits
    " AssemblyProduct"    -0.89
    "__':\n"              -0.83
    "}`)."                -0.82
    " Efq"                -0.81
    "__\":\n"             -0.80
    "]`"                  -0.79
    " Theſe"              -0.77
    "}>;"                 -0.77
    " متعلقه"             -0.77
    " ་་"                 -0.76
    Positive Logits
    " Yes"                0.65
    " True"               0.61
    " for"                0.59
    (blank)               0.57
    "walde"               0.56
    "SetBool"             0.56
    (blank)               0.54
    "val"                 0.54
    " true"               0.54
    "Yes"                 0.53
    Act Density 0.396%

    No Known Activations