    Explanations

    references to the concept of "cause"

    Negative Logits
    Token             Logit
    'tispiece'        -0.70
    '<?='             -0.69
    '<?'              -0.65
    ' useStyles'      -0.64
    'ViewBag'         -0.63
    'енча'            -0.59
    '}*/\n'           -0.59
    '"><?='           -0.59
    'BeginContext'    -0.58
    '().__'           -0.56
    Positive Logits
    Token             Logit
    ' cause'          1.73
    'cause'           1.65
    ' Cause'          1.65
    'Cause'           1.56
    ' CAUSE'          1.53
    'CAUSE'           1.44
    ' cuz'            1.29
    ' causes'         1.23
    ' cos'            1.22
    ' causa'          1.12
    Activation Density: 0.130%

    No Known Activations