INDEX
    Explanations

    tendency or ability

    New Auto-Interp
    Negative Logits
     ability
    -1.53
     tendency
    -1.15
     Ability
    -0.98
     abilities
    -0.96
    Ability
    -0.93
    Rohy
    -0.90
    ability
    -0.89
     willingness
    -0.88
     propOrder
    -0.88
     kaarangay
    -0.87
    POSITIVE LOGITS
    a
    0.70
    e
    0.65
     for
    0.56
     in
    0.53
     seen
    0.52
     as
    0.49
    s
    0.49
     textStatus
    0.48
     named
    0.47
     on
    0.47
    Act Density 0.088%

    No Known Activations