INDEX
    Explanations

    instances of conditional or prohibitive statements

    New Auto-Interp
    Negative Logits
    */
    -0.70
    س
    -0.63
     srfAttach
    -0.62
     thereof
    -0.62
    itud
    -0.60
     attRot
    -0.59
    ibles
    -0.58
    âĿ
    -0.58
    "}],"
    -0.57
    ÙIJ
    -0.57
    POSITIVE LOGITS
    cknowled
    0.88
     Started
    0.81
     Own
    0.75
    itialized
    0.71
    dating
    0.70
    nea
    0.65
    gdala
    0.63
    iltr
    0.62
     Tradable
    0.61
     Fired
    0.61
    Act Density 0.143%

    No Known Activations