INDEX
    Explanations

    conditional phrases or clauses

    New Auto-Interp
    Negative Logits
     herself
    -0.58
    herself
    -0.53
    EndProject
    -0.52
     توانند
    -0.51
     pourront
    -0.49
    ConstraintMaker
    -0.48
    arily
    -0.48
    كويكب
    -0.47
     lenker
    -0.47
     Pleas
    -0.47
    POSITIVE LOGITS
     there
    1.16
     used
    0.96
     using
    0.87
     done
    0.85
     dealing
    0.84
     wanting
    0.81
     performed
    0.77
    there
    0.76
     trying
    0.73
     undertaken
    0.72
    Act Density 0.478%

    No Known Activations