INDEX
    Explanations

    references to educational or instructional contexts

    Followed by pronouns

    distinguished, formalism, differentiation

    New Auto-Interp
    Negative Logits
    InitVars
    -0.57
    klärt
    -0.55
    exitRule
    -0.52
    exao
    -0.48
    urantes
    -0.48
     afin
    -0.47
    TINGS
    -0.47
    fortable
    -0.46
    IZACIÓN
    -0.45
    WritableDatabase
    -0.45
    POSITIVE LOGITS
     it
    1.42
     оно
    1.21
     this
    1.04
    1.01
     это
    1.00
     она
    0.98
     they
    0.91
     nó
    0.90
     этот
    0.90
    它是
    0.89
    Act Density 0.532%

    No Known Activations