INDEX
    Explanations

    phrases indicating tasks or actions that need to be completed or demonstrated

    New Auto-Interp
    Negative Logits
    Билгалдахарш
    -0.87
    AndEndTag
    -0.80
    ftagPool
    -0.73
    rrggbb
    -0.72
    Espèce
    -0.72
    ConstraintMaker
    -0.72
     ModelExpression
    -0.70
    complexContent
    -0.67
    таратура
    -0.65
     noDo
    -0.63
    POSITIVE LOGITS
     Remaining
    0.54
    Remaining
    0.52
     remaining
    0.51
     pozosta
    0.47
    今度は
    0.46
    remaining
    0.45
     Âu
    0.45
    yaf
    0.45
    あとは
    0.45
     next
    0.44
    Act Density 0.333%

    No Known Activations