INDEX
    Explanations

    punctuation and formatting elements in the text

    New Auto-Interp
    Negative Logits
     snippetHide
    -0.72
    BASEPATH
    -0.72
    ReusableCell
    -0.72
     Efq
    -0.69
    principalTable
    -0.69
     للمعارف
    -0.68
    клопе
    -0.67
     $_"
    -0.66
     "..\..\..\
    -0.65
    endgroup
    -0.65
    POSITIVE LOGITS
     it
    0.63
     there
    0.56
    <bos>
    0.53
    وحة
    0.51
    ↵↵
    0.50
     these
    0.48
     спро
    0.45
     dichos
    0.45
    )))
    0.45
     econó
    0.45
    Act Density 0.510%

    No Known Activations