INDEX
    Explanations

    the presence of specific formatting or structure markers in the text, such as beginning of sections or lists

    New Auto-Interp
    Negative Logits
     tartalomajánló
    -0.85
     виправивши
    -0.84
    berdayakan
    -0.77
    )");
    
    -0.76
    ValueStyle
    -0.74
    ]--;
    -0.73
     ―――――
    -0.72
    jgl
    -0.72
    хьтан
    -0.70
    ]++;
    -0.69
    POSITIVE LOGITS
    enumii
    0.71
     I
    0.48
    cupertino
    0.48
    thin
    0.48
    ...
    0.48
     direct
    0.47
    ish
    0.46
    AxisAlignment
    0.44
    Thin
    0.43
    itness
    0.43
    Act Density 0.004%

    No Known Activations