INDEX
Explanations
the presence of specific formatting or structure markers in the text, such as beginning of sections or lists
New Auto-Interp
Negative Logits
tartalomajánló
-0.85
виправивши
-0.84
berdayakan
-0.77
)");
-0.76
ValueStyle
-0.74
]--;
-0.73
―――――
-0.72
jgl
-0.72
хьтан
-0.70
]++;
-0.69
POSITIVE LOGITS
enumii
0.71
I
0.48
cupertino
0.48
thin
0.48
...
0.48
direct
0.47
ish
0.46
AxisAlignment
0.44
Thin
0.43
itness
0.43
Activations Density 0.004%