INDEX
Explanations
the start of a document or a significant section marker
New Auto-Interp
Negative Logits
rrggbb
-0.92
sidemargin
-0.89
__':
-0.87
__":
-0.86
FormTagHelper
-0.85
__":
-0.83
__':
-0.83
كومونز
-0.81
Waray
-0.80
Himo
-0.79
POSITIVE LOGITS
——
0.51
ull
0.51
put
0.50
put
0.48
ضان
0.46
,
0.45
losa
0.45
~
0.44
……
0.44
众
0.44
Activations Density 0.026%