INDEX
Explanations
HTML table header elements (th)
NEGATIVE LOGITS
Theſe            -0.98
ویکیپدیای        -0.97
ertale           -0.91
таратура         -0.86
✨:               -0.86
WriteTagHelper   -0.82
tagHelperRunner  -0.81
utafitiHapana    -0.81
AndEndTag        -0.76
виправивши       -0.75
POSITIVE LOGITS
th               1.77
TH               1.05
th               0.90
Th               0.89
ths              0.79
thu              0.76
thi              0.71
Th               0.69
thun             0.66
thy              0.62
Activations Density: 0.020%