INDEX
Explanations
occurrences of nested HTML div elements
New Auto-Interp
Negative Logits
şört
-0.80
queſta
-0.77
windowFixed
-0.77
adaptiveStyles
-0.73
+#+
-0.70
yntaxException
-0.69
Superan
-0.68
ویکیپدی
-0.68
Numerade
-0.67
kasarigan
-0.67
POSITIVE LOGITS
div
0.56
0.52
the
0.51
The
0.43
0.40
0.40
2
0.39
ta
0.39
a
0.39
a
0.39
Activations Density 0.002%