INDEX
Explanations
neurons related to programming and technical language
Mathematical or unusual symbols
technical terms and punctuation
New Auto-Interp
Negative Logits
</em>
-0.49
—
-0.49
يا
-0.48
=
-0.47
だけに
-0.47
morti
-0.46
#
-0.46
JI
-0.46
@
-0.45
.
-0.45
POSITIVE LOGITS
TagMode
0.92
abestanden
0.91
Hochspringen
0.88
存于互联网档案馆
0.87
للمعارف
0.82
ViewImports
0.82
AutoresizingMask
0.80
ویکیپدیای
0.79
سكانية
0.78
Roskov
0.78
Activations Density 0.024%