Explanations
quantitative data and numerical representations
Negative Logits
Token            Logit
resourceCulture  -0.96
كومونز           -0.95
дописавши        -0.93
виправивши       -0.91
Chwiliwch        -0.91
للاسماء          -0.87
complexContent   -0.87
featureID        -0.86
Wikimedijinoj    -0.86
ProtoMessage     -0.85
Positive Logits
Token    Logit
_        0.51
__       0.47
PR       0.45
2        0.44
SY       0.43
main     0.41
1        0.41
↵        0.40
sw       0.40
(blank)  0.38
Activations Density 0.075%
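
The page does not define how the density figure is computed. One common reading, assumed here, is the fraction of sampled token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name, the threshold, and the sample data are hypothetical and do not come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds threshold.

    Assumption: "density" means the share of token positions where the
    feature fires. The page itself does not state the definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 3 of 4000 token positions.
sample = [0.0] * 3997 + [1.2, 0.4, 2.7]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.075%
```

Under this reading, a density of 0.075% would mean the feature is active on roughly 3 in every 4000 tokens, which is a sparse feature.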