INDEX
Explanations
mathematical symbols and expressions
New Auto-Interp
Negative Logits
ợn
-0.67
geslacht
-0.61
[…]
-0.60
-0.59
-0.58
[...]
-0.58
تفصیلات
-0.58
###
-0.58
maisha
-0.56
###
-0.56
POSITIVE LOGITS
Ί
0.53
tanleria
0.51
IVI
0.50
ıy
0.49
TagHelper
0.48
JJ
0.48
¢
0.47
نیم
0.46
IImage
0.45
ΐ
0.45
Activations Density 0.018%