INDEX
Explanations
punctuation and symbols indicating structure or separation in text
New Auto-Interp
Negative Logits
estekak
-0.89
CreateTagHelper
-0.83
majánló
-0.81
للمعارف
-0.80
otomatig
-0.78
وتسجيلات
-0.78
__':
-0.75
يميديا
-0.72
noDo
-0.71
ésultats
-0.71
POSITIVE LOGITS
0.70
0.66
0.66
}
0.61
}
0.59
0.58
0.56
0.54
0.54
0.53
Activations Density 0.103%