INDEX
Explanations
abbreviations or acronyms related to scientific or technical terms
New Auto-Interp
Negative Logits
تضيفلها
-0.85
ویکیپدیا
-0.84
AddTagHelper
-0.79
')):
-0.76
InputBorder
-0.76
)}-
-0.74
RIB
-0.74
']]
-0.73
"]:
-0.73
-0.73
POSITIVE LOGITS
saiba
0.65
popoli
0.65
nemico
0.62
nemici
0.59
nationaux
0.59
difesa
0.58
varandra
0.56
Züge
0.56
specchio
0.56
neutre
0.56
Activations Density 0.797%