INDEX
Explanations
questions indicated by the presence of question marks
ending punctuation of questions
New Auto-Interp
Negative Logits
nahilalakip
-0.52
يكب
-0.49
تضيفلها
-0.47
iastical
-0.46
GOTREF
-0.44
AnchorStyles
-0.44
MathML
-0.43
❱
-0.43
begin
-0.42
محفوظة
-0.42
POSITIVE LOGITS
?
1.09
%?
1.07
?
0.96
?"
0.94
…?
0.94
$?
0.94
!?
0.93
?”
0.91
...?
0.90
="?
0.88
Activations Density 0.054%