INDEX
Explanations
the presence of a specific initial marker or header in the document
Tokens preceding questions or requests for help
question or explanation
New Auto-Interp
Negative Logits
TagMode
-0.83
Administrativna
-0.69
FontOfSize
-0.67
igshid
-0.66
".
-0.66
تانيه
-0.65
Anſ
-0.65
OGND
-0.65
']>;
-0.65
Estatal
-0.64
POSITIVE LOGITS
.
0.69
_
0.59
my
0.58
et
0.54
[]
0.54
self
0.52
?
0.52
,
0.51
user
0.51
a
0.50
Activations Density 0.122%