INDEX
Explanations
patterns of underscores and asterisks, which may indicate placeholders or emphasis in the text formatting
Negative Logits
tagHelperRunner    -0.74
GenerationType     -0.74
Monfieur           -0.73
foon               -0.71
Efq                -0.71
onCreateView       -0.70
UserScript         -0.70
الحره              -0.70
FormTagHelper      -0.69
Autoritní          -0.68
Positive Logits
really      0.56
exactly     0.52
<i>         0.51
(!          0.49
‘           0.49
choix       0.46
even        0.45
раздо       0.45
actually    0.45
itself      0.45
Activations Density 0.122%
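
The density figure above can be read as a fraction of tokens. The sketch below assumes that "activations density" means the share of sampled tokens on which the feature's activation is above zero; the page does not state its definition, so that reading, the function name `activation_density`, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (total tokens sampled).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 122 of which activate the feature.
sample = [0.0] * 99_878 + [1.5] * 122
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.122%
```

Under this reading, a density of 0.122% means the feature fires on roughly 1.2 tokens per 1,000, which is consistent with a sparse feature.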