INDEX
Explanations
HTML table-related tags and formatting
New Auto-Interp
Negative Logits
,
-0.65
and
-0.64
Y
-0.55
the
-0.53
y
-0.52
re
-0.52
a
-0.50
this
-0.50
بوابة
-0.48
Re
-0.48
POSITIVE LOGITS
),"
1.10
)."
1.07
)':
1.07
])),
1.05
)}}
1.04
]."
1.04
IsContent
1.03
).'
1.02
?")
1.01
"]),
1.01
Activations Density 0.077%