INDEX
Explanations
references to mythological figures or deities
Negative Logits
AssemblyTitle      -1.24
sidemargin         -0.95
متعلقه             -0.93
transfieras        -0.93
tagHelperRunner    -0.90
NUKAT              -0.90
himo               -0.90
+#+#               -0.90
Roskov             -0.90
الإنجليزية         -0.89
Positive Logits
                   0.56
I                  0.52
i                  0.49
U                  0.48
.                  0.46
-                  0.45
P                  0.43
first              0.42
"                  0.42
<eos>              0.41
Activations Density 0.170%