INDEX
Explanations
the frequency of numerical values represented in the document
New Auto-Interp
Negative Logits
itſelf
-1.08
myſelf
-1.08
InjectAttribute
-1.07
rungsseite
-1.05
―――――
-1.04
poffible
-1.04
للاسماء
-1.01
pleaſure
-1.00
AssemblyVersion
-0.97
Monfieur
-0.96
POSITIVE LOGITS
0.67
[toxicity=0]
0.54
/
0.54
mo
0.52
0.52
or
0.52
(
0.50
atau
0.50
\&
0.50
Tw
0.49
Activations Density 0.533%