INDEX
Explanations
words related to ordering or sequencing of processes, sometimes with an element of causality
instructions/procedure
New Auto-Interp
Negative Logits
juſt
-0.80
purpoſe
-0.79
becauſe
-0.76
sauvages
-0.74
Abonnez
-0.73
reaſon
-0.73
privées
-0.73
humaines
-0.73
himſelf
-0.73
vectorielles
-0.73
POSITIVE LOGITS
then
0.94
Then
0.88
THEN
0.83
Then
0.81
THEN
0.75
puis
0.68
ثم
0.66
then
0.63
ثم
0.63
Puis
0.62
Activations Density 1.136%