INDEX
Explanations
words indicating a sequence or list of items
Negative Logits
fraid       -0.60
seits       -0.60
ahogy       -0.60
ítja        -0.58
groet       -0.56
ویکیپدیا    -0.55
örté        -0.55
掂          -0.55
recevez     -0.55
anus        -0.55
Positive Logits
following   2.30
following   1.98
FOLLOWING   1.67
seguinte    1.66
suivante    1.60
siguiente   1.58
siguientes  1.56
Following   1.55
följande    1.55
Following   1.55
Activations Density 0.099%
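
The density figure can be read as the share of sampled tokens on which the feature fires. The sketch below assumes that reading (the source does not define the metric) and uses a hypothetical helper name, threshold, and made-up activation values purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The helper name and threshold are hypothetical.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature is active on 1 of 1000 tokens.
sample = [0.0] * 999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.099% corresponds to the feature being active on roughly one token in a thousand.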