INDEX
Explanations
formal beginnings and conclusions
New Auto-Interp
Negative Logits
ребят
-1.08
isn
-1.07
different
-1.06
guys
-1.03
kids
-1.02
บ
-1.02
really
-1.01
inside
-1.01
DIFFERENT
-0.98
ㄡ
-0.97
POSITIVE LOGITS
shall
1.44
forthwith
1.22
heretofore
1.21
henceforth
1.20
glorious
1.18
sehr
1.16
весьма
1.16
ибо
1.14
hitherto
1.13
swiftly
1.09
Activations Density 0.358%