INDEX
Explanations
phrases referring to the concept of balance or position among various elements
New Auto-Interp
Negative Logits
<boost
-0.15
thora
-0.14
ToDevice
-0.14
edm
-0.14
wer
-0.14
ãĤ¢ãĥ¼
-0.13
Ere
-0.13
ERCHANT
-0.13
å°ij
-0.13
Chore
-0.13
POSITIVE LOGITS
0.17
819
0.16
838
0.15
460
0.15
Men
0.14
less
0.14
between
0.13
égor
0.13
ASI
0.13
-ÑĤо
0.13
Activations Density 0.003%