INDEX
Explanations
words and phrases related to technical processes and modifications
New Auto-Interp
Negative Logits
ÃŃ
-0.21
ify
-0.18
lyn
-0.17
sam
-0.16
ingu
-0.16
ÃŃa
-0.15
iao
-0.15
sik
-0.15
ified
-0.15
yy
-0.15
POSITIVE LOGITS
ary
0.42
ally
0.38
naire
0.37
ist
0.33
al
0.32
nal
0.31
ists
0.31
nelle
0.29
ARY
0.28
nel
0.28
Activations Density 1.424%