INDEX
Explanations
elements related to authors and their works
New Auto-Interp
Negative Logits
ovky
-0.19
angi
-0.18
)((((
-0.15
hower
-0.15
adera
-0.15
itol
-0.15
ắng
-0.15
ogne
-0.15
éľŀ
-0.14
bucks
-0.14
POSITIVE LOGITS
otto
0.16
importe
0.15
042
0.14
ANY
0.14
distur
0.14
zin
0.14
ÑĤаб
0.14
outnumber
0.14
erland
0.14
586
0.13
Activations Density 0.440%