Explanations
references to specific book publishers
references to publishing companies and related terms
Negative Logits
ortium    -0.80
ornia     -0.76
eteria    -0.66
ĸļ        -0.66
jriwal    -0.63
ournals   -0.62
anty      -0.62
ujah      -0.59
DIR       -0.59
lished    -0.58
Positive Logits
Diesel    0.70
metic     0.69
XL        0.65
ר         0.65
achev     0.65
kov       0.64
manship   0.64
etric     0.63
terness   0.62
Boxing    0.62
Activations Density: 0.311%