INDEX
Explanations
references to novels and literature
New Auto-Interp
Negative Logits
Keim
-0.73
equalization
-0.72
^{\-0.69
Eich
-0.68
on
-0.66
Tweede
-0.64
løs
-0.61
kork
-0.60
ings
-0.58
aprend
-0.58
POSITIVE LOGITS
NOVEL
1.01
novels
1.00
Novel
0.99
Novel
0.99
theless
0.97
novel
0.95
Novels
0.92
novel
0.91
principalTable
0.86
weihnachten
0.85
Activations Density 0.176%