INDEX
Explanations
mentions of writers and writing
New Auto-Interp
Negative Logits
anter
-0.08
ERT
-0.08
eur
-0.08
edu
-0.08
tep
-0.08
ayers
-0.08
tsy
-0.07
ilion
-0.07
adera
-0.07
ÑĤим
-0.07
POSITIVE LOGITS
hip
0.08
hood
0.07
/editor
0.07
innen
0.07
/auth
0.07
itative
0.06
prene
0.06
lady
0.06
/art
0.06
Indies
0.06
Activations Density 0.012%