INDEX
Explanations
mentions of books or novels
references to novels or narrative works
Negative Logits
ftime       -0.76
halla       -0.73
uler        -0.68
adelphia    -0.67
Sv          -0.65
Treasurer   -0.62
vals        -0.61
itas        -0.61
gie         -0.61
Michaels    -0.61
POSITIVE LOGITS
novel        3.79
Novel        2.57
novels       2.47
novelist     1.74
screenplay   1.41
memoir       1.36
fiction      1.33
poem         1.32
manuscript   1.32
thriller     1.30
Activations Density: 0.016%