INDEX
Explanations
mentions of writers and their related activities
references to writers or their roles in various contexts
New Auto-Interp
Negative Logits
undai
-0.93
ibaba
-0.81
xon
-0.79
Lumpur
-0.78
rals
-0.77
illon
-0.74
inho
-0.73
umph
-0.71
opping
-0.70
Ĭ±
-0.70
POSITIVE LOGITS
writer
1.23
writer
1.07
laureate
1.04
writers
1.03
Writer
0.95
writ
0.93
writers
0.88
Writers
0.86
fiction
0.84
writing
0.82
Activations Density 0.020%