INDEX
Explanations
mentions of writers
occurrences of the word "writer" in various contexts
New Auto-Interp
Negative Logits
ADRA
-0.79
rals
-0.77
illon
-0.76
Lumpur
-0.73
aband
-0.72
ĸļ
-0.71
ypes
-0.70
ibaba
-0.70
eneg
-0.70
asonic
-0.68
POSITIVE LOGITS
writer
0.95
writer
0.94
laureate
0.91
uscript
0.87
fiction
0.86
writing
0.85
writ
0.82
haw
0.81
Beware
0.78
itatively
0.76
Activations Density 0.028%