INDEX
Explanations
phrases where someone is credited with writing something, such as books, articles, reports, or social media posts
occurrences of the word "wrote."
New Auto-Interp
Negative Logits
Ĭ±
-0.82
xon
-0.74
Magn
-0.70
amac
-0.67
EGA
-0.67
Afee
-0.67
Enlarge
-0.66
phant
-0.66
OPA
-0.65
nor
-0.63
POSITIVE LOGITS
penned
0.84
poems
0.84
writer
0.82
written
0.81
smanship
0.81
wrote
0.79
aloud
0.79
eloqu
0.78
memos
0.78
writ
0.77
Activations Density 0.034%