INDEX
Explanations
words related to writing or composing text
references to the act of writing
New Auto-Interp
Negative Logits
Ĭ±
-0.84
ega
-0.78
EGA
-0.77
rolet
-0.75
abe
-0.75
alo
-0.72
Afee
-0.68
Magn
-0.66
agara
-0.66
nel
-0.65
POSITIVE LOGITS
poems
0.85
penned
0.84
smanship
0.84
writing
0.78
notebook
0.76
essays
0.75
letters
0.74
writer
0.74
poem
0.74
writing
0.74
Activations Density 0.036%