INDEX
Explanations
phrases related to writing activities
instances of the word "write" in various forms and contexts
Negative Logits
Ĭ±      -0.82
eger    -0.82
illon   -0.75
alo     -0.74
Unity   -0.73
ILCS    -0.71
aband   -0.70
agara   -0.69
azar    -0.67
EGA     -0.67
Positive Logits
poems      0.85
writer     0.83
Write      0.80
manship    0.79
journal    0.79
smanship   0.79
poem       0.78
writing    0.77
wrote      0.77
writ       0.77
Activations Density: 0.033%