INDEX
Explanations
timestamps for when something was written
instances of the word "wrote."
Negative Logits
Ĭ±          -0.83
xon         -0.77
Enlarge     -0.71
Ton         -0.70
phant       -0.68
RELATED     -0.68
Nor         -0.68
amac        -0.67
Amph        -0.67
EGA         -0.67
Positive Logits
penned      0.91
written     0.89
smanship    0.88
wrote       0.87
eloqu       0.85
scrib       0.85
aloud       0.84
vironment   0.83
write       0.83
writes      0.82
Activations Density 0.022%