INDEX
Explanations
passages indicating that something has been written
instances of the word "written" in various contexts
Negative Logits
abe       -0.86
Ĭ±        -0.85
nel       -0.82
ugal      -0.80
Shinra    -0.75
elli      -0.73
illon     -0.73
allows    -0.72
isters    -0.71
alin      -0.70
Positive Logits
essays        0.75
instrument    0.73
breath        0.72
acters        0.71
essay         0.70
excerpts      0.69
written       0.69
typed         0.68
aloud         0.67
itatively     0.66
Activations Density: 0.025%