INDEX
Explanations
edits or edited versions of text
instances of the word "edited" and its variations
Negative Logits
Token    Logit
fall     -0.77
hood     -0.77
bley     -0.75
falls    -0.75
phal     -0.74
pton     -0.74
ptoms    -0.74
fw       -0.70
ãĤ©      -0.69
aches    -0.69
Positive Logits
Token         Logit
summ          0.82
excerpts      0.76
editing       0.74
transcript    0.73
edits         0.73
annex         0.72
orship        0.70
edited        0.69
Delete        0.66
edited        0.66
Activation Density: 0.020%