Explanations
text that has been edited or condensed
references to editing or altered content
Negative Logits
Token       Logit
falls       -0.79
fw          -0.76
aches       -0.74
fall        -0.74
ptoms       -0.74
gaard       -0.72
hood        -0.72
Arcade      -0.72
phal        -0.69
fell        -0.69
Positive Logits
Token       Logit
summ        0.84
excerpts    0.82
annex       0.78
transcript  0.76
edited      0.72
edited      0.68
icum        0.66
iola        0.66
narration   0.65
ison        0.64
Activations Density: 0.024%