INDEX
Explanations
phrases relating to titles or headings that convey important information
instances of the text format indicating a new section or categorization in the document
New Auto-Interp
Negative Logits
aea
-0.71
perties
-0.71
lets
-0.69
alk
-0.68
vae
-0.67
nels
-0.65
eca
-0.65
atron
-0.64
auga
-0.64
chuk
-0.64
POSITIVE LOGITS
INFORMATION
1.47
LINE
1.46
HEAD
1.46
BOOK
1.46
THE
1.45
WORK
1.44
REPORT
1.44
PHOTO
1.43
ABOUT
1.40
WORK
1.39
Activations Density 0.202%