INDEX
Explanations
references to specific documents or written records
references to documents with specific lengths and characteristics
Negative Logits
rians         -0.86
uve           -0.77
rossover      -0.75
region        -0.75
htaking       -0.73
ablishment    -0.72
utical        -0.71
alian         -0.71
lees          -0.71
abiding       -0.71
Positive Logits
diary            1.46
autobiography    1.44
manifesto        1.39
broch            1.38
handwritten      1.36
manuscript       1.36
memoir           1.34
pamphlet         1.34
notebook         1.33
booklet          1.32
Activations Density 0.314%