INDEX
Explanations
terms related to citations and references within written text
sentence-ending punctuation marks
New Auto-Interp
Negative Logits
ient
-0.72
ettlement
-0.66
ierre
-0.64
olls
-0.64
ierra
-0.63
isons
-0.63
âķIJâķIJ
-0.63
ikuman
-0.62
nearest
-0.61
ãĥ¼ãĥ³
-0.61
POSITIVE LOGITS
...]
1.36
â̦]
1.21
Pg
1.01
note
0.98
?]
0.97
citation
0.94
".[
0.90
Footnote
0.89
src
0.89
]
0.88
Activations Density 0.031%