INDEX
Explanations
highly cited information or references in a text
phrases related to academic citations
New Auto-Interp
Negative Logits
wards
-0.83
Stretch
-0.71
achine
-0.71
locks
-0.70
oops
-0.70
goodbye
-0.66
ritch
-0.65
halves
-0.65
OOOO
-0.65
goodness
-0.65
POSITIVE LOGITS
cited
3.67
quoted
2.28
cite
2.12
cites
2.08
citing
1.91
referenced
1.86
citations
1.74
citation
1.63
credited
1.52
touted
1.51
Activations Density 0.014%