INDEX
Explanations
instances of the word "reference" and its variations, indicating a focus on citations or references in the text
New Auto-Interp
Negative Logits
iron
-0.16
ForeignKey
-0.15
frank
-0.15
ern
-0.15
ards
-0.15
arr
-0.15
arily
-0.14
eenth
-0.14
chá»ĭu
-0.14
ивÑĪи
-0.14
POSITIVE LOGITS
resher
0.18
izes
0.18
luž
0.16
coni
0.15
oldem
0.15
exual
0.15
ential
0.15
εια
0.15
attles
0.14
utable
0.14
Activations Density 0.058%