INDEX
Explanations
hyperlinks or related text indicating the presence of links within a document
occurrences of the word "links" in various forms
New Auto-Interp
Negative Logits
Merit
-0.84
IRE
-0.80
sburg
-0.76
otos
-0.75
ARS
-0.71
ority
-0.65
Ħ¢
-0.64
Penguins
-0.64
ZI
-0.63
oÄŁ
-0.63
POSITIVE LOGITS
links
1.22
Links
1.04
edin
0.98
links
0.95
linking
0.94
cius
0.93
link
0.90
link
0.90
Links
0.86
lash
0.85
Activations Density 0.012%