INDEX
Explanations
hyperlinks or connections between concepts/entities
mentions of "links" or connections within the text
Negative Logits
onent         -0.80
creen         -0.66
vation        -0.65
oyer          -0.64
romy          -0.63
pee           -0.62
cry           -0.62
Apocalypse    -0.61
iva           -0.60
sein          -0.60
Positive Logits
links          3.93
link           2.71
links          2.69
Links          2.40
Links          1.98
ties           1.98
linking        1.91
connections    1.90
link           1.82
LINK           1.76
Activations Density: 0.008%
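
To give a sense of scale for the density figure, here is a minimal sketch that assumes density means the fraction of tokens on which the feature's activation is above zero. The page does not state that definition. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical, chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is taken to mean active tokens / total tokens.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 8 tokens out of 100,000.
sample = [1.5] * 8 + [0.0] * 99_992
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.008% corresponds to roughly 8 active tokens per 100,000, which fits a feature that responds to a narrow family of tokens such as "link" and "links".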