INDEX
Explanations
terms related to valuable, historically significant objects or creations, such as artifacts and relics
mentions of artifacts and relics
Negative Logits
sen         -0.75
NN          -0.73
creen       -0.68
hua         -0.66
secut       -0.66
coli        -0.65
rol         -0.63
ppers       -0.63
ationally   -0.62
Western     -0.62
Positive Logits
artifact    1.47
artifacts   1.44
arte        1.34
ifacts      1.33
artifacts   1.33
Artifact    0.96
relics      0.95
ifact       0.94
smanship    0.91
ibles       0.85
Activations Density: 0.008%