INDEX
Explanations
references to religious locations or activities
terms related to religious sites and practices
Negative Logits
estern    -0.76
Carlson   -0.70
RAW       -0.68
balance   -0.67
urtle     -0.66
Plex      -0.66
apse      -0.65
err       -0.63
NER       -0.61
olor      -0.61
Positive Logits
pilgrimage   1.28
pilgrims     1.08
maiden       1.01
shrine       0.99
pilgr        0.99
worsh        0.88
worshipped   0.83
Shrine       0.83
worship      0.82
antry        0.80
Activations Density 0.030%