INDEX
Explanations
religious terms or references
terms associated with belief and believers in a religious context
New Auto-Interp
Negative Logits
anka
-0.67
parks
-0.66
Mamm
-0.64
Armour
-0.63
æ©Ł
-0.62
Paste
-0.62
Mane
-0.62
Grab
-0.61
Plaza
-0.59
Agric
-0.59
POSITIVE LOGITS
ievers
1.26
ieving
1.04
believer
1.01
ieve
1.00
iever
0.94
cius
0.91
believers
0.87
ieved
0.83
ieves
0.83
believing
0.79
Activations Density 0.025%