INDEX
Explanations
instances related to different branches of Christianity and Judaism, particularly focusing on critical viewpoints
references to religious beliefs and practices, specifically focusing on Catholicism and Christianity
New Auto-Interp
Negative Logits
individual
-0.75
abet
-0.75
umm
-0.66
lev
-0.62
Lear
-0.62
une
-0.61
organ
-0.60
TAIN
-0.60
vol
-0.60
Neg
-0.59
POSITIVE LOGITS
anship
0.99
opathy
0.83
ophobia
0.79
yip
0.79
imei
0.78
atism
0.78
ismo
0.75
jriwal
0.74
istry
0.74
anarchism
0.72
Activations Density 0.059%