INDEX
Explanations
references to religious figures, specifically bishops and their roles
New Auto-Interp
Negative Logits
ouz
-0.15
KIT
-0.15
cz
-0.15
/cgi
-0.14
ullah
-0.14
llib
-0.14
istically
-0.14
emale
-0.14
liner
-0.14
tin
-0.14
POSITIVE LOGITS
ric
0.28
rics
0.22
esses
0.19
rica
0.17
ofs
0.16
dom
0.16
loo
0.15
å²Ĺ
0.15
ÑĢÑĥÑĤ
0.15
-elect
0.15
Activations Density 0.029%