INDEX
Explanations
references to biblical figures, texts, or books from both the Old and New Testaments
New Auto-Interp
Negative Logits
ÑĢеж
-0.18
oload
-0.18
uss
-0.17
uchar
-0.16
rent
-0.15
ult
-0.14
еж
-0.14
jon
-0.14
agg
-0.14
aining
-0.14
POSITIVE LOGITS
nackte
0.15
ruba
0.15
_acl
0.14
uncios
0.14
piel
0.14
.Linked
0.14
Advoc
0.14
leftright
0.13
eniable
0.13
IVEN
0.13
Activations Density 0.140%