INDEX
Explanations
biblical references or citations related to academic works
New Auto-Interp
Negative Logits
vang
-0.17
oftware
-0.16
ascus
-0.16
erb
-0.15
ichier
-0.15
achers
-0.15
eba
-0.14
yme
-0.14
lland
-0.14
orate
-0.14
POSITIVE LOGITS
justice
0.17
Binder
0.16
abs
0.15
abra
0.14
worthy
0.14
istrovstvÃŃ
0.14
justice
0.14
598
0.14
ronic
0.14
McDon
0.14
Activations Density 0.019%