INDEX
Explanations
references to divine attributes and characteristics
Negative Logits
oba       -0.16
heimer    -0.16
odb       -0.15
_ut       -0.15
yc        -0.14
oble      -0.14
526       -0.14
ut        -0.14
ance      -0.14
Hut       -0.14
Positive Logits
whom      0.17
Pant      0.15
Claude    0.14
REC       0.14
огÑĥ      0.14
evade     0.14
oned      0.14
enth      0.13
ÃŃsticas  0.13
äd        0.13
Activations Density: 0.119%