INDEX
Explanations
beliefs about spirituality and personal transformation
New Auto-Interp
Negative Logits
erif
-0.17
vide
-0.16
oppins
-0.16
Vog
-0.16
ahun
-0.15
isas
-0.15
aepernick
-0.15
enderit
-0.14
908
-0.14
avin
-0.14
POSITIVE LOGITS
saved
0.25
Saved
0.23
saved
0.21
sons
0.21
vessels
0.20
chosen
0.20
regenerated
0.20
regenerate
0.20
Elect
0.20
Saved
0.19
Activations Density 0.039%