INDEX
Explanations
Biblical references
formatted scripture references
New Auto-Interp
Negative Logits
ificant
-0.72
inate
-0.69
uve
-0.69
oler
-0.68
behavi
-0.67
dinand
-0.67
undai
-0.67
eatures
-0.67
reconc
-0.66
ocious
-0.65
POSITIVE LOGITS
00
1.02
59
0.95
58
0.94
53
0.93
30
0.92
09
0.92
12
0.89
45
0.88
51
0.88
13
0.88
Activations Density 0.049%