INDEX
Explanations
references to biblical scripture and teachings related to faith and morality
New Auto-Interp
Negative Logits
auer
-0.16
479
-0.16
876
-0.16
360
-0.15
bla
-0.15
964
-0.15
765
-0.14
550
-0.14
950
-0.14
thousand
-0.14
POSITIVE LOGITS
verses
0.19
verse
0.17
verse
0.16
(vs
0.16
-vers
0.16
)↵↵↵↵↵↵↵↵
0.15
Vers
0.15
cola
0.15
/GL
0.15
vers
0.15
Activations Density 0.021%