INDEX
Explanations
references to religious texts and figures
references to religious texts and figures
New Auto-Interp
Negative Logits
bledon
-0.81
leneck
-0.80
pless
-0.78
rared
-0.77
llular
-0.77
ptive
-0.74
ovo
-0.74
srfAttach
-0.73
osponsors
-0.72
jri
-0.72
POSITIVE LOGITS
Interpret
1.11
narrated
0.94
verse
0.94
narration
0.90
narr
0.89
verses
0.87
manuscripts
0.87
manuscript
0.87
texts
0.86
Romans
0.86
Activations Density 0.172%