INDEX
Explanations
phrases expressing confidence and the concept of divine faithfulness
New Auto-Interp
Negative Logits
deaux
-0.17
pic
-0.16
weit
-0.14
Pic
-0.14
shalt
-0.14
lus
-0.13
pic
-0.13
-License
-0.13
.Actions
-0.13
arter
-0.13
POSITIVE LOGITS
yourselves
0.18
Rh
0.15
ragon
0.14
uda
0.14
ceiving
0.14
auer
0.14
inton
0.14
iran
0.14
Fine
0.13
Colon
0.13
Activations Density 0.038%