INDEX
    Explanations

    references to divine commandments and moral warnings

    Negative Logits
    .scalablytyped  -0.15
    _encoded        -0.15
    yna             -0.14
    licer           -0.14
     Äijá»Ļng       -0.14
    orus            -0.14
    åĬ¨             -0.14
    bero            -0.14
    orris           -0.13
    _css            -0.13
    Positive Logits
    cus             0.15
    694             0.14
    LECT            0.14
     feed           0.14
    riba            0.13
    URE             0.13
    lew             0.13
     Feed           0.13
    _Handle         0.13
    à¥įà¤Łà¤°      0.13
    Act Density 0.342%

    No Known Activations