INDEX
    Explanations

    religion and sciences

    New Auto-Interp
    Negative Logits
     Efq
    -1.34
     Monfieur
    -1.30
     greateſt
    -1.28
     ſeveral
    -1.24
     myſelf
    -1.23
     Anſ
    -1.22
     pleaſure
    -1.22
     reaſon
    -1.20
     Reſ
    -1.19
     purpoſe
    -1.19
    POSITIVE LOGITS
    0.74
    ,
    0.73
    s
    0.71
     is
    0.66
     in
    0.65
     of
    0.64
    .
    0.64
     (
    0.63
    ic
    0.61
    ↵↵
    0.60
    Act Density 0.445%

    No Known Activations