INDEX
    Explanations

    references to miraculous events or experiences

    New Auto-Interp
    Negative Logits
     Lightning
    -0.15
    inium
    -0.14
    kea
    -0.14
    mgr
    -0.14
    way
    -0.14
    ore
    -0.14
    iph
    -0.14
     Ying
    -0.13
    light
    -0.13
    ugh
    -0.13
    POSITIVE LOGITS
    iam
    0.22
    roring
    0.19
    rored
    0.18
    rors
    0.18
    ACLE
    0.18
    avit
    0.17
    iams
    0.17
    quee
    0.17
    IAM
    0.17
    rror
    0.17
    Act Density 0.013%

    No Known Activations