INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Car
    -0.71
    -0.69
     car
    -0.66
     ar
    -0.62
     far
    -0.59
     C
    -0.57
     ro
    -0.57
     B
    -0.57
     S
    -0.56
     R
    -0.56
    POSITIVE LOGITS
     myſelf
    1.61
     itſelf
    1.55
     Jefus
    1.43
     Efq
    1.41
     Majefty
    1.38
     Theſe
    1.38
     Anſ
    1.36
     Monfieur
    1.35
     purpoſe
    1.34
    GEBURTSDATUM
    1.34
    Act Density 0.241%

    No Known Activations