INDEX
    Explanations

    references to significant events or conditions that impact various practices, beliefs, or situations

    Negative Logits
    Token               Logit
    "ſelf"              -0.88
    "ſelves"            -0.85
    " houſe"            -0.84
    " ſche"             -0.79
    " Monfieur"         -0.78
    " Majefty"          -0.77
    " uſe"              -0.77
    " domani"           -0.76
    "RegressionTest"    -0.76
    ":✨"                -0.75
    Positive Logits
    Token               Logit
    " since"            1.23
    " lately"           0.96
    "since"             0.96
    " recent"           0.94
    " been"             0.94
    " recently"         0.89
    " SINCE"            0.88
    " depuis"           0.85
    "Since"             0.83
    " sejak"            0.83
    Activation Density: 0.789%

    No Known Activations