INDEX
    Explanations

    Well-being/recovery

    New Auto-Interp
    Negative Logits
     CreateTagHelper
    -0.89
     Cæsar
    -0.89
     Theſe
    -0.87
     Majefty
    -0.86
     Jefus
    -0.86
     Monfieur
    -0.85
     myſelf
    -0.82
     Houſe
    -0.82
    iastes
    -0.82
     NDEBUG
    -0.79
    POSITIVE LOGITS
     zu
    0.49
     zur
    0.46
     Di
    0.46
    ?
    0.45
     to
    0.44
     apa
    0.44
     fais
    0.43
    関係
    0.43
     trình
    0.43
     Separate
    0.42
    Act Density 1.571%

    No Known Activations