INDEX
    Explanations

    expressions of apology and expressions of humility

    Negative Logits
     CreateTagHelper  -0.55
    complexContent    -0.55
    DebuggerNonUser   -0.55
     TextInputType    -0.55
    CppMethod         -0.53
    ModelAdmin        -0.52
    ValueStyle        -0.52
    RectangleBorder   -0.52
    StoryboardSegue   -0.52
    Dimanche          -0.51
    Positive Logits
     reciproc     0.45
     loyalty      0.41
    reci          0.40
     reciprocity  0.38
     loyal        0.36
     recipro      0.34
     generosity   0.33
     kindness     0.33
     envers       0.32
     love         0.31
    Act Density 0.247%

    No Known Activations