INDEX
    Explanations

    expressions of emotional turmoil and regret

    New Auto-Interp
    Negative Logits
    odly
    -0.57
     trouble
    -0.55
     kew
    -0.55
    !*\
    -0.51
     new
    -0.50
     nice
    -0.50
     hassle
    -0.48
     sharp
    -0.47
     Spaß
    -0.47
     good
    -0.47
    POSITIVE LOGITS
    Personendaten
    0.77
    GEBURTSDATUM
    0.75
    SharedDtor
    0.73
     betweenstory
    0.69
    FormTagHelper
    0.69
    protoimpl
    0.68
    ArrowToggle
    0.68
    StructEnd
    0.67
    giveness
    0.65
    RegressionTest
    0.65
    Act Density 0.424%

    No Known Activations