    Explanations

    Phrases about the cost or consequences of an action, especially sacrifices made for personal or collective gain.

    Negative Logits
    Token             Logit
    "addon"           -0.16
    "iams"            -0.15
    "ina"             -0.15
    "HttpResponse"    -0.15
    " Tong"           -0.14
    "ť"               -0.14
    " landmark"       -0.14
    "adelphia"        -0.14
    " McK"            -0.13
    "hit"             -0.13
    Positive Logits
    Token             Logit
    " expense"        0.52
    " Expense"        0.43
    "expense"         0.40
    " at"             0.38
    " detriment"      0.34
    " expenses"       0.33
    "Expense"         0.33
    " sacrifice"      0.32
    " Kosten"         0.30
    " sacrificing"    0.29
    Activation Density: 0.083%

    No Known Activations