INDEX
    Explanations

    phrases related to contributions or attributions

    instances of the word "to" indicating contribution or causation

    New Auto-Interp
    Negative Logits
    HUD
    -0.70
    Glass
    -0.69
    Champ
    -0.69
    CHAT
    -0.68
    IED
    -0.66
    STAT
    -0.62
    anned
    -0.62
    types
    -0.62
    framework
    -0.61
    rawler
    -0.61
    POSITIVE LOGITS
     compensate
    0.81
     assist
    0.78
     create
    0.76
     amplify
    0.76
     ensure
    0.75
     counteract
    0.75
     facilitate
    0.75
     clot
    0.73
     propagate
    0.72
     accumulate
    0.72
    Act Density 0.076%

    No Known Activations