INDEX
    Explanations

    mentions of emissions or related terms

    references to emissions and their impacts on the environment

    New Auto-Interp
    Negative Logits
    Else
    -0.93
    ivas
    -0.81
    inosaur
    -0.75
    ISH
    -0.74
    anova
    -0.73
    rooms
    -0.72
    slice
    -0.71
    ciating
    -0.68
     Else
    -0.68
    amina
    -0.68
    POSITIVE LOGITS
     emissions
    1.35
     emission
    1.14
     emitting
    1.04
     emit
    0.95
     dioxide
    0.93
     pollution
    0.93
     gases
    0.92
     emitted
    0.90
     emits
    0.87
     pollutants
    0.84
    Act Density 0.026%

    No Known Activations