INDEX
    Explanations

    terms related to fossil fuels

    New Auto-Interp
    Negative Logits
    esp
    -0.07
    aday
    -0.06
    066
    -0.06
     Goose
    -0.06
    ongsTo
    -0.06
    ittings
    -0.06
    istrovstvÃŃ
    -0.06
    lessly
    -0.06
    ãĥ©ãĥ³ãĤ¹
    -0.06
    erval
    -0.06
    POSITIVE LOGITS
     Kaynak
    0.07
    /ros
    0.07
    ilere
    0.07
    Як
    0.07
     fuels
    0.07
    è³
    0.07
     yıldır
    0.07
     nhiên
    0.07
    ifer
    0.06
     Zu
    0.06
    Act Density 0.001%

    No Known Activations