INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    }],
    -0.07
    ui
    -0.07
     inspires
    -0.06
     такого
    -0.06
     besteht
    -0.06
    orean
    -0.06
     fran
    -0.06
    _none
    -0.06
    -0.06
    /(
    -0.06
    POSITIVE LOGITS
     Markets
    0.09
     market
    0.08
     Market
    0.08
     markets
    0.08
     населения
    0.06
     sorrow
    0.06
     Kernel
    0.06
     supermarkets
    0.06
     Kant
    0.06
     Debate
    0.06
    Act Density 0.007%

    No Known Activations