INDEX
    Explanations

    question words

    New Auto-Interp
    Negative Logits
     pressure
    -0.06
     Дем
    -0.06
    IPP
    -0.06
     transportation
    -0.06
     Avery
    -0.06
     Rita
    -0.06
     Able
    -0.06
     Tracker
    -0.05
     Şu
    -0.05
     Coordinates
    -0.05
    POSITIVE LOGITS
    kus
    0.07
     нат
    0.06
    0.06
     /*!↵
    0.06
     multic
    0.06
    _attached
    0.06
    archive
    0.06
    mys
    0.06
    .React
    0.06
     magg
    0.06
    Act Density 0.049%

    No Known Activations