INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     companyId
    -0.57
    сторія
    -0.54
     TabLayout
    -0.52
    datei
    -0.50
    ceq
    -0.50
    thoff
    -0.50
     roleId
    -0.49
     doPost
    -0.49
     dto
    -0.49
    doi
    -0.48
    POSITIVE LOGITS
     green
    1.99
    Green
    1.84
     Green
    1.80
    green
    1.77
     GREEN
    1.65
    GREEN
    1.61
     yeşil
    1.32
     greens
    1.24
    绿
    1.21
     зелё
    1.16
    Act Density 0.010%

    No Known Activations