INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Regression
    -0.10
     regression
    -0.08
     hesab
    -0.08
     GAM
    -0.08
    RAR
    -0.08
     grada
    -0.08
     Prog
    -0.08
     перев
    -0.08
     Gam
    -0.08
     filed
    -0.07
    POSITIVE LOGITS
     shaping
    0.11
     opinions
    0.11
    意见
    0.10
     attitudes
    0.09
     shaped
    0.09
     opiniões
    0.08
     history
    0.08
    curve
    0.08
     destiny
    0.08
     shape
    0.08
    Act Density 0.025%

    No Known Activations