INDEX
    Explanations

    expressions related to health consequences or controversies

    Negative Logits
    Token             Logit
    "]));"            -0.90
    " considérons"    -0.87
    "}),"             -0.86
    "}))"             -0.82
    "}));"            -0.82
    "✨:"              -0.82
    "])):"            -0.82
    "MLLoader"        -0.81
    "сылкі"           -0.80
    "новништво"       -0.80
    Positive Logits
    Token             Logit
    "0"               0.78
    "1"               0.71
    "2"               0.65
    " Sal"            0.60
    "żym"             0.57
    " Ann"            0.53
    "5"               0.53
    "4"               0.53
    "مار"             0.51
    " Salv"           0.51
    Activation Density: 0.157%

    No Known Activations