INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ніципалі
    -0.59
     utafitiHapana
    -0.59
    NameInMap
    -0.57
    ništ
    -0.56
     IBOutlet
    -0.54
    twimg
    -0.52
    LayoutPanel
    -0.52
     Biôgrafia
    -0.51
    Chham
    -0.51
     arşivlendi
    -0.50
    POSITIVE LOGITS
    /**
    1.56
    /**
    
    1.23
    +/**
    1.03
    ///**
    0.97
     /**
    0.97
    /**
    0.91
    /*
    0.84
    /**
    
    
    0.78
    -/**
    0.74
    /***
    0.73
    Act Density 0.001%

    No Known Activations