INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     таблиц
    -0.07
    .exports
    -0.07
    vol
    -0.07
     mkdir
    -0.07
    @foreach
    -0.07
     spéc
    -0.06
    mkdir
    -0.06
    /****************
    -0.06
     tuple
    -0.06
    	board
    -0.06
    POSITIVE LOGITS
     thin
    0.12
     Thin
    0.09
    ún
    0.07
    Slim
    0.07
    Canadian
    0.07
     Slim
    0.07
    IN
    0.07
    Th
    0.07
     slim
    0.06
    think
    0.06
    Act Density 0.009%

    No Known Activations