INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Castillo
    -0.09
     However
    -0.07
     Stella
    -0.07
    However
    -0.07
    /count
    -0.07
     theft
    -0.07
     entrepreneurship
    -0.07
     Atlas
    -0.07
    Sketch
    -0.06
     clerk
    -0.06
    POSITIVE LOGITS
    .rf
    0.07
    HeaderCode
    0.07
    igram
    0.06
    ibi
    0.06
    Rx
    0.06
    �s
    0.06
    `t
    0.06
    icone
    0.06
     Hod
    0.06
    روز
    0.06
    Act Density 0.025%

    No Known Activations