INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     instinct
    -0.07
     ligero
    -0.07
    618
    -0.07
     cage
    -0.07
    vare
    -0.07
     voluntary
    -0.07
    restriction
    -0.07
     restriction
    -0.07
     legger
    -0.07
     worms
    -0.07
    POSITIVE LOGITS
    foreach
    0.12
    .foreach
    0.12
    .each
    0.11
     foreach
    0.11
     từng
    0.10
    Batch
    0.10
    	foreach
    0.10
    0.10
    ,每
    0.10
    Iterator
    0.10
    Act Density 0.020%

    No Known Activations