INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ActiveRecord
    -0.07
     Front
    -0.07
    .clock
    -0.07
    andy
    -0.07
     sauna
    -0.06
    -fold
    -0.06
    .Insert
    -0.06
    otlin
    -0.06
     bananas
    -0.06
    -0.06
    POSITIVE LOGITS
    errorCode
    0.08
    crafted
    0.07
    ocratic
    0.07
    une
    0.06
     ErrorCode
    0.06
    -taking
    0.06
    >E
    0.06
    sunuz
    0.06
     olacaktır
    0.06
    ρωπα
    0.06
    Act Density 0.008%

    No Known Activations