INDEX
    Explanations

    measurements of weight and dimensions

    New Auto-Interp
    Negative Logits
    bis
    -0.16
    rost
    -0.16
    688
    -0.15
    idar
    -0.15
     birthday
    -0.15
    irs
    -0.14
     cold
    -0.14
    Ins
    -0.14
    ij
    -0.14
     Grill
    -0.14
    POSITIVE LOGITS
    @student
    0.16
    mdi
    0.16
    ÎłÎij
    0.15
    ltre
    0.14
    ewood
    0.14
    ourd
    0.14
    rrha
    0.14
    acci
    0.14
    еÑĩно
    0.13
    ITTE
    0.13
    Act Density 0.008%

    No Known Activations