INDEX
    Explanations

    references to "Box" and its associated numbers or categories

    New Auto-Interp
    Negative Logits
    ustrial
    -0.16
    izr
    -0.16
    urre
    -0.15
    ufs
    -0.15
    odate
    -0.15
    reich
    -0.15
    ufen
    -0.15
     Dew
    -0.15
    udes
    -0.14
     Mug
    -0.14
    POSITIVE LOGITS
    <dyn
    0.25
    (es
    0.22
    -sizing
    0.22
    (Box
    0.21
    .Box
    0.21
    .box
    0.21
    IsEmpty
    0.20
    -shadow
    0.19
    meer
    0.18
    (box
    0.18
    Act Density 0.036%

    No Known Activations