INDEX
    Explanations

    phrases related to organizing and categorizing items or information

    Negative Logits
    " Curtain"     -0.16
    "çħ"           -0.15
    " Chain"       -0.14
    "åĿª"          -0.14
    "ibi"          -0.14
    " halo"        -0.14
    " extrapol"    -0.14
    "ÙĦس"          -0.14
    "icot"         -0.13
    " grounds"     -0.13
    Positive Logits
    " box"         0.44
    " boxes"       0.41
    "box"          0.39
    "-box"         0.38
    " drawer"      0.37
    "boxes"        0.37
    " Box"         0.35
    " bag"         0.35
    "ç®±"          0.34
    "abox"         0.34
    Act Density 0.262%

    No Known Activations