    Negative Logits
     ontde          -0.08
    Found           -0.08
                    -0.08
    	found          -0.08
     unspecified    -0.08
     lifes          -0.08
    .generated      -0.07
    歓迎              -0.07
    andoned         -0.07
                    -0.07
    Positive Logits
    offset          0.10
    _OFFSET         0.09
     bounding       0.09
    Bounding        0.09
    _offset         0.09
     Bounding       0.09
     offsets        0.09
    bounding        0.08
     cumulative     0.08
    .bounding       0.08
    Activation Density: 0.004%

    No Known Activations