INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    taxonomy
    -0.08
     textbooks
    -0.08
     Kindes
    -0.08
     domanda
    -0.07
    nan
    -0.07
     ניצ
    -0.07
     Feira
    -0.07
     תורה
    -0.07
     Torah
    -0.07
    _valid
    -0.07
    POSITIVE LOGITS
     opacity
    0.13
    opacity
    0.13
    -opacity
    0.12
    Opacity
    0.12
    .opacity
    0.11
     faded
    0.11
    0.11
    透明
    0.11
     translucent
    0.10
     rgba
    0.10
    Act Density 0.005%

    No Known Activations