INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    really
    -0.81
     飲み
    -0.81
    -0.79
    melte
    -0.79
    衣服
    -0.75
     valutazione
    -0.75
    -0.74
    -0.74
    JsonInclude
    -0.72
     Bunt
    -0.71
    POSITIVE LOGITS
     black
    1.88
     Black
    1.46
     red
    1.41
    black
    1.38
    BLACK
    1.30
    Black
    1.27
     BLACK
    1.25
     pink
    1.10
     blue
    1.07
    1.06
    Act Density 0.013%

    No Known Activations