INDEX
    Explanations
    Negative Logits
        ecru         -0.92
        luxuriant    -0.79
        pyridine     -0.77
        tetrach      -0.74
        friable      -0.70
        calyx        -0.68
        cupola       -0.67
        mauve        -0.67
        mohair       -0.66
        annulus      -0.66
    Positive Logits
        kac          0.93
        kram         0.91
        logis        0.90
        bera         0.87
        antik        0.87
        reger        0.86
        simplif      0.85
        Kategor      0.85
        panik        0.83
        glan         0.82
    Activation Density: 0.087%

    No Known Activations