INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    vana
    -0.83
    enment
    -0.72
    anches
    -0.72
    iddler
    -0.70
     legalized
    -0.65
    enfranch
    -0.64
     womb
    -0.63
     mathemat
    -0.63
     nurturing
    -0.62
    avement
    -0.62
    POSITIVE LOGITS
    Dat
    0.85
    py
    0.78
    ming
    0.76
    bot
    0.73
     Temp
    0.70
    rite
    0.69
    Tor
    0.67
    >]
    0.66
    ________________________________
    0.65
     TBA
    0.64
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.