INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    4
    0.96
    2
    0.92
    0.90
    5
    0.90
    3
    0.89
    0
    0.89
    1
    0.89
    7
    0.86
    8
    0.85
    Connection
    0.84
    POSITIVE LOGITS
    0.88
    0.81
    yyyyyyyy
    0.75
     βρί
    0.70
     lysosomes
    0.70
    biotics
    0.70
    llis
    0.68
    បាន
    0.67
     তুমি
    0.66
     έχει
    0.66
    Act Density 0.022%

    No Known Activations