INDEX
    Explanations

    phrases indicating functions or roles, particularly those that involve serving as or acting as something significant

    New Auto-Interp
    Negative Logits
     propOrder
    -0.78
     Taktlose
    -0.77
     Sanger
    -0.77
     centrifug
    -0.72
    DataAnnotations
    -0.69
     Atari
    -0.69
    cherichia
    -0.69
     Valladolid
    -0.68
    :✨
    -0.67
     wipers
    -0.67
    POSITIVE LOGITS
    inghouse
    0.69
    ctid
    0.69
    "},
    
    0.65
    tifacts
    0.63
    +");
    0.62
    "),
    
    0.62
     sederhana
    0.62
     px
    0.62
    ுக்கு
    0.61
    "],
    
    0.61
    Act Density 0.151%

    No Known Activations