INDEX
    Explanations

    phrases related to physical structures

    references to different types of structures

    Negative Logits
    Token            Logit
    "atz"            -0.68
    "bat"            -0.65
    "sung"           -0.64
    "lethal"         -0.64
    " Miss"          -0.64
    " Medals"        -0.63
    "bert"           -0.63
    "deals"          -0.63
    "hops"           -0.63
    " Detective"     -0.62

    Positive Logits
    Token            Logit
    " structure"      3.64
    " Structure"      2.85
    " structures"     2.65
    "ructure"         1.78
    " stru"           1.71
    " Struct"         1.70
    " structured"     1.50
    " structural"     1.44
    " mechanism"      1.43
    " struct"         1.39
    Activation Density: 0.022%

    No Known Activations