INDEX
    Explanations

    references to the word "dim" and its variations

    New Auto-Interp
    Negative Logits
    naire
    -0.16
    zen
    -0.16
    ificant
    -0.15
    ame
    -0.15
    ificate
    -0.15
    aleza
    -0.15
    naires
    -0.15
    athan
    -0.15
    PFN
    -0.15
    alg
    -0.14
    POSITIVE LOGITS
    ENSIONS
    0.27
     Dim
    0.27
    dim
    0.26
    Dim
    0.25
    ENSION
    0.25
    inished
    0.25
     dim
    0.23
    ensions
    0.23
    ethyl
    0.22
    ,dim
    0.21
    Act Density 0.013%

    No Known Activations