INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Press
    -0.07
     Especially
    -0.07
    кова
    -0.07
    .attrs
    -0.06
    کو
    -0.06
    PRESS
    -0.06
    .checkBox
    -0.06
    irection
    -0.06
     pressured
    -0.06
    -0.06
    POSITIVE LOGITS
     dining
    0.14
     Dining
    0.14
     dine
    0.10
     Din
    0.09
     diner
    0.09
     fluent
    0.08
     din
    0.07
    0.07
     Wyn
    0.07
    	cin
    0.06
    Act Density 0.002%

    No Known Activations