INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     violins
    -0.56
     doctors
    -0.54
     rules
    -0.54
    Parcelize
    -0.53
     accountants
    -0.52
     fields
    -0.52
     factories
    -0.52
     autoradio
    -0.51
     cracks
    -0.50
    -0.50
    POSITIVE LOGITS
     beverage
    1.87
     beverages
    1.61
     Beverage
    1.59
     Beverages
    1.27
    Bever
    1.12
     bebidas
    1.07
     boissons
    1.06
     bebida
    1.02
     boisson
    0.94
     Getränke
    0.86
    Act Density 0.003%

    No Known Activations