    Negative Logits
    Token           Logit
    " Opera"        0.86
    "Opera"         0.83
    " pizzeria"     0.79
    " opera"        0.76
    "Customer"      0.76
    " wine"         0.76
    "警察"          0.75
    " deployed"     0.75
    " scotch"       0.73
    " преступ"      0.72
    Positive Logits
    Token           Logit
    " vitamins"     1.96
    " nutrients"    1.91
    " vitamin"      1.82
    " nutrition"    1.77
    " Vitamins"     1.77
    " Vitamin"      1.74
    " nutrient"     1.69
    " nutritional"  1.69
    " Nutrition"    1.63
    "Vitamin"       1.63
    Activation Density: 0.845%

    No Known Activations