INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     капю
    -0.87
     Gottlieb
    -0.83
     tigre
    -0.83
     shouldBe
    -0.81
     curtains
    -0.78
     antibodies
    -0.78
    ligere
    -0.78
     DataGridView
    -0.77
    граф
    -0.77
     fabrik
    -0.77
    POSITIVE LOGITS
     serving
    1.68
     Serving
    1.40
    serving
    1.26
    Serving
    1.25
     vase
    0.97
     vases
    0.94
     watering
    0.92
    watering
    0.92
     coffee
    0.86
     drinking
    0.83
    Act Density 0.021%

    No Known Activations