INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     cser
    -0.60
    vited
    -0.52
    odils
    -0.50
     nephro
    -0.49
     fausse
    -0.49
     desorption
    -0.48
     bailando
    -0.48
    staw
    -0.47
    ostante
    -0.47
     kysy
    -0.47
    POSITIVE LOGITS
     vehicle
    3.59
     Vehicle
    3.28
    vehicle
    3.23
    Vehicle
    3.03
     VEHICLE
    2.99
     vehicles
    2.85
     Vehicles
    2.69
    vehicles
    2.54
    VEHICLE
    2.35
    Vehicles
    2.28
    Act Density 0.085%

    No Known Activations