INDEX
    Explanations

    phrases describing specific attributes or features

    quantities and measurements related to objects and their properties

    Negative Logits
    projects    -0.89
    OWS         -0.77
    Train       -0.76
    onse        -0.76
    sports      -0.75
    cakes       -0.75
    bis         -0.75
    Products    -0.71
    Music       -0.71
    arten       -0.70
    Positive Logits
     tendency       1.25
     lifespan       1.19
     diameter       1.09
     expiration     1.07
     drawback       1.04
     propensity     1.02
     resemblance    1.02
     radius         0.97
     capacity       0.96
     reputation     0.95
    Activation Density: 0.207%

    No Known Activations