INDEX
    Explanations

    phrases that indicate conditionality or dependencies

    Negative Logits
    " Horst"           -0.67
    " Huerta"          -0.64
    "centralwidget"    -0.64
    "æa"               -0.62
    " Gauthier"        -0.61
    " पत्र"             -0.59
    " Burch"           -0.58
    " çı"              -0.58
    "peteer"           -0.57
                       -0.57
    Positive Logits
    " Depends"         1.38
    " depends"         1.36
    " depending"       1.30
    "depends"          1.29
    " depend"          1.29
    "Depends"          1.25
    " depended"        1.25
    "depend"           1.23
    "depending"        1.20
    " depende"         1.11
    Activation Density: 0.128%
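
The density figure above is usually read as the fraction of sampled tokens on which the feature fires at all. The following is a minimal sketch under that assumption; the function name `activation_density`, the zero threshold, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (zero by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 128 active tokens out of 100,000.
sample = [1.0] * 128 + [0.0] * (100_000 - 128)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.128%
```

At a density this low, the feature is silent on almost every token, which fits a feature tied to a narrow family of word forms such as those in the positive logits list.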

    No Known Activations