INDEX
    Explanations

    phrases related to reasons or causes

    phrases that denote reasons or causes behind events or actions

    NEGATIVE LOGITS
    "Mario"          -0.71
    "AIN"            -0.68
    "ena"            -0.65
    "Italy"          -0.65
    "forestation"    -0.64
    "enegger"        -0.64
    "severe"         -0.64
    "isha"           -0.64
    "fox"            -0.63
    "173"            -0.63
    POSITIVE LOGITS
    " differences"        0.89
    " discrepancies"      0.84
    " difference"         0.84
    " workings"           0.82
    " motivations"        0.79
    " tendencies"         0.79
    " priorities"         0.77
    " similarities"       0.77
    " characteristics"    0.76
    " preferences"        0.75
    Activation Density: 1.126%

    No Known Activations