INDEX
    Explanations

    terms related to dependence or reliance

    New Auto-Interp
    Negative Logits
    lette
    -0.20
    .au
    -0.18
    riage
    -0.17
    lement
    -0.17
    noc
    -0.16
    orra
    -0.16
    logg
    -0.15
    dea
    -0.15
    bed
    -0.15
    ÙĦÙĥ
    -0.14
    POSITIVE LOGITS
     upon
    0.29
    Upon
    0.24
     Upon
    0.24
     reliance
    0.24
     rely
    0.23
     relied
    0.22
     heavily
    0.22
     relies
    0.22
    upon
    0.21
    ably
    0.20
    Act Density 0.020%

    No Known Activations