INDEX
    Explanations

    phrases related to contrasting or comparing different aspects or entities

    phrases that contrast different subjects or points of view

    Negative Logits
    "obin"      -0.65
    "hess"      -0.58
    "zing"      -0.55
    "ahl"       -0.55
    " Roose"    -0.55
    " <@"       -0.55
    " '."       -0.53
    "zn"        -0.53
    "GET"       -0.51
    " Slash"    -0.51
    Positive Logits
    " fared"         0.73
    " flourished"    0.72
    " outper"        0.70
    " accommod"      0.68
    " disclaim"      0.68
    " thri"          0.67
    " derives"       0.67
    " teaches"       0.66
    "pmwiki"         0.66
    "lishes"         0.65
    Activation Density: 0.292%

    No Known Activations