INDEX
    Explanations

    words related to influence, regulation, and impact

    New Auto-Interp
    Negative Logits
    mad
    -0.75
    DragonMagazine
    -0.73
    iphate
    -0.69
    ãĤ»
    -0.69
     sidx
    -0.67
    rit
    -0.65
     topped
    -0.65
    shake
    -0.65
     hey
    -0.63
    vic
    -0.63
    POSITIVE LOGITS
    ively
    0.95
     them
    0.92
    orously
    0.87
     our
    0.83
    uate
    0.80
     the
    0.80
     aspects
    0.78
     him
    0.77
    ibly
    0.76
    ably
    0.72
    Act Density 1.725%

    No Known Activations