INDEX
    Explanations

    verbs related to actions of taking, such as "takes," "took," "taking," and "taken."

    New Auto-Interp
    Negative Logits
    lite
    -0.69
    raid
    -0.69
    è¦ļéĨĴ
    -0.67
     ILCS
    -0.65
    gian
    -0.64
    eous
    -0.62
    lex
    -0.62
     Conclusion
    -0.60
    arie
    -0.60
     constitu
    -0.60
    POSITIVE LOGITS
     advantage
    1.16
     refuge
    0.96
     pains
    0.95
     aim
    0.94
     um
    0.85
     charge
    0.83
     place
    0.82
     part
    0.82
     inspiration
    0.82
     aback
    0.81
    Act Density 0.074%

    No Known Activations