INDEX
    Explanations

    verbs related to action or change

    New Auto-Interp
    Negative Logits
    was
    -0.62
    wasn
    -0.58
    Was
    -0.57
     Wasn
    -0.54
     %@",
    -0.51
    Wasn
    -0.49
    properly
    -0.47
     Eſ
    -0.45
    WAS
    -0.44
    }/${
    -0.44
    POSITIVE LOGITS
     are
    1.13
     gets
    0.83
     generates
    0.83
     goes
    0.81
     enters
    0.80
     takes
    0.80
     comes
    0.78
     extends
    0.76
     arrives
    0.76
     considers
    0.75
    Act Density 0.702%

    No Known Activations