INDEX
    Explanations

    the verb "be" in various forms and contexts

    Negative Logits
    Token            Logit
    " Bars"          -0.74
    "ciating"        -0.67
    "terday"         -0.66
    "might"          -0.61
    "ocre"           -0.61
    "plex"           -0.59
    "fortunately"    -0.58
    " compose"       -0.57
    " Bend"          -0.57
    "strous"         -0.57

    Positive Logits
    Token            Logit
    " able"          1.05
    "heading"        0.97
    " judged"        0.95
    "AMS"            0.95
    "fall"           0.95
    " replaced"      0.94
    " rewarded"      0.93
    " subjected"     0.92
    " deemed"        0.92
    "fitting"        0.91
    Activation Density: 0.230%

    No Known Activations