INDEX
    Explanations

    phrases related to accountability and responsibility

    instances of a specific character or symbol throughout the text

    New Auto-Interp
    Negative Logits
     metic
    -0.85
     whistle
    -0.80
     radar
    -0.79
     shroud
    -0.77
     carriage
    -0.76
     transit
    -0.76
     swarm
    -0.75
     groom
    -0.75
     badge
    -0.75
     ul
    -0.75
    POSITIVE LOGITS
    these
    1.49
    that
    1.49
    when
    1.46
    they
    1.45
    better
    1.45
    while
    1.45
    say
    1.44
    little
    1.44
    there
    1.43
    the
    1.43
    Act Density 0.088%

    No Known Activations