INDEX
    Explanations

    terms related to veterinary practices and animal welfare

    Negative Logits
    <unused41>  -1.07
    <unused17>  -1.06
    <unused43>  -1.06
    <unused79>  -1.06
    <unused28>  -1.06
    <unused3>   -1.06
    <unused80>  -1.06
    <unused74>  -1.06
    <unused51>  -1.06
    <pad>       -1.05
    Positive Logits
    </b>        0.59
                0.47
     a          0.47
                0.46
     (          0.42
     The        0.42
     in         0.41
     the        0.40
    1           0.40
    cap         0.39
    Act Density 3.006%

    No Known Activations