INDEX
    Explanations

    exercises and body parts related to physical fitness

    Negative Logits
    Token         Logit
    " Hoover"     -0.82
    "yrinth"      -0.74
    " Clover"     -0.74
    " Kafka"      -0.72
    " Ans"        -0.67
    " Dickens"    -0.67
    " Booth"      -0.66
    " Doodle"     -0.65
    " Nex"        -0.64
    " Jarrett"    -0.63
    Positive Logits
    Token         Logit
    "guards"       1.43
    "guard"        1.23
    "building"     1.22
    "builders"     1.19
    "builder"      1.12
    "weight"       1.10
    " politic"     1.04
    "parts"        1.02
    "wash"         0.92
    "fat"          0.90
    Activation Density: 0.036%

    No Known Activations