INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Helm
    -0.07
     intertwined
    -0.06
     Falk
    -0.06
     barrier
    -0.06
     hallmark
    -0.06
     wary
    -0.06
    REFIX
    -0.06
     game
    -0.06
     challenged
    -0.06
     map
    -0.06
    POSITIVE LOGITS
     produced
    0.14
     produce
    0.13
     producing
    0.13
    producer
    0.12
     produces
    0.11
     production
    0.11
     Production
    0.11
    Production
    0.10
     Produced
    0.10
    製作
    0.10
    Act Density 0.091%

    No Known Activations