    Negative Logits
    Token           Logit
    " child"        -2.02
    " childhood"    -1.93
    " CHILD"        -1.88
    "child"         -1.85
    "Childhood"     -1.78
    " childs"       -1.72
    " Childhood"    -1.66
    " Child"        -1.65
    "CHILD"         -1.55
    "Child"         -1.46
    Positive Logits
    Token           Logit
    ","             0.68
                    0.61
    "'"             0.60
    " was"          0.55
    "."             0.55
    "-"             0.55
    " is"           0.53
    "?"             0.53
    "1"             0.53
    "("             0.52
    Activation Density: 0.323%

    No Known Activations