INDEX
    Explanations

    instances of the word "take" in various forms and contexts

    New Auto-Interp
    Negative Logits
    ç«ĭãģ¦
    -0.16
    isers
    -0.15
    ifu
    -0.15
    isten
    -0.15
    å¼ĺ
    -0.15
    aylight
    -0.14
    imity
    -0.14
    ãģ¹ãģį
    -0.14
    uplic
    -0.14
    gary
    -0.14
    POSITIVE LOGITS
     advantage
    0.23
     inspiration
    0.21
     existing
    0.20
     us
    0.20
     ideas
    0.19
     concepts
    0.19
     classic
    0.19
     cue
    0.18
     elements
    0.18
     readers
    0.18
    Act Density 0.047%

    No Known Activations