INDEX
    Explanations

    terms related to presentation or proposals

    New Auto-Interp
    Negative Logits
    Ìĥ
    -0.18
     flushed
    -0.16
    vit
    -0.15
    ear
    -0.15
     flushing
    -0.15
    /mit
    -0.15
    amba
    -0.14
     flush
    -0.14
    viz
    -0.14
    isch
    -0.14
    POSITIVE LOGITS
    fork
    0.49
     fork
    0.23
    pitch
    0.21
     Fork
    0.20
     pitch
    0.20
     forks
    0.20
    (es
    0.20
     Pitch
    0.19
     pitched
    0.18
    cock
    0.18
    Act Density 0.007%

    No Known Activations