INDEX
    Explanations

    statements related to speech and communication

    New Auto-Interp
    Negative Logits
    iej
    -0.15
     mouth
    -0.15
    ove
    -0.15
    iosis
    -0.14
     Gap
    -0.14
     mouths
    -0.14
    AMA
    -0.14
    atch
    -0.14
    Sizer
    -0.14
    ì±
    -0.14
    POSITIVE LOGITS
    IDER
    0.14
    assen
    0.14
     gauge
    0.14
    oming
    0.14
    ENU
    0.14
    itom
    0.14
    bish
    0.13
    agna
    0.13
    è¡Ľ
    0.13
    726
    0.13
    Act Density 0.323%

    No Known Activations