INDEX
    Explanations

    instances of reported speech, specifically phrases that indicate someone is conveying information

    New Auto-Interp
    Negative Logits
    zet
    -0.16
     Springs
    -0.15
     Featured
    -0.15
    oud
    -0.15
    zes
    -0.14
     Toy
    -0.14
     index
    -0.14
    rec
    -0.14
    ic
    -0.14
     indexes
    -0.14
    POSITIVE LOGITS
    yb
    0.16
    chwitz
    0.16
    Scalars
    0.15
     Vys
    0.15
    ProcessEvent
    0.15
     olmam
    0.14
    gba
    0.14
    quences
    0.14
    verts
    0.14
    ">ÃĹ</
    0.14
    Act Density 0.023%

    No Known Activations