INDEX
    Explanations

    references to scientific studies and related publication details

    New Auto-Interp
    Negative Logits
    ว
    -0.18
     blank
    -0.14
    ìn
    -0.14
     prov
    -0.14
    PIX
    -0.14
    isz
    -0.14
    nam
    -0.14
    ιά
    -0.13
    ossa
    -0.13
    cue
    -0.13
    POSITIVE LOGITS
    enting
    0.16
    ouser
    0.16
    YES
    0.14
    YE
    0.14
    orget
    0.14
     hunts
    0.14
    RC
    0.14
    -signed
    0.14
    allah
    0.13
    kus
    0.13
    Act Density 0.056%

    No Known Activations