INDEX
    Explanations

    references to Jewish history and cultural topics

    New Auto-Interp
    Negative Logits
    æħİ
    -0.16
     ho
    -0.15
     dest
    -0.15
    dub
    -0.13
    ÑİÑĢ
    -0.13
     ro
    -0.13
    erer
    -0.13
     h
    -0.13
    Untitled
    -0.13
    act
    -0.13
    POSITIVE LOGITS
     volume
    0.24
    volume
    0.22
    -volume
    0.21
     Volume
    0.20
     vol
    0.19
    ;:
    0.18
    Volume
    0.18
    .pb
    0.17
    (volume
    0.17
    .volume
    0.16
    Act Density 0.127%

    No Known Activations