INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ر
    -0.07
    орая
    -0.07
    ْس
    -0.06
    leccion
    -0.06
     flowering
    -0.06
    คราม
    -0.06
     waiver
    -0.06
    щини
    -0.06
     Για
    -0.06
     CD
    -0.06
    POSITIVE LOGITS
     latex
    0.07
     ecl
    0.07
     overlaps
    0.06
     zig
    0.06
     Olympus
    0.06
     gf
    0.06
    EXIST
    0.06
     вост
    0.06
     alleles
    0.06
    ojení
    0.06
    Act Density 0.002%

    No Known Activations