INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     اي
    -0.09
     briefing
    -0.08
     نه
    -0.08
    -0.07
     известно
    -0.07
     معيار
    -0.07
     hay
    -0.07
    enschaft
    -0.07
     е
    -0.07
     brinda
    -0.07
    POSITIVE LOGITS
     moc
    0.08
    illi
    0.08
     JAXB
    0.08
    Sensor
    0.07
    .merge
    0.07
     owl
    0.07
     nghiệp
    0.07
    halb
    0.07
    0.07
     wl
    0.07
    Act Density 0.034%

    No Known Activations