INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     снижение
    0.42
    ig
    0.40
     privilege
    0.39
     fragility
    0.39
    innen
    0.39
    asiti
    0.39
    وفيق
    0.39
    iski
    0.39
    omorph
    0.38
    されます
    0.38
    POSITIVE LOGITS
    Β
    0.44
    Kah
    0.43
     Zah
    0.43
     ACCEPT
    0.41
    به
    0.40
     BEN
    0.40
    BOARD
    0.40
     Kah
    0.40
     Bursa
    0.39
    0.38
    Act Density 0.000%

    No Known Activations