INDEX
    Explanations
    Negative Logits
    に興味    -1.07
     to    -0.91
     telefono    -0.90
    звичай    -0.89
     preferências    -0.89
     definitiva    -0.89
     düz    -0.86
    -0.82
     материалов    -0.81
    -0.81
    Positive Logits
    mixing    1.08
     Mixed    0.94
     hemel    0.93
    0.92
    стреча    0.90
     filmp    0.87
    0.87
    mixed    0.86
    0.85
     accessoires    0.85
    Act Density 0.022%

    No Known Activations