INDEX
    Negative Logits
    Token        Logit
    " chế"       -0.09
    " decorate"  -0.08
    " mode"      -0.08
    " quanh"     -0.08
    " horario"   -0.08
    " expiry"    -0.07
    " expir"     -0.07
                 -0.07
    " frees"     -0.07
    " manoe"     -0.07
    Positive Logits
    Token        Logit
    ".average"   0.11
    "平均"       0.11
    " averages"  0.10
    "average"    0.10
    "Average"    0.10
    "(avg"       0.10
    "Avg"        0.10
    " average"   0.09
    " Average"   0.09
    "_average"   0.09
    Activation Density: 0.037%

    No Known Activations