INDEX
    Explanations

    options or recommendations

    Negative Logits
    Token            Logit
    "主な"           0.45
    " नाखून"         0.43
    " infamous"      0.42
    " ভয়"           0.41
    " notorious"     0.40
    " slang"         0.40
    " என்னும்"       0.39
    " Gesetz"        0.38
    "nitř"           0.38
    " victimes"      0.37

    Positive Logits
    Token            Logit
    " either"        0.92
    " Either"        0.80
    "either"         0.77
    "Either"         0.76
    " entweder"      0.71
    " либо"          0.66
    " preferably"    0.65
    " eğer"          0.63
    " if"            0.61
    " combination"   0.61

    Act Density 0.078%

    No Known Activations