INDEX
    Explanations

    news and other languages

    New Auto-Interp
    Negative Logits
     approximately
    0.41
     various
    0.40
     primarily
    0.39
     establish
    0.39
     completely
    0.37
     perpetrated
    0.36
     fundamentals
    0.36
     natively
    0.36
     representatives
    0.35
     quotations
    0.35
    POSITIVE LOGITS
    新たな
    0.47
     новой
    0.47
    जानिए
    0.46
     новый
    0.45
     новая
    0.45
     nowego
    0.44
     нового
    0.44
     νέα
    0.43
     Após
    0.42
     نئے
    0.42
    Act Density 0.012%

    No Known Activations