INDEX
    Explanations

    phrases indicating specific occurrences or referents in context

    Negative Logits
     more         -0.54
    tamment       -0.53
     antaranya    -0.50
     sonst        -0.50
     ninh         -0.49
    ampingi       -0.49
    êmio          -0.48
    sonian        -0.48
    Contactez     -0.47
     rarement     -0.46
    Positive Logits
    この            0.70
    這一            0.69
     este         0.68
     ఈ            0.67
     kasarigan    0.67
                  0.67
     この           0.66
                  0.66
    Этот          0.65
     kind         0.65
    Act Density 0.137%

    No Known Activations