INDEX
    Explanations

    instances of opening or starting phrases, or segments in text

    the beginning of new sentences or segments

    Negative Logits
    Token           Logit
    " kasarigan"    -0.47
    " recently"     -0.46
    " kürzlich"     -0.46
    " Anfang"       -0.46
    " anfangs"      -0.46
    "ITOS"          -0.45
    " récents"      -0.45
    " autrefois"    -0.44
    " earlier"      -0.44
    " noong"        -0.43
    Positive Logits
    Token             Logit
    "PhysRevD"        0.72
    " bezeichneter"   0.71
    " مرئيه"          0.70
    "OGND"            0.66
    "setupUi"         0.65
    "λίου"            0.63
    "styleable"       0.62
    "ingeki"          0.62
    "preduce"         0.60
    " виправивши"     0.59
    Act Density 0.052%

    No Known Activations