INDEX
    Explanations

    phrases indicating processes, findings, or claims made in research contexts

    New Auto-Interp
    Negative Logits
     and
    -0.64
    Життєпис
    -0.60
    ngdoc
    -0.52
    However
    -0.47
    ביוגרפיה
    -0.46
    atoms
    -0.46
    -0.46
     but
    -0.45
    AxisAlignment
    -0.45
    sidemargin
    -0.44
    POSITIVE LOGITS
    ografija
    0.76
     Paglinawan
    0.68
    Personensuche
    0.67
    xase
    0.65
     Wikimedijinoj
    0.64
     дописавши
    0.61
    abestanden
    0.60
    (!__
    0.59
    hyrchwyd
    0.57
    SharedCtor
    0.57
    Act Density 0.670%

    No Known Activations