INDEX
    Explanations

    phrases related to ethical considerations and statements

    Negative Logits
    "fycat"  -0.59
    " kasarigan"  -0.58
    "archiviato"  -0.56
    " alike"  -0.55
    "postmedia"  -0.55
    "IVEREF"  -0.54
    "むしろ"  -0.54
    "odendron"  -0.53
    "かつ"  -0.53
    "ाहरण"  -0.52
    Positive Logits
    " which"  3.78
    "which"  3.19
    " WHICH"  2.74
    " Which"  2.68
    "Which"  2.63
    "hich"  1.94
    " laquelle"  1.87
    "ซึ่ง"  1.84
    " lequel"  1.76
    " которая"  1.75
    Activation Density: 1.365%

    No Known Activations