INDEX
    Explanations

    references to specific programs or evaluations

    New Auto-Interp
    Negative Logits
     moreover
    -0.94
    何より
    -0.93
    むしろ
    -0.90
     furthermore
    -0.89
     therefore
    -0.88
    Поэтому
    -0.87
     esimerkiksi
    -0.81
     Apalagi
    -0.81
     jopa
    -0.81
     apalagi
    -0.80
    POSITIVE LOGITS
     Allí
    0.88
     Prior
    0.74
     Its
    0.74
     该
    0.72
    0.71
    Prior
    0.71
     Briefly
    0.71
    Its
    0.71
     autorytatywna
    0.69
    InjectAttribute
    0.69
    Act Density 0.639%

    No Known Activations