INDEX
    Explanations

    phrases indicating similarity or comparison

    New Auto-Interp
    Negative Logits
     referenties
    -0.56
    Ведь
    -0.52
     cref
    -0.48
    MLLoader
    -0.48
     neuem
    -0.47
     jLabel
    -0.47
    ipot
    -0.47
    >",
    
    -0.45
     solely
    -0.45
     punta
    -0.44
    POSITIVE LOGITS
     likewise
    1.05
    Similarly
    0.94
    Likewise
    0.92
     Similarly
    0.91
     similarly
    0.89
     Likewise
    0.89
    zelfde
    0.85
     sebaliknya
    0.83
     Ditto
    0.81
     كذلك
    0.80
    Act Density 0.264%

    No Known Activations