INDEX
    Explanations

    phrases indicating sets or collections of items

    New Auto-Interp
    Negative Logits
     militaires
    -0.62
     larmes
    -0.62
     alguno
    -0.62
     ibunya
    -0.61
     istrinya
    -0.61
    konomi
    -0.61
     déclar
    -0.60
     foramen
    -0.60
     démission
    -0.60
     inconvénients
    -0.60
    POSITIVE LOGITS
    =[];
    
    0.89
    )";
    
    0.89
    ...");
    
    0.81
     autorytatywna
    0.78
    {
    
    
    0.78
     dozen
    0.78
    %;
    
    0.78
    =[]
    
    0.77
    >";
    
    0.77
     "";
    
    0.76
    Act Density 0.326%

    No Known Activations