    Explanations

    phrases expressing comparisons or clarifications in statements

    Negative Logits
    '));
    -0.86
    ']);
    
    -0.86
    ()));
    
    -0.85
    ']);
    -0.85
    "]);
    
    -0.84
    '));
    
    -0.82
    "));
    
    -0.81
    "]);
    -0.79
    }`).
    -0.79
    "));
    -0.78
    Positive Logits
    VersionUID
    0.99
    первых
    0.90
    Obrázky
    0.90
    Kedua
    0.88
    Lähteet
    0.88
    ,
    0.85
    Bronnen
    0.82
     Portanto
    0.82
     Asimismo
    0.78
     CURIAM
    0.75
    Activation Density: 0.622%

    No Known Activations