INDEX
    Explanations

    Definitions and comparisons

    New Auto-Interp
    Negative Logits
     hasn
    1.00
     Está
    0.96
     చేస్తుంది
    0.92
     βρίσκεται
    0.91
     believes
    0.90
     was
    0.90
    was
    0.89
     Has
    0.89
    Has
    0.87
    పడుతుంది
    0.85
    POSITIVE LOGITS
     have
    1.63
     are
    1.60
     require
    1.50
     provide
    1.46
     vary
    1.44
     resemble
    1.43
     aren
    1.42
     mají
    1.41
     funktionieren
    1.40
     werden
    1.39
    Act Density 0.389%

    No Known Activations