INDEX
    Explanations

    define, assume, call, cover, build

    Negative Logits
    Token              Value
    " olabilir"        0.44
    " יכול"            0.42
    " ويمكن"           0.42
    " beschäftigt"     0.42
    " essere"          0.42
    "പ്പെട"             0.40
                       0.38
    "を楽し"            0.37
    "失望"              0.36
    "gerufen"          0.36

    Positive Logits
    Token              Value
    " use"             0.80
    " använda"         0.62
    " remove"          0.60
    " utilize"         0.60
    " incorporate"     0.60
    " bruke"           0.59
    " create"          0.58
    " put"             0.57
    " использовать"    0.56
    " apply"           0.55
    Activation Density: 0.437%

    No Known Activations