INDEX
    Explanations

    references to improvement or enhancement

    Negative Logits
    "skosten"        -0.69
    " fiscale"       -0.64
    "Vorte"          -0.62
    " Skyscanner"    -0.61
    "lectricité"     -0.60
    " Steinberg"     -0.60
    " Kos"           -0.60
    "Jackie"         -0.60
    "geology"        -0.60
    " jLabel"        -0.59
    Positive Logits
    " BETTER"        1.45
    "Better"         1.44
    "better"         1.40
    " Better"        1.40
    " better"        1.40
    " bedre"         0.99
    " beter"         0.98
    " besseren"      0.93
    " mieux"         0.93
    " betterment"    0.92
    Activation Density: 0.060%

    No Known Activations