INDEX
    Explanations

    references to original versions or replicas of objects and their comparisons

    New Auto-Interp
    Negative Logits
    __":
    
    -0.82
     שוליים
    -0.79
    DebuggerNonUser
    -0.78
    WriteAttribute
    -0.78
    complexContent
    -0.78
    ंदीखरीदारी
    -0.77
     estimés
    -0.76
    tagHelperRunner
    -0.76
    ChildIndex
    -0.74
    ContentAsync
    -0.73
    POSITIVE LOGITS
     original
    0.81
    original
    0.72
     originais
    0.65
     originals
    0.65
     Original
    0.63
    Original
    0.63
     originales
    0.61
     originale
    0.59
     ORIGINAL
    0.58
     оригинал
    0.58
    Act Density 0.620%

    No Known Activations