INDEX
    Explanations

    phrases that reference specific parts or sections of a larger context

    Negative Logits
    Logit   Token (quoted to show leading whitespace)
    -0.87   "MLLoader"
    -0.83   " متعلقه"
    -0.82   "المناصب"
    -0.81   " يتيمه"
    -0.74   " wireType"
    -0.73   "ArgsConstructor"
    -0.69   "harusnya"
    -0.68   " downvotes"
    -0.68   " atheists"
    -0.66   " الحره"
    Positive Logits
    Logit   Token (quoted to show whitespace; \n is a newline)
    0.68    " the"
    0.55    " it"
    0.55    " our"
    0.52    " society"
    0.50    " their"
    0.48    " transfieras"
    0.47    "++){\n"
    0.47    " [\n"
    0.46    " her"
    0.45    " life"
    Activation Density: 0.269%

    No Known Activations