INDEX
    Explanations

    references to mathematical concepts or symbols

    New Auto-Interp
    Negative Logits
     I
    -0.62
     na
    -0.54
     i
    -0.51
     al
    -0.51
     it
    -0.50
     top
    -0.49
     l
    -0.49
     of
    -0.48
     lo
    -0.48
     U
    -0.47
    POSITIVE LOGITS
     MonoBehaviour
    1.05
    sidemargin
    1.03
    ^(@)
    0.98
    IVEREF
    0.93
    TagMode
    0.90
     Jefus
    0.89
     ویکی‌پدی
    0.89
     '\\;'
    0.88
     للاسماء
    0.88
    
    0.87
    Act Density 0.594%

    No Known Activations