INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     any
    1.36
     other
    1.19
    any
    1.12
     restrictions
    1.11
     specific
    1.11
    任何
    1.08
    other
    1.06
     anymore
    1.02
     사항
    1.02
     limitations
    1.01
    POSITIVE LOGITS
     લગભગ
    0.96
     sometime
    0.92
     некоторое
    0.91
    Efficient
    0.87
     прекрасно
    0.87
     наиболее
    0.86
     تقریباً
    0.85
     некоторые
    0.83
     Efficient
    0.82
     множество
    0.81
    Act Density 0.036%

    No Known Activations