INDEX
    Explanations

    numeric values or counts

    Negative Logits
     itſelf     -1.00
    "){         -0.96
    '));        -0.95
    колеп       -0.94
    ">:         -0.94
    ){}         -0.94
     '/';       -0.92
     })}        -0.92
    ſelf        -0.92
    ]]]         -0.90
    Positive Logits
     num        2.13
    num         2.10
    Num         1.91
     Num        1.71
    NUM         1.58
    setNum      1.37
     nums       1.32
    nums        1.30
     NUM        1.29
    getNum      1.27
    Activation Density: 0.069%

    No Known Activations