INDEX
    Explanations

    mathematical symbols and notation, particularly those related to equations and variables

    New Auto-Interp
    Negative Logits
    -0.65
    )";
    
    -0.49
    ,
    -0.46
     one
    -0.42
    netto
    -0.42
     “
    -0.42
    1
    -0.42
    -0.41
    +"&
    -0.41
    ;}
    
    -0.41
    POSITIVE LOGITS
    \
    1.26
     tartalomajánló
    1.07
     \
    0.99
     виправивши
    0.91
    ########.
    0.91
    ^\
    0.85
     (\
    0.83
     $\$
    0.83
    发表于
    0.83
    ">\
    0.82
    Act Density 0.337%

    No Known Activations