INDEX
    Explanations

    numerical expressions and mathematical symbols

    New Auto-Interp
    Negative Logits
    "]];
    -0.96
    '],
    
    -0.89
    "]);
    
    -0.89
    ."));
    -0.88
    "])
    
    -0.85
    "],
    
    -0.84
    ();*/
    -0.83
    '},
    
    -0.82
    ']);
    
    -0.82
    "]));
    -0.81
    POSITIVE LOGITS
    Климат
    0.65
     Lastly
    0.60
    LabelTagHelper
    0.57
     الحره
    0.56
     Furthermore
    0.56
    Furthermore
    0.55
    Lastly
    0.54
     tropicales
    0.47
     Dolom
    0.46
    至於
    0.46
    Act Density 0.165%

    No Known Activations