INDEX
    Explanations

    references to statements and quotes made by individuals

    New Auto-Interp
    Negative Logits
    [];
    
    -0.77
     الرياضيه
    -0.71
     [],
    
    -0.70
    }*/
    
    -0.68
    .*;
    
    -0.65
     виправивши
    -0.64
     ?>/
    -0.64
    "},
    
    -0.63
    "){
    
    -0.62
    ++
    
    -0.62
    POSITIVE LOGITS
     obé
    0.58
     said
    0.57
     sobbed
    0.53
     honte
    0.53
     replied
    0.53
    said
    0.53
    vocato
    0.52
    ضب
    0.52
    paksa
    0.50
    ništ
    0.49
    Act Density 0.045%

    No Known Activations