INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Paglinawan
    -1.09
    tagHelperRunner
    -1.06
    ьаж
    -0.99
    )))
    
    -0.97
    ']")
    -0.96
     ''}
    -0.92
    -0.92
    )');
    -0.91
    المناصب
    -0.91
    '},
    
    -0.91
    POSITIVE LOGITS
     pleaſure
    0.81
     Monfieur
    0.69
     purpoſe
    0.67
     ſmall
    0.62
     avoient
    0.60
    ↵↵
    0.60
     étoient
    0.60
     auroit
    0.59
     faſt
    0.59
     ſte
    0.59
    Act Density 0.849%

    No Known Activations