    Negative Logits
    "jsxFileName"        -0.73
    "tagext"             -0.73
    "IUrlHelper"         -0.71
    "+#+#"               -0.70
    "saraba"             -0.66
    "σθαι"               -0.65
    " testSet"           -0.64
    " barbati"           -0.62
    " informée"          -0.61
    " tartalomajánló"    -0.61
    Positive Logits
    " much"              0.95
    "Much"               0.90
    "much"               0.89
    " Much"              0.82
    " MUCH"              0.71
    " mucho"             0.58
    " disponibilités"    0.51
    "enumi"              0.48
    " viel"              0.48
    " mucha"             0.47
    Activation Density: 0.001%

    No Known Activations