    NEGATIVE LOGITS
    Token               Logit
    "transQ"            -0.79
    (blank)             -0.76
    " Lightboxes"       -0.74
    " ویکی‌پدی"          -0.68
    "UrlResolution"     -0.67
    "__':"              -0.64
    " snippetHide"      -0.64
    " Silurian"         -0.63
    " Efq"              -0.61
    " Theſe"            -0.60
    POSITIVE LOGITS
    Token               Logit
    " breaks"           0.70
    " break"            0.61
    " Break"            0.57
    "break"             0.57
    "Break"             0.56
    " shares"           0.55
    " partag"           0.53
    " broke"            0.52
    " Breaks"           0.50
    " dives"            0.49
    Activation Density: 0.000%

    No Known Activations