INDEX
    Explanations
    Negative Logits
    <bos>             -1.24
    AnchorTagHelper   -0.62
    قایناقلار         -0.61
    قایناق‌لار        -0.58
    InitVars          -0.58
    fortawesome       -0.56
    ThroughAttribute  -0.55
    fromnode          -0.55
    adaptiveStyles    -0.53
    msgTypes          -0.53
    Positive Logits
     compen    1.28
     milano    1.27
     guarante  1.24
     ?...      1.23
     nutr      1.20
     unden     1.20
     igno      1.18
     meis      1.16
     ritard    1.16
     !...      1.14
    Activation Density: 0.168%

    No Known Activations