INDEX
    Explanations

    References to the name "Helder" and its variations, as well as contexts related to hell

    Negative Logits
    " Italijanski"      -0.57
    "dafx"              -0.54
    " procéder"         -0.51
    "LabelTagHelper"    -0.50
    "opportunities"     -0.49
    " PROCEED"          -0.49
    "jenost"            -0.49
    "bamos"             -0.48
    "gación"            -0.48
    "脚注の使い方"        -0.47
    Positive Logits
    " Hel"              0.68
    " hell"             0.62
    " Hell"             0.60
    "Hell"              0.58
    "Hel"               0.57
    " hel"              0.50
    "ViewFeatures"      0.50
    " HELL"             0.49
    "Hentet"            0.44
    "hell"              0.43
    Act Density 0.194%

    No Known Activations