INDEX
    Explanations

    Christmas-related terms

    terms related to rules or regulations

    Negative Logits
    enegger     -0.79
    ITNESS      -0.79
    POL         -0.71
    arella      -0.68
     Healer     -0.66
    chrom       -0.66
    ryption     -0.65
    MODE        -0.64
     GOODMAN    -0.63
     à¨         -0.63
    Positive Logits
    ule         0.92
    lette       0.87
    kas         0.86
    cules       0.85
    tta         0.83
    ttle        0.82
    ffe         0.81
    bum         0.81
    pee         0.80
    quet        0.80
    Activation Density: 0.016%

    No Known Activations