INDEX
    Explanations

    phrases related to legal documents and government actions

    New Auto-Interp
    Negative Logits
     hairc
    -1.02
     intersper
    -0.80
     hentai
    -0.78
     indescri
    -0.77
     milf
    -0.77
     funko
    -0.76
     embodi
    -0.76
     amigurumi
    -0.73
     depic
    -0.73
     emphat
    -0.73
    POSITIVE LOGITS
     January
    0.54
    effective
    0.51
    Vidite
    0.49
    January
    0.49
     February
    0.48
     effective
    0.48
     accogli
    0.48
    Effective
    0.48
     kela
    0.47
    生效
    0.47
    Act Density 0.241%

    No Known Activations