INDEX
    Explanations

    phrases related to praising or criticizing specific actions or behaviors

    New Auto-Interp
    Negative Logits
     tricot
    -0.79
     suscep
    -0.75
     vespa
    -0.74
     cushi
    -0.72
     cabrio
    -0.70
     thermomix
    -0.69
     bordeaux
    -0.68
     teflon
    -0.65
     tdci
    -0.62
     lpg
    -0.61
    POSITIVE LOGITS
    Manbalar
    0.86
     Minang
    0.82
     Banjar
    0.81
     Palembang
    0.81
     Karang
    0.79
     Lampung
    0.78
     Banten
    0.69
     Jambi
    0.68
     Pekan
    0.68
     Muhamma
    0.68
    Act Density 0.283%

    No Known Activations