INDEX
    Explanations

    phrases indicating a lack of effectiveness or substance in arguments or discussions

    New Auto-Interp
    Negative Logits
    ViewFeatures
    -0.64
    曖昧さ回避
    -0.48
    GroupLayout
    -0.46
    تری
    -0.46
    μφ
    -0.45
     phá
    -0.45
    Yep
    -0.44
     Parlement
    -0.43
     Yep
    -0.43
     desto
    -0.42
    POSITIVE LOGITS
     alone
    1.19
    だけでは
    1.07
    alone
    0.96
     allein
    0.91
     Alone
    0.89
     alleine
    0.89
     ALONE
    0.85
     insufficient
    0.84
     alene
    0.81
     meaningless
    0.78
    Act Density 0.469%

    No Known Activations