    Explanations

    phrases indicating uncertainty or ambiguity

    Uncertainty, speculation, or doubt

    Negative Logits
    Token               Logit
    "RenderAtEndOf"     -1.10
    " autorytatywna"    -1.05
    "ագրություններ"     -1.02
    " beginnetje"       -0.95
    " ModelRenderer"    -0.87
    " فريبيس"           -0.81
    "AndEndTag"         -0.81
    "Controllo"         -0.79
    " houſe"            -0.77
    " الرياضيه"         -0.77
    Positive Logits
    Token               Logit
    " unclear"          0.86
    " clear"            0.63
    " seems"            0.54
    "clear"             0.53
    " remains"          0.51
    " questionable"     0.49
    " unsure"           0.48
    " yet"              0.47
    " jelas"            0.47
    " Seems"            0.46
    Activation Density: 0.209%

    No Known Activations