INDEX
    Explanations

    the verb "are" in various contexts

    Negative Logits
     is
    -1.07
     was
    -0.97
    -0.83
     has
    -0.76
    ↵↵
    -0.72
     can
    -0.71
     will
    -0.68
    .
    -0.68
    ,
    -0.68
     "
    -0.66
    Positive Logits
     CreateTagHelper
    1.45
     صوتيه
    1.28
     виправивши
    1.25
    tagHelperRunner
    1.23
    Autoritní
    1.20
     pinulongan
    1.20
     متعلقه
    1.18
    InjectAttribute
    1.15
    1.11
     مرئيه
    1.09
    Act Density 0.321%

    No Known Activations