INDEX
    Explanations

    sentences that contain high activation values, indicating important or impactful statements

    New Auto-Interp
    Negative Logits
     te
    -0.45
    Newswire
    -0.45
    -0.43
    лев
    -0.43
    óa
    -0.42
    wj
    -0.42
    vania
    -0.42
    GV
    -0.41
    PostMapping
    -0.41
    mav
    -0.41
    POSITIVE LOGITS
    +#+#
    1.06
    tagHelperRunner
    0.93
     resourceCulture
    0.90
     مشين
    0.86
    writeFieldEnd
    0.82
    WebVitals
    0.82
     iconFacebook
    0.81
    (!__
    0.79
    SharedDtor
    0.78
     BoxFit
    0.77
    Act Density 0.251%

    No Known Activations