    Explanations

    attends to token patterns associated with health-related terms, from numerical or metadata tokens

    Head Attr Weights
    0:0.18
    1:0.40
    2:0.07
    3:0.06
    4:0.06
    5:0.04
    6:0.07
    7:0.08
    Negative Logits
    ArgsConstructor    -0.50
    RTDA               -0.49
    RegressionTest     -0.44
    chtenstein         -0.43
    AddTagHelper       -0.43
    verwijspagina      -0.42
    "]').              -0.41
     للاسماء    -0.39
    ParallelGroup      -0.38
    ]).                -0.38
    Positive Logits
     scienza     0.41
    formik       0.39
     חיצוניים    0.39
    مصادر    0.38
    padek        0.37
    IMPORTED     0.37
    multer       0.37
    onOptions    0.36
    jard         0.35
    sila         0.35
    Act Density 6.645%

    No Known Activations