INDEX
    Explanations

    specific quantities and variations in data

    Negative Logits
    Token          Logit
    " of"          -0.51
    " to"          -0.48
    "wall"         -0.48
    "↵↵↵"          -0.47
    (blank)        -0.45
    " who"         -0.45
    "↵↵↵↵"         -0.45
    " L"           -0.45
    " with"        -0.44
    "Weblinks"     -0.44
    Positive Logits
    Token            Logit
    "AddTagHelper"   1.00
    "RenderAtEndOf"  0.98
    " cherchés"      0.95
    " Chwiliwch"     0.93
    "ьаж"            0.89
    "الحياه"         0.86
    "expandindo"     0.85
    " Савезне"       0.84
    " Italijanski"   0.83
    "ArrowToggle"    0.82
    Activation Density: 1.045%

    No Known Activations