    Explanations

    patent numbers

    Negative Logits
    Logit  Token
    -0.07  Transformer
    -0.07   fuera
    -0.07  \View
    -0.07  _SRC
    -0.07   vistas
    -0.07   كانوا
    -0.07   manière
    -0.07  _indicator
    -0.06  _inter
    -0.06
    Positive Logits
    Logit  Token
    0.08  Europe
    0.08
    0.07
    0.07  Carthy
    0.07   aftermath
    0.07   brewed
    0.06   France
    0.06  	copy
    0.06   justice
    0.06  	freopen
    Activation Density: 0.005%

    No Known Activations