INDEX
    Explanations

    legal appeals/academic papers

    New Auto-Interp
    Negative Logits
     appeal
    -1.27
     Appeal
    -1.12
     APPEAL
    -1.07
    appeal
    -0.98
    Appeal
    -0.93
     attention
    -0.83
     appealed
    -0.68
     APPEALS
    -0.67
     Attention
    -0.65
    +#+#
    -0.65
    POSITIVE LOGITS
     of
    0.54
     braccia
    0.52
     in
    0.52
     ervan
    0.49
     braccio
    0.49
     őket
    0.49
    šet
    0.47
     with
    0.46
     makl
    0.46
     costumi
    0.46
    Act Density 0.040%

    No Known Activations