INDEX
    Explanations

    words related to legal proceedings or documents

    punctuation and various forms of sentence structure

    New Auto-Interp
    Negative Logits
    tarian
    -0.77
    agonists
    -0.72
     superpower
    -0.71
     Rodham
    -0.69
    avorite
    -0.67
    IFIED
    -0.65
    ModLoader
    -0.64
     Methodist
    -0.64
    Magic
    -0.62
     cloning
    -0.62
    POSITIVE LOGITS
     Amen
    0.94
    liga
    0.93
     alle
    0.92
     Regist
    0.83
     qui
    0.83
     ja
    0.81
    please
    0.80
     lang
    0.80
     si
    0.78
    nda
    0.77
    Act Density 0.159%

    No Known Activations