INDEX
    Explanations

    HTML and table formatting tags

    New Auto-Interp
    Negative Logits
    Rohy
    -0.59
     Spencer
    -0.56
     ses
    -0.55
     Kenney
    -0.55
     Whitney
    -0.55
    anka
    -0.55
    URE
    -0.54
     Burke
    -0.54
     hende
    -0.54
     Gough
    -0.53
    POSITIVE LOGITS
    "],
    
    1.10
    }")
    
    1.03
     itſelf
    1.01
     Chwiliwch
    0.97
    */;
    0.96
    !")
    
    0.95
    "]
    
    0.94
    `),
    0.92
    "])
    
    0.92
    ]");
    0.91
    Act Density 0.083%

    No Known Activations