INDEX
    Explanations

    references to various research institutes and affiliations

    New Auto-Interp
    Negative Logits
    )");
    
    -1.22
     Theſe
    -1.20
     myſelf
    -1.19
     Efq
    -1.19
     ainfi
    -1.15
     Eſ
    -1.12
     ―――――
    -1.10
     Jefus
    -1.10
     Monfieur
    -1.08
    ."));
    -1.07
    POSITIVE LOGITS
    0.75
     lateral
    0.64
    Ni
    0.61
     Met
    0.60
     Ni
    0.58
     in
    0.58
     la
    0.56
     l
    0.55
     f
    0.53
    M
    0.53
    Act Density 0.286%

    No Known Activations