INDEX
    Explanations

    references to authors and their works in academic literature

    New Auto-Interp
    Negative Logits
    .scalablytyped
    -0.22
    agma
    -0.16
    rette
    -0.15
    .Guna
    -0.15
    untu
    -0.15
    iece
    -0.14
    ertiary
    -0.14
    holm
    -0.14
    manship
    -0.14
    idden
    -0.14
    POSITIVE LOGITS
    ascus
    0.15
     second
    0.14
     Second
    0.14
     Horny
    0.14
    uster
    0.14
    owler
    0.14
     SV
    0.14
     personnel
    0.14
     rendered
    0.13
     Stern
    0.13
    Act Density 0.049%

    No Known Activations