INDEX
    Explanations

    proper nouns and specific references in various contexts

    Negative Logits
    "انيف"             -0.60
    "spesies"          -0.56
    "beta"             -0.53
    "Beta"             -0.53
    " LoggerFactory"   -0.52
    "IRUS"             -0.51
    "β"                -0.51
    "verwijspagina"    -0.49
    " Frankel"         -0.49
    " ErrIntOverflow"  -0.48
    Positive Logits
    " Roast"           0.84
    " Roberta"         0.81
    " ROB"             0.80
    " Rovers"          0.77
    " ro"              0.76
    " Robe"            0.76
    "ro"               0.76
    " Rosalie"         0.75
    "Ro"               0.73
    "RO"               0.73
    Activation Density 3.176%

    No Known Activations