INDEX
    Explanations

    mentions of creative or artistic work

    New Auto-Interp
    Negative Logits
     Ukrain
    -0.98
    wcs
    -0.75
    kefeller
    -0.71
    Args
    -0.64
    ylon
    -0.63
     champagne
    -0.61
    idate
    -0.61
    angular
    -0.61
    rition
    -0.60
     Gord
    -0.60
    POSITIVE LOGITS
     ethic
    1.42
    flows
    1.37
    station
    1.24
    aday
    1.15
    manship
    1.10
    bench
    1.08
    horse
    0.98
    mates
    0.90
    forces
    0.89
    papers
    0.89
    Act Density 0.031%

    No Known Activations