    Explanations

    phrases that indicate possession or association

    Negative Logits
    "cela"         -0.16
    " Bent"        -0.16
    "unks"         -0.15
    "obo"          -0.14
    "nyder"        -0.14
    "occan"        -0.14
    "assi"         -0.14
    "/native"      -0.14
    "asaki"        -0.13
    "abaj"         -0.13

    Positive Logits
    "venues"        0.15
    "oice"          0.15
    " importance"   0.14
    "ibu"           0.14
    " Shields"      0.14
    ".mass"         0.14
    "venue"         0.13
    " concern"      0.13
    "elia"          0.13
    " bile"         0.13
    Activation Density: 0.298%

    No Known Activations