INDEX
    Explanations

    phrases related to introductions

    instances of introductions and self-introductions

    New Auto-Interp
    Negative Logits
     outs
    -0.72
    / 
    -0.64
    usercontent
    -0.64
     expire
    -0.62
    RELATED
    -0.62
     exceeds
    -0.61
    adelphia
    -0.61
     accrued
    -0.61
    storage
    -0.60
     nails
    -0.60
    POSITIVE LOGITS
    ãĥīãĥ©ãĤ´ãĥ³
    0.87
    akeru
    0.75
     new
    0.74
     concepts
    0.73
    äºĶ
    0.70
    )=(
    0.68
    icans
    0.68
     Rw
    0.65
    etus
    0.65
    ãĥīãĥ©
    0.64
    Act Density 0.134%

    No Known Activations