INDEX
    Explanations

    references to social interactions and events involving people

    New Auto-Interp
    Negative Logits
    ENUM
    -0.15
    105
    -0.14
    onaut
    -0.13
    orrh
    -0.13
     foresee
    -0.13
    adlo
    -0.13
    iento
    -0.13
    Fly
    -0.12
    ãģ£ãģ±
    -0.12
    åĵŃ
    -0.12
    POSITIVE LOGITS
     introduced
    0.34
     introdu
    0.34
     introduce
    0.33
     introduction
    0.31
     introducing
    0.29
     Introduced
    0.29
     introduces
    0.28
     approach
    0.26
     greeting
    0.25
     shake
    0.24
    Act Density 0.425%

    No Known Activations