INDEX
    Explanations

    expressions related to friendship and companionship

    New Auto-Interp
    Negative Logits
     friend
    -0.21
     friendships
    -0.21
    	friend
    -0.18
     friends
    -0.17
    friend
    -0.17
    izona
    -0.17
     Friend
    -0.16
     friendship
    -0.16
     antlr
    -0.15
     дÑĢÑĥж
    -0.15
    POSITIVE LOGITS
    rik
    0.16
     marg
    0.15
    (tags
    0.15
    tags
    0.14
    451
    0.14
    gz
    0.14
    าà¸ģ
    0.14
    ascar
    0.14
    øj
    0.14
    rick
    0.14
    Act Density 0.118%

    No Known Activations