INDEX
Explanations
connections and interactions within social contexts, particularly in conversations and relationships
Token after "with" referring to people
people and social connections
Negative Logits
дельник         -0.47
оно             -0.45
instancetype    -0.45
noastră         -0.45
ponses          -0.45
föruts          -0.44
vieles          -0.42
ByUserId        -0.42
obyvateľov      -0.42
wiedzy          -0.42
Positive Logits
strangers       1.32
others          1.15
coworkers       1.05
friends         1.04
neighbors       1.03
colleagues      0.99
peers           0.96
other           0.94
classmates      0.94
passers         0.93
Activations Density: 0.294%