1. C++ Implementation of memcpy 
 Please provide a simple implementation of the memcpy function in C++. Discuss any assumptions and your approach.
2. Thesis discussion and follow-up questions 
 Your thesis is related to CUDA. Could you briefly introduce your thesis and answer a couple of follow-up questions?