Software Engineer at Snowflake & MSCS at UIUC
Bellevue, Washington WA 98004 US
- Worked with Prof. Tiark Rompf on building Flare (OSDI’ 18, SIGMOD’ 18, & Spark + AI Summit 2018) - an accelerator for Apache Spark with native compilation on Scala front-end and support for distributed data processing, data pipelining and streaming.
- Use Lightweight Modular Staging (LMS) framework to add distributing data processing support with Message Passing Interface (MPI) and outperform Apache Spark by up tp 10 times on standalone and computation-intensive workloads like TPC-H.
- Designed and developed GraphView, a middleware and DLL library for key/value databases within one of Microsoft’s core Azure product - CosmosDB, in collaboration with senior researchers, the Intelligent Cloud and Edge Group and the Azure team.
- Built the testing framework based on YCSB & TPC-C in order to benchmark GraphView and DBx1000 (SIGMOD’ 16), with optimistic concurrency control (OCC) algorithms.
University of Illinois at Urbana-Champaign
The Hong Kong Polytechnic University
Published by IDEALS - University of Illinois, 2020
Published by arXiv, 2019
Yuanjing Shi, Zhaoxing Li
Published by IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2018
Zhaoyan Shen, Yuanjing Shi, Zili Shao, Yong Guan
Published by 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), 2018
Yuanjing Shi, Zhaoyan Shen, Zili Shao
Over 20,000 lines C/C++, Java, Scala, and Python
LLVM, TVM, and LMS
Data Intensive Systems
Apache Spark and Hadoop MapReduce & HDFS
Deep Learning Frameworks
TensorFlow/Keras, PyTorch, ONNX, and TVM