About Me
Hello! I’m Jian (Kevin) Xu, a software engineer with 7+ years of experience in distributed systems, big data platforms, and cloud technologies at Amazon.
Amazon Global Selling | Software Engineer | 2021.8 – Present
- Led a team of 4 people to deliver a seller contact service system
- Enabled CI/CD for code pipeline deployment, including integration test coverage tool and rollback mechanism
Amazon Business Data Technologies | Software Engineer | 2019.1 - 2021.8
- Metadata Sync Platform: Designed and implemented metadata sync platform from AWS Glue to Andes data lake
- Sync partition-level metadata, up to 30,000 partitions per day for one table
- Implemented a library to do schema conversion from Hive or Spark to SDL
- Data Discovery: Designed and implemented a data lake table level annotation system for improved data discovery
- Permission Management: Built a new data lake permission management system, unifying permission solutions for tables in different dataplanes
- Resource Catalog: Implemented resource data catalog prototype using AWS Neptune
- ETL Platform: Maintained an ETL platform with 200 EMR clusters
Amazon Music | Software Engineer | 2017.5 - 2019.1
- Music Playlist Service: Development and maintenance of music playlist services
- Changed playlist sync from push to pull, reducing TPS by 90%
- Implemented 2-level cache using Memcached, reducing API latency by 80%
Technical Skills
- Programming Languages: Java, Scala
- Cloud Platforms: Amazon Web Services (AWS)
- Databases: MySQL, NoSQL (DynamoDB), Graph database (Neptune)
- Specializations: Big data platform development, data pipeline construction, distributed system design and implementation
Interests
Software Development • Open Source Projects • Web Technologies • Machine Learning • Big Data & Distributed Systems • Cloud Technologies • Tech Community
This blog is where I share my learnings in technology and software development.