Aug 1, 2023
·
1 min read
Financial data pipeline based on Flink + scheduled tasks.
Key Capabilities:
- Flink CDC real-time data sync (MySQL binlog / HBase change logs)
- Flink SQL streaming ETL: real-time cleaning, aggregation, windowed computation
- Minute-level K-line synthesis, buy/sell pressure ratio, anomaly trade detection
- Checkpoint mechanism ensuring Exactly-Once semantics
- Offline ETL: Airflow + Crontab scheduling, automated daily/monthly reports

Authors
Solo Founder
12+ years of team management experience leading 30+ person teams, with a peak of 60+ technical staff; comprehensive experience in corporate strategy, cost control, and talent development; 80% of time spent on frontline execution;6+ years of CEO experience in Internet finance, building and operating teams from zero to one.
15+ years of full-stack Internet software development (Java, Python, .NET, VUE, JQ) with 3+ years in Rust & Go;10+ years of data solution and architecture experience based on Hadoop, Flink, MySQL, Oracle, SQL Server, MongoDB.
Huge ETH Staking management, digital asset custody, and hundreds of millions in credit fund management experience.