I love learning from others and exploring the implementation details other database engines provide. Percona Live has no shortage of new databases, tools, and discussions on interesting new features or ways of tackling old problems. I went through the agenda and wanted to highlight a few new databases and tools (new to me or new to everyone) that have me interested!
- Sachin Sinha, author of BangDB, is presenting “Convergence of Different Dimensions within BangDB – A High-Performance Modern NoSQL Database”. Talk about interesting implementations! Let me leave you with a snippet from the talk abstract:
“The native integration at the buffer pool or IO layer will give the user full control of every single byte being ingested and processed by the system, which will reduce the latency to allow high-speed precision processing. Further siloed (semi siloed) architecture forces too many network hops along with too many copies of data. In this scenario, even with a very high processing efficiency, low latency (or high speed) is not possible with this architecture. We need to minimize network hops and copy of data as much as possible. With convergence, we minimize both the network hops and data copy, thereby improving the performance.”
- Jim Tommaney returns to Percona Live, this year talking about DuckDB. His talk is “DuckDB: Embedded Analytics with Parallel/Vector/Columnar Performance”. Robert Hodges and I talked a week ago about the analytics track, and he said this was the talk he was most interested in hearing. Looking forward to it.
- “openGauss: A Fast Growing Open Source RDBMS Community” from Zhenyu Zheng, Xinyong Xiang, and Bo Zhao promises an interesting look inside building a community at scale.
- Justin Swanhart returns to Percona Live with a new project: “WarpSQL – a distribution of MySQL 8 with columnar storage, bitmap indexing, and parallel query execution”. Justin is a long-time community contributor and speaker. His passion for analytics and bridging the gap between MySQL and analytical data is well known.
- Of course, “What is OpenSearch?” presented by Kyle Davis is high on my list. If you have been living under a rock or on an island with no internet connection, you may not have seen all of the news around Elastic changing licenses and AWS forking Elasticsearch. Learning about the new project is a must for those who have used or are using Elastic.
- Super excited to see “Docstore – Uber’s Highly Scalable Distributed SQL Database” by Ovais Tariq & Himank Chaudhary. Many large companies end up building their own databases and/or enhancing open source. Some of the greatest database advances in the past 10 years came from companies thinking differently and solving their own problems with something new. This is very interesting to me.
- Tim Meehan & Dipti Borkar are giving an “Introduction to Presto: The SQL Engine for Data Platform Teams”. This is the overview talk and just the start of a deep dive into Presto as part of the Presto community track. If you are not familiar with Presto, this is a great starting point!
- Andy Pavlo has a talk entitled “OtterTune: Using Machine Learning to Automatically Optimize Database Configurations”. The abstract and the discussion are something I am very interested in. What if there were a tool or a way to analyze historical data with ML and autotune your configurations?
- Interested in Apache Druid? Rachel Pedreschi is delivering a talk entitled “I’ve Got a Fever and the Only Prescription is Apache Druid”. Real-time analytical data is a hot topic, so this is a talk that should not disappoint.