Guang Liu
Technical Lead at BAAI's Data Research Team, directs the OpenSeek project
Guang Liu, Technical Lead at BAAI's Data Research Team, directs the OpenSeek project. He is the architect behind the Aquila large language model series and the Infinity dataset family. His current research centers on Agentic Data Systems, innovating synthetic data generation for next-generation AI training.
Topic
From Theory to Practice: Analysing the Development Process and Future Prospects of the Aquila Model
In this presentation, I will explore in depth the development process of the Aquila Large Scale Language Model (LLM). I will analyse the background of the development of the Aquila model, the problems it faced, and how we dealt with them, the concrete results of the practice, and its future development direction, from theory to practice. This process covers everything from acquiring and processing the corpus, optimising the model training process, to improving the model's effectiveness and accuracy. This is a typical real-world example showing how to apply and optimise large-scale language models in real projects. My experience will provide in-depth insight and reference value for people who want to learn more about large-scale language model development, rather than just staying at the theoretical level of simple concepts. I will also provide insights into the future of the Aquila model, including how we plan to improve its performance, accuracy, and broaden its application domains, as well as how we will continue to optimise the user experience in future developments.