Design and implementation of IELTS learning resources system based on the data crawler
Li, Guochen (2025)
Kandidaatintyö
Li, Guochen
2025
School of Engineering Science, Tietotekniikka
Kaikki oikeudet pidätetään.
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe2025053056418
https://urn.fi/URN:NBN:fi-fe2025053056418
Tiivistelmä
With the increasing demand for studying abroad and standardized language proficiency tests, IELTS (International English Language Testing System) has become one of the most widely recognized English exams globally. However, IELTS learning resources are often scattered across various websites and platforms, leading to issues such as inefficient retrieval, inconsistent formats, and delayed updates. Therefore, it is necessary to build a centralized and automated resource management system to improve access efficiency and user experience.
This thesis presents the design and implementation of an IELTS learning resource management system based on web crawling technology. The system is developed using the Spring Boot framework for the backend and Thymeleaf as the template engine for server-side rendering. MySQL is used for data persistence. The web crawler module, implemented with the Jsoup library, extracts structured data such as titles, categories, URLs, and summaries from designated educational websites. The system supports key functions including module-based browsing, keyword search, pagination, administrator login authentication, resource management, and manual data collection triggering.
Starting from the analysis of system requirements, this thesis elaborates on the choice of technologies, overall architecture, functional module design, database modeling, and implementation. Functionality testing and interface validation demonstrate that the system runs stably with good user experience. In summary, this system realizes an integrated “collection–management–presentation” workflow, providing practical value for organizing and utilizing IELTS learning resources more effectively.
This thesis presents the design and implementation of an IELTS learning resource management system based on web crawling technology. The system is developed using the Spring Boot framework for the backend and Thymeleaf as the template engine for server-side rendering. MySQL is used for data persistence. The web crawler module, implemented with the Jsoup library, extracts structured data such as titles, categories, URLs, and summaries from designated educational websites. The system supports key functions including module-based browsing, keyword search, pagination, administrator login authentication, resource management, and manual data collection triggering.
Starting from the analysis of system requirements, this thesis elaborates on the choice of technologies, overall architecture, functional module design, database modeling, and implementation. Functionality testing and interface validation demonstrate that the system runs stably with good user experience. In summary, this system realizes an integrated “collection–management–presentation” workflow, providing practical value for organizing and utilizing IELTS learning resources more effectively.