create a unique dataset using web scraping, a technique for extracting data from websites. save the scraped data to a csv file, or, if the website has an api, extract the data by sending requests to it. (if the data is in a wikipedia table, pandas is enough; if it's on a javascript-driven website, use selenium.)
i like sports, so i extracted historical football data from several sports websites.
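
to make the scraping step concrete, here's a minimal sketch that turns an html table into csv text. it uses only the python standard library so it runs without a network connection; with pandas installed, `pandas.read_html` can do the table parsing for you. the sample html, team names, column names and the helper names `TableParser` and `html_table_to_csv` are all made up for illustration, not taken from any real website.

```python
import csv
import io
from html.parser import HTMLParser

# hypothetical sample page; a real script would download the html first
SAMPLE_HTML = """
<table>
  <tr><th>team</th><th>goals_scored</th><th>goals_conceded</th></tr>
  <tr><td>Team A</td><td>12</td><td>5</td></tr>
  <tr><td>Team B</td><td>7</td><td>9</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """collect the text of every <th>/<td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text pieces of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # only keep text that sits inside a table cell
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def html_table_to_csv(html):
    """parse the table cells in `html` and return them as csv text."""
    parser = TableParser()
    parser.feed(html)
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(parser.rows)
    return out.getvalue()


csv_text = html_table_to_csv(SAMPLE_HTML)
print(csv_text)
```

to keep the result, write `csv_text` to a file. for a javascript-driven site the html would have to come from a browser automation tool such as selenium, but the parsing step stays the same.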
after building the dataset, find a problem to solve. i aimed to predict football games using the number of goals scored and conceded by every team in the competition. (the point isn't to build the best model ever but to showcase all your ds skills by solving a real-world problem that doesn't have a solution on the internet.)
list all the ds skills needed to work on this project. here's what i had to learn before working on the project:
- learn web scraping with pandas
- learn the poisson distribution
- learn how to apply the poisson formula with python
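
as a rough illustration of the last two items, here's a minimal sketch of the poisson formula, P(k) = e^(-λ) · λ^k / k!, applied to a single match. it assumes each team's goals follow an independent poisson distribution, which is a simplification. the expected-goals values 1.6 and 1.1 are made up; in the project they would be derived from the goals scored and conceded in the dataset. the function names and the `max_goals` cutoff are also just illustrative choices.

```python
import math


def poisson_pmf(k, lam):
    """probability of exactly k goals when lam goals are expected."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def match_probabilities(home_lam, away_lam, max_goals=10):
    """return (home win, draw, away win) probabilities.

    assumes the two teams' goal counts are independent poisson variables
    and ignores scorelines with more than max_goals goals for either team.
    """
    home_win = draw = away_win = 0.0
    for home_goals in range(max_goals + 1):
        for away_goals in range(max_goals + 1):
            p = poisson_pmf(home_goals, home_lam) * poisson_pmf(away_goals, away_lam)
            if home_goals > away_goals:
                home_win += p
            elif home_goals == away_goals:
                draw += p
            else:
                away_win += p
    return home_win, draw, away_win


# hypothetical expected goals for the home and away team
home_win, draw, away_win = match_probabilities(1.6, 1.1)
print(f"home win: {home_win:.3f}, draw: {draw:.3f}, away win: {away_win:.3f}")
```

the three probabilities add up to just under 1 because scorelines above `max_goals` are cut off; with expected goals this low, the missing part is negligible.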
publish insights
- write an article.