KMI/MLDM1 Machine learning a Data mining 1: Cvičení 02

Čištění dat

Jupyter notebook (.pnynb soubor) použitý na cvičení
Dataset MovieLens

úkol 1

Vytvořte dataset (ve formátu CSV) obsahující 100 nejvíce hodnocených filmů od každého žánru. Pokud pro daný žánr není dostatek filmů uveďte všechny. U každého filmu uveďte název, rok, den, měsíc (zjištěné z atributu timestamp z posledního hodnocení filmu), žánry (true/false podle toho zda film je daného žánru či nikoliv) a průměrné hodnocení filmu.

Data cleaning

Jupyter notebook (.pnynb file) used during the exercise
MovieLens dataset

Task 1

Create a dataset (in CSV format) containing the 100 most-rated movies from each genre. List them all if there are not enough movies for a given genre. For each movie, list the title, year, day, month (determined from the timestamp attribute of the last movie rating), genres (true/false depending on whether the movie is of the given genre or not), and the average movie rating.