Бакалавриат
2023/2024
Python для анализа данных
Лучший по критерию «Полезность курса для расширения кругозора и разностороннего развития»
Лучший по критерию «Новизна полученных знаний»
Статус:
Курс обязательный (Международный бакалавриат по бизнесу и экономике)
Направление:
38.03.01. Экономика
Кто читает:
Департамент менеджмента
Где читается:
Санкт-Петербургская школа экономики и менеджмента
Когда читается:
2-й курс, 1, 2 модуль
Формат изучения:
с онлайн-курсом
Онлайн-часы:
20
Охват аудитории:
для всех кампусов НИУ ВШЭ
Преподаватели:
Волкова Юлия Михайловна,
Ефимова Полина Романовна,
Ефимов Константин Дмитриевич,
Митранкова Мария Михайловна,
Нырка Оксана Викторовна,
Терников Андрей Александрович
Язык:
английский
Кредиты:
4
Контактные часы:
56
Course Syllabus
Abstract
Python is an interpreted high-level general-purpose programming language. It has a set of powerful libraries for data analysis. It is a simple language for beginners to learn, though it is powerful enough for writing large applications. This 2-module course is an introduction to the Python programming language and data science. The average time to complete this course depends on student background. To complete the course, students are supposed to have mathematical skills at the high school level. Students’ academic performance is evaluated using programming assignments: homework and classwork. Also there is one mid-semester exam and final exam. The examples and problems used in this course cover such areas as text processing, HTML and data analytics. This course does not provide lectures and students must finish corresponding week of recommended online course before seminar class.
Learning Objectives
- Teach students how to create basic scripts, understand data types, statements and logical expressions; create own functions and use libraries.
- Collect, store, process and analyze data automatically with the use of scripting languages.
- Can identify the data needed for addressing the financial and business objectives.
Expected Learning Outcomes
- Student can create scripts for data analysis
- Student can explain basic principles of Python programming language
- Student can read and understand simple scripts.
Course Contents
- Basic of Python programming
- Boolean data type and IF conditions
- WHILE loops
- Lists and FOR loops
- Dictionaries and Methods
- Nested data structures. Sorting
- Functions
- Text files, tables, JSON
- Scraping: collection of links from website
- Additional chapters: re
- Additional chapters: pandas
Assessment Elements
- Graded SeminarGraded Seminar cannot be retaken regardless of the reason for absence. Graded Seminar tasks covers all topics from seminars (in particular focusing on text files manipulations, web scraping, and regular expressions). Graded Seminar is organized during the seminar class in offline mode. Graded Seminar tasks are allowed to be completed in groups of students. The seminar assignment has its own deadline (the late submissions after this provided deadline are prohibited and not graded - such the assignment cannot be retaken). The final grade for the Graded Seminar is calculated on the basis of relative scales (based on all solutions from the course participants) and cannot be more than 10. All team-members gets the same grade.
- HomeworkHomework cannot be retaken regardless of the reason for absence. The maximum grade for the Homework is 10, including tasks that check an outstanding student performance.
- Mid-termMid-term covers all topics from the Syllabus (the first module). The length is 60 minutes. The maximum grade is 10.
- ExamExam is not blocking. Exam covers all topics from the Syllabus. The length is 60 minutes. The final maximum number of points for the Exam is not more than 10. Exam format is paper-based. Exam is open-book (only printed materials are allowed; any use of electronic materials and devices are prohibited during the Exam).
Interim Assessment
- 2023/2024 2nd module0.4 * Exam + 0.15 * Graded Seminar + 0.25 * Homework + 0.2 * Mid-term
Bibliography
Recommended Core Bibliography
- 9781491912140 - Vanderplas, Jacob T. - Python Data Science Handbook : Essential Tools for Working with Data - 2016 - O'Reilly Media - https://search.ebscohost.com/login.aspx?direct=true&db=nlebk&AN=1425081 - nlebk - 1425081
Recommended Additional Bibliography
- 9781785284571 - Romano, Fabrizio - Learning Python - 2015 - Packt Publishing - http://search.ebscohost.com/login.aspx?direct=true&db=nlebk&AN=1133614 - nlebk - 1133614