Data Science
Course Description
Data Science is the study of the generalizable extraction of knowledge from data. Being a data scientist requires an integrated skill set spanning mathematics, statistics, machine learning, databases
and other branches of computer science along with a good understanding of the craft of problem formulation to engineer e ective solutions. This course will introduce students to this rapidly growing eld and equip them with some of its basic principles and tools as well as its general mindset. Students will learn concepts, techniques and tools they need to deal with various facets of data
science practice, including data collection and integration, exploratory data analysis, predictive modeling, descriptive modeling, data product creation, evaluation, and e ective communication. The
focus in the treatment of these topics will be on breadth, rather than depth, and emphasis will be placed on integration and synthesis of concepts and their application to solving problems. To make
the learning contextual, real datasets from a variety of disciplines will be used
Overview
Python Programming language is powerful open source language. It is developed with data science tool and which is used to simplify and easily access the data and store the data easily. By R Programming language we can easily manipulate the data, also it can help in the analysis of Data, and we can create the wonderful visualization and helps to access the high-quality content. This Data Science with Python Training provides you to learn data manipulation and cleaning of data using python.
Objectives of the Course
• Complete basics of Data Science
• Basics of R Programming
• Understand the concepts of Big Data and Eco systems.
• Understand the usage and how to use the tools like a tableau, map-reduce…
Pre-requisites of the Course
• Any IT experienced Professional who are interested to build their career in development/ data scientist.
• Any B.E/ B.Tech/ BSC/ MCA/ M.Sc Computers/ M.Tech/ BCA/ BCom College Students in any stream.
• Fresh Graduates.
Who can attend this course
The course can learn by any IT professional having basic knowledge of:
• Mathematics
• Statistics
• Any Programming Language
Course Content:-
Python Introduction
R Programming
Environment Set-Up
Python for Data Analysis-NumPy, Pandas
Big Data ( HDFS, Mapreduce, and Eco Systems)
Data Visualization-Matplotlib & Seaborn
1st Month
Python Introduction
R Programming
Environment Set-Up
2nd Month
Python for Data Analysis-NumPy, Pandas
Big Data (HDFS, MapReduce, and Eco Systems)
3rd Month
Big Data (HDFS, MapReduce, and Eco Systems)
Data Visualization-Matplotlib & Seaborn