SKILL DEVELOPMENT PROGRAMME ON INTRODUCTION TO DATA SCIENCE WITH HANDS ON SESSION

18-11-2020 to 20-11-2020

CHIEF GUEST: Mr. M. ARAVINTH, MCA Alumni (2014-2016 Batch), Research Scholar, University of Adelaide, Australia

The Department of Computer Applications (MCA) conducted a Skill Development Programme on Introduction to Data Science with Hands-on Session from 18-11-2020 to 20-11-2020. The sessions were handled by Mr. M. ARAVINTH, MCA Alumni (2014-2016 Batch), Research Scholar, University of Adelaide, Australia.

Dr. J. Dhilipan, Vice Principal (Academic), Faculty of Science and Humanities, and HOD/MCA, presented the welcome address, in which he welcomed the chief guest. He noted that the department has been conducting skill development programmes aimed at elevating the skills of students through hands-on sessions. He added that data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data. Data science is related to data mining, machine learning and big data.

DAY 1 REPORT: 18-11-2020

Mr. M. ARAVINTH, MCA Alumni (2014-2016 Batch), Research Scholar, University of Adelaide, Australia, began his address by outlining career growth opportunities and the role of data science.

He then addressed the question of what data science is.

He then discussed:

• The data sets

• The Ames Housing dataset: this dataset, which describes the sale of individual residential properties in Ames, Iowa from 2006 to 2010, was compiled by Dean De Cock for use in data science education. It was modelled on the Boston Housing dataset and is now considered a more modernized and expanded version of it.

• How to log in to kaggle.com and download a data set

He further discussed how to load the data set in Python and continued with the hands-on session.

• Index

• Scenario of Indian IT industry
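The step of loading a downloaded data set in Python, mentioned above, can be sketched as follows. This is a minimal sketch using only Python's standard library; the report does not record which tools were used in the session. The inline sample, its column names, and its values are made up for illustration and are not real Ames Housing records, and `train.csv` in the comment is an assumed file name.

```python
import csv
import io

# Hypothetical stand-in for a CSV file downloaded from kaggle.com.
# Column names and values are illustrative only, not real records.
SAMPLE_CSV = """Id,Neighborhood,Bedrooms,LivingArea,SalePrice
1,North,3,1500,200000
2,South,2,900,110000
3,North,4,2000,260000
"""

def load_rows(handle):
    """Read a CSV file object into a list of dicts keyed by column name."""
    return list(csv.DictReader(handle))

# With a real download this might be: open("train.csv", newline="")
rows = load_rows(io.StringIO(SAMPLE_CSV))

print(len(rows), "rows loaded")
print("columns:", list(rows[0].keys()))
print("first sale price:", rows[0]["SalePrice"])
```

Note that `csv.DictReader` returns every value as a string, so numeric columns need converting before any calculation.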

DAY 2 REPORT: 19-11-2020

He discussed:

• Categorical values

• Discrete values

• Continuous values

• How to obtain the data set after cleaning
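The three kinds of values listed above can be told apart with a rough heuristic, sketched below. This is an illustrative assumption, not the method used in the session: the function name `classify_column` and the sample columns are hypothetical.

```python
def classify_column(values):
    """Roughly label a column of strings as categorical, discrete, or continuous."""
    try:
        numbers = [float(v) for v in values]
    except ValueError:
        return "categorical"   # at least one value is not numeric
    if all(n.is_integer() for n in numbers):
        return "discrete"      # whole-number counts
    return "continuous"        # numeric with fractional parts

# Made-up sample columns, as strings, the way a CSV reader would return them.
columns = {
    "neighbourhood": ["North", "South", "North"],
    "bedrooms": ["3", "2", "4"],
    "lot_area_acres": ["0.25", "0.4", "0.33"],
}

labels = {name: classify_column(vals) for name, vals in columns.items()}
print(labels)
```

The heuristic is deliberately simple: categories coded as integers, or measurements that happen to be whole numbers, would be mislabelled, so in practice the column's meaning has to be checked as well.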

He further discussed how to calculate a linear regression, using ground value price as the example, with a hands-on exercise on calculating the values.
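As an illustration of the regression calculation, here is a minimal sketch of simple (one-variable) linear regression by ordinary least squares. The data points are made up, and the report does not say which formula or library was used in the hands-on exercise.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Made-up data: area in square feet and price in thousands.
areas = [1000.0, 1500.0, 2000.0, 2500.0]
prices = [100.0, 150.0, 200.0, 250.0]

slope, intercept = fit_line(areas, prices)
predicted = slope * 1800.0 + intercept  # predicted price for an 1800 sq ft plot

print(f"slope={slope:.3f}, intercept={intercept:.3f}, predicted={predicted:.1f}")
```

Because the sample points lie exactly on a line, the fitted line passes through all of them; real price data would leave residual error.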

• In addition, he highlighted overfitting and underfitting in data science: overfitting means that the model depends too heavily on the training data and so generalizes poorly to new data, while underfitting means that the model fails to capture the relationship even in the training data. Ideally, neither should be present in a model, but both are usually hard to eliminate.
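The contrast between the two failure modes can be seen by comparing error on training data with error on held-out data. The sketch below uses two deliberately extreme toy models chosen for illustration (they are assumptions, not models from the session): one that always predicts the training mean, and one that memorises the training points. All data values are made up.

```python
import math

def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def mean_model(train_x, train_y):
    """Underfits: ignores x and always predicts the training mean."""
    mean_y = sum(train_y) / len(train_y)
    return lambda x: mean_y

def memorising_model(train_x, train_y):
    """Overfits: returns the y of the closest training x (1-nearest-neighbour)."""
    pairs = list(zip(train_x, train_y))
    return lambda x: min(pairs, key=lambda pair: abs(pair[0] - x))[1]

# Made-up data: roughly y = 2x, with noise in the training targets.
train_x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
train_y = [2.5, 3.6, 6.4, 7.7, 10.3, 11.8]
test_x = [1.5, 2.5, 3.5, 4.5, 5.5]
test_y = [3.0, 5.0, 7.0, 9.0, 11.0]

results = {}
for name, build in [("mean", mean_model), ("memorising", memorising_model)]:
    model = build(train_x, train_y)
    train_error = rmse(train_y, [model(x) for x in train_x])
    test_error = rmse(test_y, [model(x) for x in test_x])
    results[name] = (train_error, test_error)
    print(f"{name}: train RMSE {train_error:.2f}, test RMSE {test_error:.2f}")
```

The memorising model reproduces the training targets exactly yet still errs on unseen points, which is the signature of overfitting; the mean model has a large error on both sets, which is the signature of underfitting.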

DAY 3 REPORT: 20-11-2020

• What is a good RMSE value? There is no absolute good or bad threshold; it has to be judged against the scale of the dependent variable (DV). For data that ranges from 0 to 1000, an RMSE of 0.7 is small, but if the range is 0 to 1, it is no longer small.

• How to find the regression values

• What courses should be studied for data science

Basics

Probability and statistics

  – discrete random variables

  – continuous random variables

  – bivariate random variables

Linear algebra

Differential calculus

Integral calculus
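Returning to the Day 3 point about RMSE: the sketch below computes an RMSE and then expresses an error of 0.7 as a fraction of the data range, for the two ranges mentioned in the session. Dividing by the range is one common normalisation, used here as an assumption; the helper name `rmse_relative_to_range` and the sample values are hypothetical.

```python
import math

def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

def rmse_relative_to_range(error, low, high):
    """Express an RMSE as a fraction of the range of the dependent variable."""
    return error / (high - low)

# Made-up example: the errors are 1, 0 and -2, so the RMSE is sqrt(5/3).
example = rmse([3.0, 5.0, 7.0], [2.0, 5.0, 9.0])

# The same RMSE of 0.7 judged against the two ranges from the session.
wide = rmse_relative_to_range(0.7, 0.0, 1000.0)   # a tiny share of the range
narrow = rmse_relative_to_range(0.7, 0.0, 1.0)    # most of the range

print(f"example RMSE: {example:.3f}")
print(f"share of range 0-1000: {wide:.4%}")
print(f"share of range 0-1: {narrow:.0%}")
```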

The programme was attended by around 100 participants. The presentation was followed by a Q&A session. The event was well coordinated by faculty member Mr. D. RAJKUMAR, AP/MCA.