Rabislist

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

CONTACT

732-281-7122 / 732-423-8052

COMPUTERS-DATA-ANAYTICS

A data analyst is not into coding as much as a data scientist does, but he must have programming language knowledge but knowledge of statistical programming languages like R/ Python is expected. He must also have knowledge of data bases like SQL (structured query language) to perform data analysis. Though excel is not as powerful as R or Python, a knowledge on excel will help to work on smaller data sets which is the best option for start-ups. Data visualization and presentation is an integral part of the data analyst job, hence knowledge of tableau and business presentation tools like MS Excel, MS PowerPoint is also essential.

 

 

The data analyst course topics are:

 

Statistics

 

Data analytics with R

SAS

Tableau

 

The data analyst course syllabus in general would be:

 

Data and overview of data types

Introduction to Data Types

Numerical parameters to represent data

 

          a. Mean

          b. Mode

          c. Median

          d. Sensitivity

          e. Information Gain

          f.  Entropy

 

Statistical parameters to represent data

 

Probability

 

Uses of probability

Need of probability

Bayesian Inference

Density Concepts

Normal Distribution Curve

 

Statistics

Point Estimation

Confidence Margin

Hypothesis Testing

Levels of Hypothesis Testing

 

Data clustering

Association and Dependence

Causation and Correlation

Covariance

Simpson’s Paradox

Clustering Techniques

 

Data testing techniques

Parametric Test

Parametric Test Types

Non- Parametric Test

Experimental Designing

A/B testing

 

Regression modelling and Distributions

Probability distributions and regression analysis

Linear regression analysis and model

Normal distribution and binomial distribution

Logistic and Regression Techniques

Problem of Collinearity

WOE and IV

Residual Analysis

Heteroscedasticity

Homoscedasticity

 

Introduction to R programming

Various data types in R

Built-in functions in R

Subsetting methods

Data cleansing

Data inspection

Data import from excel and other spreadsheets and text files into R

Data import from SPSS, sas7bdat

Installation of packages for database import

Fundamentals of Web scrapping

Summarize data

 

Exploratory data analysis

Overview of EDA

Understand and implement EDA on various data sets

Understand the EDA functions

Work with box plots, segment plots etc.

 

Data visualization in R

Data visualization and graphical functions and representations in R

Plotting various graphs like tableplot, histogram, boxplot

Understand the working of Deducer, and R commander

Understand the graphical functions in R

Customize graphical parameters to improvise the plots

Overview to spatial plots

 

Data mining and clustering techniques

Introduction to data mining

Overview of machine learning

Machine learning algorithms – supervised and unsupervised

K-Means clustering technique

Association rule

Sentiment analysis

Predictive analysis techniques

Annova technique

 

Decision Trees and Random forest techniques

Concept of decision trees

Classification rules of decision trees

Understand the algorithms for creating decision trees

Create perfect decision trees

Concepts and features of decision trees

Working on Random forest

Entropy and gaining information

 

SAS

Understand the implementation of SAS and the use cases

Understand the different data types in SAS

Understand data step and procedural step

Understand the basic procedural steps

Overview of SAS GUI

Introduction to SAS Window and its contents

Use of formats and informats in SAS

Create instream SAS Data set

 

Process and Integrate with data        

Data import in SAS

Work with permanent and temporary data sets

Set and merge statements

Manipulate influx of data sets into SAS

 

Customize dataset

Program with SAS data sets

Simplify processing with SAS using conditional and iterative processing

Date and time functions

Numeric and character functions

SAS Arrays

SAS statistical procedures

 

Advanced statistical procedures with SAS

Overview of clustering

Concepts of hierarchical clustering

Concepts of Non-Hierarchical clustering(K-clustering)

Simple, multiple and logistic regression

 

Data optimization variables

Concepts of data optimization and optimization models

Introduction to ODS and its benefits

Generate rtf, pdf, html, and doc files

 

SQL

Work with select statements and other select statement clauses

Work with JOINS and UNIONS

Create new tables

CASE expression

 

SAS Macros

Overview and benefits of using SAS Macros

Macro variables

Macro code constituents

Positional parameters to Macros

 

Data visualization

Introduction to data visualization

Introduction to Tableau 10

Data blending and establishing a connection

 

Visual analytics

Manage metadata and extracts

Data granularity using marks card

Work with sort, filter and grouping (static and dynamic)

Graphic visualization

 

Dashboard, stories and Mapping

Introduction and creation of a dashboard layout

Introduction to story point

Introduction to maps and web mapping services

Design dashboard for devices

Interact using action

Create background images

Create polygon maps

Work with custom geocoding

 

Charts in Tableau

Work with Gantt, waterfall, pareto charts

Work with control charts, funnel charts, box and whisker’s plots

 

Integration of Tableau, R and Hadoop

Introduction to big data, R and Hadoop concepts

Integrate R with Hadoop

Integrate Tableau with R

Integrate visualization using Tableau