Introduction
This comprehensive course will be your guide to learning how to use
the power of Python to analyze big data, create beautiful
visualizations, and use powerful machine learning algorithms. This
course is designed for both beginners with basic programming
experience or experienced developers looking to make the jump to Data
Science and big data Analysis.
Duration
5 Days
Course Objective
o Research Design
o Python for Data Science and Machine
o Spark for Big Data Analysis
o Implement Machine Learning Algorithms
o Numbly for Numerical Data
o Pandas for Data Analysis
o Matplotlib for Python Plotting
o Seaborn for statistical plots
o Interactive dynamic visualizations
o SciKit-Learn for Machine Learning Tasks
o K-Means Clustering, Logistic Regression and Linear Regression
o Random Forest and Decision Trees
o Natural Language Processing and Spam Filters
o Neural Networks
o Support Vector Machines
o Research report writing
Who Should Attend?
This is a general course targeting participants with elementary
knowledge of Statistics from Agriculture, Economics, Food Security and
Livelihoods, Nutrition, Education, Medical or public health
professionals among others who already have some statistical
knowledge, but wish to be conversant with the concepts and
applications of statistical modeling using Python
Course content
Module1: Basic statistical terms and concepts
o Introduction to statistical concepts
o Descriptive Statistics
o Inferential statistics
Module 2: Research Design
o The role and purpose of research design
o Types of research designs
o The research process
o Which method to choose?
o Exercise: Identify a project of choice and developing a research
design
Module 3: Survey Planning, Implementation and Completion
o Types of surveys
o The survey process
o Survey design
o Methods of survey sampling
o Determining the Sample size
o Planning a survey
o Conducting the survey
o After the survey
o Exercise: Planning for a survey based on the research design
selected
Module 4: Introduction to Phython
o Course Intro
o Setup
o Installation Setup and Overview
o IDEs and Course Resources
o iPython/Jupyter Notebook Overview
Module 5: Learning Numpy
o Intro to numpy
o Creating arrays
o Using arrays and scalars
o Indexing Arrays
o Array Transposition
o Universal Array Function
o Array Processing
o Array Input and Output
Module 6: Intro to Pandas
o DataFrames
o Index objects
o Reindex
o Drop Entry
o Selecting Entries
o Data Alignment
o Rank and Sort
o Summary Statistics
o Missing Data
o Index Hierarchy
Module 7: Working with Data
o Reading and Writing Text Files
o JSON with Python
o HTML with Python
o Microsoft Excel files with Python
o Merge and Merge on Index
o Concatenate and Combining DataFrames
o Reshaping, Pivoting and Duplicates in Data Frames
o Mapping,Replace,Rename Index,Binning,Outliers and Permutation
o GroupBy on DataFrames
o GroupBy on Dict and Series
o Splitting Applying and Combining
o Cross Tabulation
Module 8: Big Data and Spark with Python
o Welcome to the Big Data Section!
o Big Data Overview
o Spark Overview
o Local Spark Set-Up
o AWS Account Set-Up
o Quick Note on AWS Security
o EC2 Instance Set-Up
o SSH with Mac or Linux
o PySpark Setup
o Lambda Expressions Review
o Introduction to Spark and Python
o RDD Transformations and Actions
Module 9: Data Visualization
o Installing Seaborn
o Histograms
o Kernel Density Estimate Plots
o Combining Plot Styles
o Box and Violin Plots
o Regression Plots
o Heatmaps and Clustered Matrices
Module 10: Data Analysis
o Linear Regression
o Support Vector
o Decision Trees and Random Forests
o Natural Language Processing
o Discrete Uniform Distribution
o Continuous Uniform Distribution
o Binomial Distribution
o Poisson Distribution
o Normal Distribution
o Sampling Techniques
o T-Distribution
o Hypothesis Testing and Confidence Intervals
o Chi Square Test and Distribution
Module 11: Report writing for surveys, data dissemination, demand and
use
o Writing a report from survey data
o Communication and dissemination strategy
o Context of Decision Making
o Improving data use in decision making
o Culture Change and Change Management
o Preparing a report for the survey, a communication and dissemination
plan and a demand and use strategy.
o Presentations and joint action planning
Training Approach
This course will be delivered by our skilled trainers who have vast
knowledge and experience as expert professionals in the fields. The
course is taught in English and through a mix of theory, practical
activities, group discussion and case studies. Course manuals and
additional training materials will be provided to the participants
upon completion of the training.
Tailor-Made Course
This COURSE CAN ALSO BE TAILOR-made to meet organization requirement.
For further inquiries, please contact us on: Email:
training@upskilldevelopment.com Tel: +254 721 331 808
Training Venue
The training will be held at our Upskill Training Centre. We also
offer training for a group (at a discount of 10% to 50%) at requested
location all over the world. The course fee covers the course tuition,
training materials, two break refreshments, and buffet lunch.
Visa application, travel expenses, airport transfers, dinners,
accommodation, insurance, and other personal expenses are catered by
the participant
Certification
Participants will be issued with Upskill certificate upon completion
of this course.
business
courses
workshop
71
Views
19/11/2022 Last update