How to Become a Data Scientist - Amazon S3 · reading, storing and manipulating large quantities of...
set of laws
Framework
share with you a structured learning plan that you can start using right away after this webinar to accelerate your path into data science.
skills experience you need tools
make you Irresistible to all data science Employers
EVERYONE will want YOU
YES CHAT
• You are in High demand
• You make a lot of money ($110,000 Median Base Salary)
• You get to work on interesting and challenging problems
• You will be able to have a massive impact on the places where you work (and others)
• You have High Security (which we know is rare nowadays).
The "PHD" System
P - Program
H - Hack
D - Deliver
Statistical Analysis
P - Program
H - Hack
D - Deliver
Machine Learning
• Cool Tip #1:
• Commercial Software for Data Analysis is:
• Expensive:
• Inconvenient:
• Limiting:
1) Coding Saves Money
2) You Can Do Whatever You Want: (You’re not restricted by the software’s capabilities)
3) Very Convenient: (You’ll be able to work Anywhere & for Anyone!)
• Remember:
YES CHAT
only 5 Families
Each Algorithm, is just ONE OPTION.
YOU DON’T NEED THEM ALL!
Machine Learning
Do these group together?
Is this type A or B? or..etc?
Is this unusual?
How can I make this simpler?
How much, or how many?
Machine Learning
Clustering Algorithms
Classification Algorithms
Anomaly Detection Algorithms
Dimensionality Reduction Algorithms
Regression Algorithms
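As a rough illustration of how the five questions map onto the five families, here is a minimal sketch using scikit-learn. The data is synthetic, and the specific algorithms (KMeans, LogisticRegression, LocalOutlierFactor, PCA, LinearRegression) are assumptions chosen for the example. Each one is just ONE OPTION from its family.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.linear_model import LinearRegression, LogisticRegression
from sklearn.neighbors import LocalOutlierFactor

rng = np.random.default_rng(0)

# Synthetic data (illustration only): two well-separated blobs of 2-D points.
blob_a = rng.normal(loc=0.0, scale=0.5, size=(50, 2))
blob_b = rng.normal(loc=5.0, scale=0.5, size=(50, 2))
X = np.vstack([blob_a, blob_b])
labels = np.array([0] * 50 + [1] * 50)  # 0 = type A, 1 = type B

# 1) Clustering: "Do these group together?"
clusters = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# 2) Classification: "Is this type A or B?"
classifier = LogisticRegression().fit(X, labels)
predicted_type = classifier.predict([[5.0, 5.0]])

# 3) Anomaly detection: "Is this unusual?"
#    predict() returns -1 for an outlier and 1 for an inlier.
detector = LocalOutlierFactor(n_neighbors=20, novelty=True).fit(X)
is_unusual = detector.predict([[50.0, -50.0]])

# 4) Dimensionality reduction: "How can I make this simpler?"
X_simpler = PCA(n_components=1).fit_transform(X)

# 5) Regression: "How much, or how many?"
hours = np.arange(10, dtype=float).reshape(-1, 1)
sales = 3.0 * hours.ravel() + 2.0  # made-up linear relationship
regressor = LinearRegression().fit(hours, sales)
forecast = regressor.predict([[10.0]])
```

Notice that every family follows the same fit-then-predict (or fit-then-transform) pattern, which is why swapping one algorithm for another within a family is usually a one-line change.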
But no-one ever tells you!
most important
Step-by-Step System
The 6 Step Statistical Analysis Process
1) Determine Study’s properties
2) Set your significance Level
3) Investigate Data’s properties
4) Determine the appropriate statistical test
5) Run the test
6) Make a Conclusion to answer the study’s question
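The six steps above can be sketched in code. This is a minimal illustration, not the course's own method: the study question, the synthetic data, and the choice between Welch's t-test and the Mann-Whitney U test are all assumptions made for the example.

```python
import numpy as np
from scipy import stats

# Step 1: determine the study's properties.
# Hypothetical question: do two independent groups differ in average score?
rng = np.random.default_rng(42)
group_a = rng.normal(loc=50.0, scale=5.0, size=40)  # synthetic data
group_b = rng.normal(loc=60.0, scale=5.0, size=40)  # synthetic data

# Step 2: set the significance level before looking at any results.
alpha = 0.05

# Step 3: investigate the data's properties (here: normality of each group).
_, p_normal_a = stats.shapiro(group_a)
_, p_normal_b = stats.shapiro(group_b)
looks_normal = p_normal_a > alpha and p_normal_b > alpha

# Steps 4 and 5: pick the test that fits those properties, then run it.
if looks_normal:
    test_name = "Welch's t-test"
    _, p_value = stats.ttest_ind(group_a, group_b, equal_var=False)
else:
    test_name = "Mann-Whitney U test"
    _, p_value = stats.mannwhitneyu(group_a, group_b, alternative="two-sided")

# Step 6: make a conclusion that answers the study's question.
reject_null = bool(p_value < alpha)
verdict = "differ" if reject_null else "do not clearly differ"
print(f"{test_name}: p = {p_value:.3g}, so the groups {verdict} at alpha = {alpha}")
```

The important design choice is the order: the significance level and the data's properties are settled before the test is chosen, so the test is picked by the data and not by the result you were hoping for.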
WHAT WHEN HOW
Structured Learning Process
YES CHAT
learning to code
not using Specialised Commercial Software?
Or
The Great Debate:
• First Appeared in August 1993 thanks to Ross Ihaka and Robert Gentleman
• R is a language used specifically for Statistical Analysis and Data Science.
Pros:
• It has a strong community
• Built by statisticians
Cons:
• Steep Learning Curve
• Standalone Application (less flexible)
• Becoming less popular
• First Appeared in February 1991 thanks to Guido van Rossum.
• Python is a general purpose programming language.
Pros:
• Very easy to learn and use
• Cleaner code
• Can do pretty much anything.
• Allows easy data science integration with web apps and production databases.
Cons:
• Not as Specialised.
easiest flexible brighter future
• “Pandas” is a Python library that is designed to make reading, storing and manipulating large quantities of data super easy.
• Pandas is incredibly powerful, and can easily read and write data to and from spreadsheet or CSV files.
• Pandas is your “backbone” for data analysis of all kinds.
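Here is a small sketch of that read, manipulate, write loop. The column names and figures are made up for illustration, and an in-memory text buffer stands in for a real CSV file so the example runs anywhere.

```python
import io

import pandas as pd

# Made-up CSV content; in practice this would be a file on disk.
csv_text = """name,department,salary
Ana,Research,120000
Ben,Research,100000
Cara,Sales,90000
"""

# Read: pd.read_csv accepts a file path or any file-like object.
df = pd.read_csv(io.StringIO(csv_text))

# Manipulate: add a derived column and summarise by group.
df["salary_k"] = df["salary"] / 1000
mean_by_department = df.groupby("department")["salary"].mean()

# Write: to_csv accepts a file path or a file-like object too.
buffer = io.StringIO()
mean_by_department.to_csv(buffer)
print(buffer.getvalue())
```

Swapping `io.StringIO(csv_text)` for a path such as `"salaries.csv"` (a hypothetical file name) is all it takes to work with real files.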
• Scipy is a module that provides many scientific computing functions for Python.
• scipy.stats is the submodule where a large collection of statistical tests and probability distributions is stored.
• Scipy will become your go-to tool for doing any data science ESPECIALLY statistical analysis.
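To give a feel for how little code a statistical test takes, here is a minimal sketch using `scipy.stats.pearsonr`. The numbers are invented for the example, and the correlation test is just one of the many tests in the module.

```python
from scipy import stats

# Invented data: hours studied and the exam score each student achieved.
hours_studied = [1, 2, 3, 4, 5, 6, 7, 8]
exam_score = [52, 55, 61, 64, 70, 72, 79, 83]

# pearsonr returns the correlation coefficient and a two-sided p-value.
r, p_value = stats.pearsonr(hours_studied, exam_score)
print(f"r = {r:.3f}, p = {p_value:.3g}")
```

Most tests in `scipy.stats` follow this shape: pass in your samples, get back a statistic and a p-value.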
• Scikit-learn is Python’s one-stop shop for machine learning!
• It is incredibly popular and covers all 5 families of algorithms:
1) Clustering
2) Classification
3) Regression
4) Anomaly Detection
5) Dimensionality Reduction
And MUCH More!
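A typical scikit-learn workflow looks something like the sketch below. It uses the iris dataset that ships with the library; the choice of logistic regression and of a 75/25 split are assumptions for the example, not recommendations.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Load a small built-in dataset: flower measurements and species labels.
X, y = load_iris(return_X_y=True)

# Hold back a quarter of the data to check the model on unseen examples.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Fit on the training data, then score on the held-back data.
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Accuracy on held-back data: {accuracy:.2f}")
```

Holding back test data matters because a model scored on the same rows it learned from will look better than it really is.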
Module       | Purpose                                  | Documentation
Pandas       | Data Storing Powerhouse                  | http://pandas.pydata.org/pandas-docs/stable/
Scipy        | Scientific Functions for Python          | https://docs.scipy.org/doc/
Scikit-Learn | Machine Learning For Python              | http://scikit-learn.org/stable/documentation.html
Numpy        | Precursor to Pandas                      | https://docs.scipy.org/doc/numpy/reference/
statsmodels  | More Scientific Functions for Python     | http://statsmodels.sourceforge.net/stable/
matplotlib   | Allows you to plot graphs in Python      | http://matplotlib.org/contents.html
seaborn      | Makes it easier to plot graphs in Python | http://seaborn.pydata.org/
Other Useful Python Packages
Amazon
Picture Removed for Privacy Purposes
Great...
YOU YOUR circumstances.
Or
Law #4: The Law of Personalisation
Employment Freelance
• Employment is where you take a full (or part) time job at a company
Pros:
• Consistent Income
• Colleagues to Learn From
• Structured Career Progression
Cons:
• Less Control Over Your Future
• Less Freedom Over Your Day
• Relatively Fixed Income
Employment
Sites to Find Jobs
• Indeed.com
• Glassdoor.com
• Careerbuilder.com
• Dice.com
• Idealist.org (For internships)
• LinkedIn.com
• Monster.com
• Specific Company Websites
• Freelance Work is where you acquire clients for yourself and complete work for them on a project-by-project basis.
Pros:
• You’re the boss
• Choose who you work with
• Set your own standards
• Can Work Internationally using the Internet
Cons:
• Less Security
• Highly dependent on market fluctuations
• Competition on Freelance Platforms
Freelance
Sites to Find Freelance Work
• Upwork.com
• Fiverr.com
• Peopleperhour.com
• Freelancer.com
• Set up your own website
Even if you want to get an employed job, get on as many freelance sites as you can!
You can start pitching for jobs and projects and then add them to your CV!
This is actually how a lot of Data Scientists get started! :D
Super Ninja Tip!
So far, did we do such a good job with just part 1 that even if you left this room right now (but don’t), you could use some of our advice to accelerate your path into data science, starting right now?
Let me know in the chat by typing Yes (or no)
So let me ask you…
“I Love Data Science Launch! You guys make it all so easy! I thought to myself there is NO WAY it’s that Easy!
It’s amazing work you guys are doing and I can’t wait to start working with clients.”
Picture Removed for Privacy Purposes
2 Training Courses
How to analyse Data Using the 5 Families of Machine Learning Techniques.
The Step by Step Proven Process for Analysing Data Perfectly Every Time Using Statistics
• Our complete Step-by-Step System for statistical analysis.
• Designed to take you by the hand (as in step 1, step 2, step 3) to analyse data using statistical methods. No ambiguity or confusion.
What exactly is it?
• 4 part step-by-step digital course that walks you through the process of analysing data using statistical methods.
• Full 1080p HD Video with theory and practice lectures so nothing is left out (7 hours of valuable content)
What exactly is it?
• 5 Part course that walks you through each of the 5 families of Machine Learning Algorithms, showing you the most important algorithms from each!
• Practicals and Theory Lectures + Projects for you to try out!
$497 Value
Bonus #1
• Preparation Checklist + Roadmaps
The Statistical Analysis Preparation Checklist
• It has 8 Simple questions in it that you will answer using the videos in part 1 of the course.
• The answers to these questions determine the properties of your data.
• The properties of your data will inform which statistical test you need to run to analyse it.
• This guide will walk you through this process so you don’t miss anything, and it will match perfectly with the course.
The Test Selector Roadmaps
• Once you know your data’s properties you need to select the correct test.
• There are hundreds of statistical tests and memorising the use-case for all of them is a nightmare!
• So what you do instead is use the roadmaps!
• The roadmaps are sets of Yes or No questions about your data’s properties.
• “Does it have property X, Yes or No?”
• Use the checklist to answer them!
• This will show you the exact tests you need to run
• No need to memorise all the tests.
• Just use the roadmaps, watch the appropriate video and you’re done!
$497 Value
$297 Value
Bonus #2
• Code Swipes
$497 Value
$297 Value
$197
Bonus #3
• The Professional Report Template
$497 Value
$297 Value
$197
$97 Value
Bonus #4
• Epic Project Pack
• PROJECT BASED LEARNING
$497 Value
$297 Value
$197
$97 Value
$97 Value
Bonus #5
• The Perfect Cover Letter
$497 Value
$297 Value
$197
$97 Value
$97 Value
$97 Value
Bonus Training Course: The Python Bible
The Python Bible is Ziyad’s famous Python Course that teaches you the fundamentals of Python programming.
The course has over 15,000 students in 143 countries around the world.
Will take you from absolute NO programming experience (like ever) all the way to confidently writing your own programs
You will build 11 Python projects and be able to program confidently in Python by the end of the course EVEN IF you have never coded before IN YOUR LIFE.
$497 Value
$297 Value
$197
$97 Value
$97 Value
$97 Value
$197 Value
Total Value: $1,676
TYPE IT IN THE CHAT
$9167
$1,676
$497 Value
$297 Value
$197
$97 Value
$97 Value
$97 Value
$197 Value
$197 Value
Total Value: $1,676 - SAVE: $1,179
FINAL SALE PRICE (Public): $497
Save $200 TODAY on this Webinar
Special Today Only: $297
$497 Value
$297 Value
$197
$97 Value
$97 Value
$97 Value
$197 Value
Value: $1,676 - SAVE: $1,179
Normal Price: $497
This Event Only: Save: $200
Special Today Only: $297
Here is What to do now
Click Here
Scroll Down + Click Here
Step 3: Create an Account
Not Done Yet ;)
Step 5: Enjoy!
Inside the Course
want to work with You.
communication skills
amazing at analysing data
can’t explain the results
LOVE