## Think Stats

Author: Allen B. Downey

Publisher: "O'Reilly Media, Inc."

ISBN: 1491907371

Category: Computers

Page: 226

View: 8963

If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python. By working with a single case study throughout this thoroughly revised book, you’ll learn the entire process of exploratory data analysis—from collecting data and generating statistics to identifying patterns and testing hypotheses. You’ll explore distributions, rules of probability, visualization, and many other tools and concepts. New chapters on regression, time series analysis, survival analysis, and analytic methods will enrich your discoveries. Develop an understanding of probability and statistics by writing and testing code Run experiments to test statistical behavior, such as generating samples from several distributions Use simulations to understand concepts that are hard to grasp mathematically Import data from most sources with Python, rather than rely on data that’s cleaned and formatted for statistics tools Use statistical inference to answer questions about real-world data
Release

## Hands-On Data Analysis with Pandas

Efficiently perform data collection, wrangling, analysis, and visualization using Python

Author: Stefanie Molin

Publisher: Packt Publishing Ltd

ISBN: 1789612802

Category: Computers

Page: 716

View: 1152

Get to grips with pandas—a versatile and high-performance Python library for data manipulation, analysis, and discovery Key Features Perform efficient data analysis and manipulation tasks using pandas Apply pandas to different real-world domains using step-by-step demonstrations Get accustomed to using pandas as an effective data exploration tool Book Description Data analysis has become a necessary skill in a variety of positions where knowing how to work with data and extract insights can generate significant value. Hands-On Data Analysis with Pandas will show you how to analyze your data, get started with machine learning, and work effectively with Python libraries often used for data science, such as pandas, NumPy, matplotlib, seaborn, and scikit-learn. Using real-world datasets, you will learn how to use the powerful pandas library to perform data wrangling to reshape, clean, and aggregate your data. Then, you will learn how to conduct exploratory data analysis by calculating summary statistics and visualizing the data to find patterns. In the concluding chapters, you will explore some applications of anomaly detection, regression, clustering, and classification, using scikit-learn, to make predictions based on past data. By the end of this book, you will be equipped with the skills you need to use pandas to ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets. What you will learn Understand how data analysts and scientists gather and analyze data Perform data analysis and data wrangling in Python Combine, group, and aggregate data from multiple sources Create data visualizations with pandas, matplotlib, and seaborn Apply machine learning (ML) algorithms to identify patterns and make predictions Use Python data science libraries to analyze real-world datasets Use pandas to solve common data representation and analysis problems Build Python scripts, modules, and packages for reusable analysis code Who this book is for This book is for data analysts, data science beginners, and Python developers who want to explore each stage of data analysis and scientific computing using a wide range of datasets. You will also find this book useful if you are a data scientist who is looking to implement pandas in machine learning. Working knowledge of Python programming language will be beneficial.
Release

## Mathematics for Machine Learning

Author: Marc Peter Deisenroth,A. Aldo Faisal,Cheng Soon Ong

Publisher: Cambridge University Press

ISBN: 1108569323

Category: Computers

Page: N.A

View: 9083

The fundamental mathematical tools needed to understand machine learning include linear algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability and statistics. These topics are traditionally taught in disparate courses, making it hard for data science or computer science students, or professionals, to efficiently learn the mathematics. This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites. It uses these concepts to derive four central machine learning methods: linear regression, principal component analysis, Gaussian mixture models and support vector machines. For students and others with a mathematical background, these derivations provide a starting point to machine learning texts. For those learning the mathematics for the first time, the methods help build intuition and practical experience with applying mathematical concepts. Every chapter includes worked examples and exercises to test understanding. Programming tutorials are offered on the book's web site.
Release

## Basic Statistics and Data Analysis

Author: Larry J. Kitchens

Publisher: Brooks/Cole

ISBN: 9780534384654

Category: Mathematics

Page: 637

View: 2471

With an emphasis on exploratory data analysis, BASIC STATISTICS AND DATA ANALYSIS teaches students to identify trends in their data that will help them ask the right questions. Rather than leading students through operations on data, this modern textbook stresses hands-on experience with more than 200 real data sets and approximately 1000 exercises in the book. This new text, a basic version of Larry Kitchens' groundbreaking text, EXPLORING STATISTICS, develops students' statistical intuition and nurtures the development of a statistical way of thinking. The author has shaped this text specifically for the elementary statistics course, leaving out the more advanced topics from his previous book. MINITAB(tm) is the main statistical analysis software utilized in the text.
Release

## Exploratory Data Analysis in Business and Economics

An Introduction Using SPSS, Stata, and Excel

Author: Thomas Cleff

Publisher: Springer Science & Business Media

ISBN: 3319015176

Page: 215

View: 2839

In a world in which we are constantly surrounded by data, figures, and statistics, it is imperative to understand and to be able to use quantitative methods. Statistical models and methods are among the most important tools in economic analysis, decision-making and business planning. This textbook, “Exploratory Data Analysis in Business and Economics”, aims to familiarise students of economics and business as well as practitioners in firms with the basic principles, techniques, and applications of descriptive statistics and data analysis. Drawing on practical examples from business settings, it demonstrates the basic descriptive methods of univariate and bivariate analysis. The textbook covers a range of subject matter, from data collection and scaling to the presentation and univariate analysis of quantitative data, and also includes analytic procedures for assessing bivariate relationships. It does not confine itself to presenting descriptive statistics, but also addresses the use of computer programmes such as Excel, SPSS, and STATA, thus treating all of the topics typically covered in a university course on descriptive statistics. The German edition of this textbook is one of the “bestsellers” on the German market for literature in statistics.
Release

## Exploratory Data Analysis with MATLAB, Second Edition

Author: Wendy L. Martinez,Angel R. Martinez,Angel Martinez,Jeffrey Solka

Publisher: CRC Press

ISBN: 1439812217

Page: 536

View: 6210

Since the publication of the bestselling first edition, many advances have been made in exploratory data analysis (EDA). Covering innovative approaches for dimensionality reduction, clustering, and visualization, Exploratory Data Analysis with MATLAB®, Second Edition uses numerous examples and applications to show how the methods are used in practice. New to the Second Edition Discussions of nonnegative matrix factorization, linear discriminant analysis, curvilinear component analysis, independent component analysis, and smoothing splines An expanded set of methods for estimating the intrinsic dimensionality of a data set Several clustering methods, including probabilistic latent semantic analysis and spectral-based clustering Additional visualization methods, such as a rangefinder boxplot, scatterplots with marginal histograms, biplots, and a new method called Andrews’ images Instructions on a free MATLAB GUI toolbox for EDA Like its predecessor, this edition continues to focus on using EDA methods, rather than theoretical aspects. The MATLAB codes for the examples, EDA toolboxes, data sets, and color versions of all figures are available for download at http://pi-sigma.info
Release

## Exploratory Data Analysis

Author: Frederick Hartwig,Brian E. Dearing

Publisher: SAGE

ISBN: 9780803913707

Category: Mathematics

Page: 83

View: 7991