《Python数据分析基础》PDF下载

  • 购买积分:12 如何计算积分?
  • 作  者:Clinton W.Brownley著
  • 出 版 社:南京:东南大学出版社
  • 出版年份:2017
  • ISBN:9787564170004
  • 页数:326 页
图书介绍:如果你像7.5亿Excel用户中的许多人一样,想要对数据做更多的事情,比如对数百个文件重复类似的分析,或者将许多文件中的数据组合在一起进行分析,这本实用指南向雄心勃勃的非程序员说明了如何通过使用Python来自动化和扩展对不同格式数据的处理和分析。在作者Clinton Brownley 讲解 Python基础知识后,你将能够编写简单的脚本来处理电子表格以及数据库中的数据。你还将学习如何使用多个Python模块来解析文件、分组数据和生成统计数据。这一切都无需编程基础。

1.PythonBasics 1

How to Create a Python Script 1

Howto Run a Python Script 4

Useful Tips for Interacting with the Command Line 7

Python's Basic Building Blocks 11

Numbers 12

Strings 14

Regular Expressions and Pattern Matching 19

Dates 22

Lists 25

Tuples 31

Dictionaries 32

Control Flow 37

Reading a Text File 44

Create a Text File 44

Script and Input File in Same Location 47

Modem File-Reading Syntax 47

Reading Multiple Text Files with glob 48

Create Another Text File 49

Writingto a Text File 52

Add Code to first_script.Py 53

Writing to a Comma-Separated Values(CSV)File 55

print Statements 57

Chapter Exercises 58

2.Comma-SeparatedValues(CSV)Files 59

Base Python Versus pandas 61

Read andWrite a CSV File(Part 1) 62

How Basic String Parsing Can Fail 69

Read and Write a CSV File(Part 2) 70

Filter for Specific Rows 72

Value in Row Meets a Condition 73

Value in Row Is in a Set of Interest 75

Value in Row Matches a Pattern/Regular Expression 77

Select Specific Columns 79

Column Index Values 79

Column Headings 81

Select Contiguous Rows 83

Add a Header Row 86

Reading Multiple CSV Files 88

Count Number of Files and Number of Rows and Columns in Each File 90

Concatenate Data from Multiple Files 93

Sum and Average a Set ofValues per File 97

Chapter Exercises 100

3.Excel Files 101

Introspecting an Excel Workbook 104

Processing a Single Worksheet 109

Read and Write an Excel File 109

Filter for Specific Rows 113

Select Specific Columns 120

Reading All Worksheets in a Wbrkbook 124

Filter for Specific Rows Across All Worksheets 124

Select Specific Columns Across All Worksheets 127

Reading a Set of Worksheets in an Excel Workbook 129

Filter for Specific Rows Across a Set of Worksheets 129

Processing Multiple Workbooks 132

Count Number of Workbooks and Rows and Columns in Each Workbook 134

Concatenate Data from Multiple Workbooks 136

Sum and Average Values per Workbook and Worksheet 138

Chapter Exercises 142

4.Databases 143

Python's Built-in sqlite3 Module 145

Insert New Records into a Table 151

Update Records in a Table 156

MySQL Database 160

Insert New Records into a Table 165

Query a Table and Write Output to a CSV File 170

Update Records in a Table 172

Chapter Exercises 177

5.Applications 179

Find a Set of Items in a Large Collection of Files 179

Calculate a Statistic for Any Number of Categories from Data in a CSV File 192

Calculate Statistics for Any Number of Categories from Data in a Text File 204

Chapter Exercises 213

6.Figures and Plots 215

matplotlib 215

Bar Plot 216

Histogram 218

Line Plot 220

Scatter Plot 222

Box Plot 224

pandas 226

ggplot 228

seaborn 231

7.Descriptive Statistics and Modeling 239

Datasets 239

Wine Quality 239

Customer Churn 240

Wine Quality 241

Descriptive Statistics 241

Grouping,Histograms,and t-tests 243

Pairwise Relationships and Correlation 244

Linear Regression with Least-Squares Estimation 247

Interpreting Coefficients 249

Standardizing Independent Variables 249

Making Predictions 251

Customer Churn 252

Logistic Regression 255

Interpreting Coefficients 257

Making Predictions 259

8.Scheduling Scripts to Run Automatically 261

Task Scheduler(Windows) 261

The cron Utility(macOS and Unix) 270

Crontab File:One-Time Set-up 271

Adding Cron Jobs to the Crontab File 273

9.Where to Go from Here 277

Additional Standard Library Modules and Built-in Functions 278

Python Standard Library(PSL):A Few More Standard Modules 278

Built-in Functions 279

Python Package Index(PyPI):Additional Add-in Modules 280

NumPy 280

SciPy 286

Scikit-Learn 290

A Few Additional Add-in Packages 292

Additional Data Structures 293

Stacks 293

Queues 294

Graphs 294

Trees 295

Where to Go from Here 295

A.Download Instructions 299

B.Answers to Exercises 311

Bibliography 313

Index 315