pandas Exercises, Part 2

Pandas are so cute that I'll try to include a few more adorable photos, haha.

The previous post covered some very basic pandas exercises. Starting with this one, things will gradually get more complex, which is a sensible way to learn.

The exercises in this series come from a GitHub repository:

guipsamora/pandas_exercises (github.com)

Today we continue from the previous post with Filtering and Sorting Data.

Ex2 - Filtering and Sorting Data

This time we are going to pull data directly from the internet.
Step 1. Import the necessary libraries

# The first few steps are very basic
import pandas as pd
import numpy as np

Step 2. Import the dataset from this address.

euro12 = pd.read_csv('https://raw.githubusercontent.com/jokecamp/FootballData/master/UEFA_European_Championship/Euro%202012/Euro%202012%20stats%20TEAM.csv')

Step 3. Assign it to a variable called euro12.
euro12.head()
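
If the URL is unreachable, the same read_csv call can be tried on in-memory text, since read_csv accepts a path, a URL, or a file-like object. This is only a minimal sketch: the variable names csv_text and sample are my own, and the three rows are copied from the discipline output shown later in this post, not from the full file.

```python
import io
import pandas as pd

# Three rows retyped from the discipline table in this post (not the full dataset).
csv_text = """Team,Yellow Cards,Red Cards
Greece,9,1
Italy,16,0
Poland,7,1
"""

# StringIO wraps the text so read_csv can treat it like an open file.
sample = pd.read_csv(io.StringIO(csv_text))

print(sample.head())
print(sample.dtypes)
```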

Step 4. Select only the Goals column.

euro12.Goals

Step 5. How many teams participated in the Euro2012?

# number of distinct team names
euro12.Team.nunique()
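
There are a few ways to count entries in a column, and they differ slightly in what they count. A small sketch, assuming a Series named teams (my own name) that holds the team names retyped from the discipline output in this post:

```python
import pandas as pd

# Team names retyped from the discipline table in this post.
teams = pd.Series(
    ["Croatia", "Czech Republic", "Denmark", "England", "France", "Germany",
     "Greece", "Italy", "Netherlands", "Poland", "Portugal",
     "Republic of Ireland", "Russia", "Spain", "Sweden", "Ukraine"],
    name="Team",
)

n_rows = len(teams)          # every row, including missing values
n_non_null = teams.count()   # rows whose value is not missing
n_distinct = teams.nunique() # distinct non-missing values

print(n_rows, n_non_null, n_distinct)
```

Here all three agree because every team appears once and nothing is missing.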

Step 6. What is the number of columns in the dataset?
euro12.shape[1]

Step 7. View only the columns Team, Yellow Cards and Red Cards and assign them to a dataframe called discipline

discipline = euro12[['Team', 'Yellow Cards', 'Red Cards']]
discipline
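
A note on the brackets: a single column label returns a Series, while a list of labels returns a DataFrame. Attribute access such as euro12.Goals only works when the column name is a valid Python identifier, so a name with a space like Yellow Cards needs brackets. A minimal sketch on a three-row frame (df, one_col and sub are illustrative names, and the rows are copied from the output shown in this post):

```python
import pandas as pd

# Three rows retyped from the discipline table in this post.
df = pd.DataFrame({
    "Team": ["Greece", "Italy", "Poland"],
    "Yellow Cards": [9, 16, 7],
    "Red Cards": [1, 0, 1],
})

one_col = df["Yellow Cards"]        # single label -> Series
sub = df[["Team", "Yellow Cards"]]  # list of labels -> DataFrame

print(type(one_col).__name__)
print(type(sub).__name__)
```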

Step 8. Sort the teams by Red Cards, then by Yellow Cards

discipline.sort_values(['Red Cards', 'Yellow Cards'], ascending=False)
# output

                   Team  Yellow Cards  Red Cards
6                Greece             9          1
9                Poland             7          1
11  Republic of Ireland             6          1
7                 Italy            16          0
10             Portugal            12          0
13                Spain            11          0
0               Croatia             9          0
1        Czech Republic             7          0
14               Sweden             7          0
4                France             6          0
12               Russia             6          0
3               England             5          0
8           Netherlands             5          0
15              Ukraine             5          0
2               Denmark             4          0
5               Germany             4          0
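
To sort by Red Cards first and break ties with Yellow Cards, the list passed to sort_values should name Red Cards first. The sketch below rebuilds the discipline frame by hand from the values printed in this post (so it runs without the download) and also shows that ascending can take one flag per column. The names by_red and mixed are my own.

```python
import pandas as pd

# Values retyped from the discipline output in this post, in original index order.
discipline = pd.DataFrame({
    "Team": ["Croatia", "Czech Republic", "Denmark", "England", "France",
             "Germany", "Greece", "Italy", "Netherlands", "Poland", "Portugal",
             "Republic of Ireland", "Russia", "Spain", "Sweden", "Ukraine"],
    "Yellow Cards": [9, 7, 4, 5, 6, 4, 9, 16, 5, 7, 12, 6, 6, 11, 7, 5],
    "Red Cards": [0, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 1, 0, 0, 0, 0],
})

# Primary key: Red Cards; ties are broken by Yellow Cards. Both descending.
by_red = discipline.sort_values(["Red Cards", "Yellow Cards"], ascending=False)

# One flag per column: Red Cards descending, Yellow Cards ascending.
mixed = discipline.sort_values(["Red Cards", "Yellow Cards"],
                               ascending=[False, True])

print(by_red.head())
print(mixed.head())
```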

Step 9. Calculate the mean Yellow Cards given per Team

discipline['Yellow Cards'].mean()
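
As a sanity check, the mean is just the sum of the column divided by the number of teams. A small sketch using the Yellow Cards values retyped from the output in this post (the Series name yellow is my own):

```python
import pandas as pd

# Yellow Cards values retyped from the discipline output in this post.
yellow = pd.Series([9, 7, 4, 5, 6, 4, 9, 16, 5, 7, 12, 6, 6, 11, 7, 5],
                   name="Yellow Cards")

total = yellow.sum()         # 119 cards in total
mean_cards = yellow.mean()   # total divided by the 16 teams

print(total, len(yellow), mean_cards)
```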

Step 10. Filter teams that scored more than 6 goals

euro12[euro12.Goals>6]
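
The expression inside the brackets is a boolean Series, and indexing with it keeps only the rows where it is True. Since the Goals values are not printed in this post, the sketch below applies the same idea to Yellow Cards instead, on five rows copied from the discipline output. The names df, mask and both are illustrative.

```python
import pandas as pd

# Five rows retyped from the discipline table in this post.
df = pd.DataFrame({
    "Team": ["Italy", "Greece", "Poland", "France", "Germany"],
    "Yellow Cards": [16, 9, 7, 6, 4],
    "Red Cards": [0, 1, 1, 0, 0],
})

# A comparison on a column yields one True/False per row.
mask = df["Yellow Cards"] > 6
many_yellows = df[mask]

# Conditions combine with & (and) or | (or); each needs its own parentheses.
both = df[(df["Yellow Cards"] > 6) & (df["Red Cards"] > 0)]

print(many_yellows)
print(both)
```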

Step 11. Select the teams that start with G

euro12[euro12.Team.str.startswith('G')]
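
The .str accessor applies a string method to every element and returns a boolean Series that can be used as a filter. A minimal sketch on the team names retyped from this post; passing na=False is my own addition, so that a missing name would count as no match rather than as a missing value in the mask.

```python
import pandas as pd

# Team names retyped from the discipline table in this post.
teams = pd.Series(
    ["Croatia", "Czech Republic", "Denmark", "England", "France", "Germany",
     "Greece", "Italy", "Netherlands", "Poland", "Portugal",
     "Republic of Ireland", "Russia", "Spain", "Sweden", "Ukraine"],
    name="Team",
)

# True where the name begins with "G"; na=False treats missing names as no match.
starts_with_g = teams.str.startswith("G", na=False)
g_teams = teams[starts_with_g]

print(g_teams)
```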

Step 12. Select the first 7 columns

# Get the first 7 columns
# using iloc slicing
euro12.iloc[:,0:7]

Step 13. Select all columns except the last 3.
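# Again, use iloc to slice the columns
# The task excludes the last three columns, so slice from position 0 up to (but not including) the third from last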
euro12.iloc[:,:-3]
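
To see what the two column slices keep, here is a sketch on a made-up frame with ten columns named c0 to c9 (the frame toy and its column names are hypothetical, not from the dataset). With ten columns, taking the first seven and dropping the last three happen to select the same columns.

```python
import numpy as np
import pandas as pd

# Hypothetical frame: 2 rows, 10 columns named c0..c9.
cols = [f"c{i}" for i in range(10)]
toy = pd.DataFrame(np.arange(20).reshape(2, 10), columns=cols)

# iloc is position-based: rows first, then columns; the end of a slice is excluded.
first_seven = toy.iloc[:, :7]          # positions 0..6
without_last_three = toy.iloc[:, :-3]  # everything except the last 3 positions

print(list(first_seven.columns))
print(list(without_last_three.columns))
```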

Step 14. Present only the Shooting Accuracy from England, Italy and Russia

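# The previous two steps used iloc to slice columns; this step uses loc to select rows
# The task asks for exactly England, Italy and Russia, so isin is a natural way to build the row filter
euro12.loc[euro12.Team.isin(['England', 'Italy', 'Russia']), ['Team', 'Shooting Accuracy']]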
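
loc takes a row selector and a column selector together, and isin builds the row selector by checking each value against a list. The Shooting Accuracy values are not printed in this post, so the sketch below uses Yellow Cards instead, on rows copied from the discipline output. The names df, wanted and picked are illustrative.

```python
import pandas as pd

# Six rows retyped from the discipline table in this post.
df = pd.DataFrame({
    "Team": ["England", "France", "Germany", "Italy", "Russia", "Spain"],
    "Yellow Cards": [5, 6, 4, 16, 6, 11],
    "Red Cards": [0, 0, 0, 0, 0, 0],
})

# isin returns True for rows whose Team is in the list.
wanted = df["Team"].isin(["England", "Italy", "Russia"])

# loc[rows, columns]: boolean mask for rows, list of labels for columns.
picked = df.loc[wanted, ["Team", "Yellow Cards"]]

print(picked)
```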