攝影或3C

Python機器學習: 決策樹 (DecisionTreeClassifier) ; from sklearn.tree import DecisionTreeClassifier ; tree = DecisionTreeClassifier(criterion = “gini”) #criterion = “entropy” #criterion: 標準，準則

from sklearn.tree import DecisionTreeClassifier

前篇使用了

K-近鄰演算法(K Nearest Neighbor ,簡稱 KNN)

本篇要改用決策樹分類

處理的資料: https://pse.is/3ty6rk

部分資料:

import pandas as pd

from sklearn.neighbors import KNeighborsClassifier

from sklearn.model_selection import train_test_split

folder = “C:\Python\P107\doc”

fname = “student.csv”

import os

fpath = os.path.join(folder,fname)

# fpath = folder + “\\” + fname #同義

df = pd.read_csv(fpath,header=None,skiprows=[0])

df.to_excel(os.path.join(folder,”knsDF.xlsx”))

#df.index.size = 40 #df.columns.size = 5

X = df.drop([4],axis=1).values

y = df[4].values

Xtrain, Xtest, ytrain, ytest =\

train_test_split(X,y,test_size=0.25,

random_state= 42 ,shuffle =True)

XtestDF = pd.DataFrame(Xtest)

XtestDF.to_excel(os.path.join(folder,”knsXtest.xlsx”))

print(“Xtrain.shape:”,Xtrain.shape) #(30, 3)

print(“ytrain.shape:”,ytrain.shape) #(30,)

print(“Xtest.shape:”,Xtest.shape) #(10, 3)

print(“ytest.shape:”,ytest.shape) #(10,)

from sklearn.tree import DecisionTreeClassifier

for cri in [“gini”,”entropy”]:

tree = DecisionTreeClassifier(criterion = cri)

tree.fit(Xtrain,ytrain)

pred = tree.predict(Xtest)

pred2 = tree.predict_proba(Xtest)

print(“prediction:”,pred,”in case criterion”,cri )

print(“prediction proability:”,pred2,

“\nin case criterion”,cri )

howgood = tree.score(Xtest,ytest)

print(“Goodness:”,howgood,”in case ctrterion”,cri)

#前半部同KNN

後半部類似的語法,

主要改用 DecisionTreeClassifier

輸出結果:

就算有第0欄(index,非資料)干擾

決策樹的score仍比

K-近鄰演算法(KNN)去掉第0欄高

匯出決策樹:

改寫後半段程式碼:

from sklearn.tree import DecisionTreeClassifier

from sklearn.tree import export_text

for cri in [“gini”,”entropy”]:

tree = DecisionTreeClassifier(criterion = cri)

tree.fit(Xtrain,ytrain)

r = export_text(tree,feature_names =

[“item”,”English”,”Math”,”Chinese”],

show_weights=True)

print(r)

輸出結果:

criterion = “gini”

criterion = “entropy”

儲蓄保險王

儲蓄險是板主最喜愛的儲蓄工具,最喜愛的投資理財工具則是ETF,最喜愛的省錢工具則是信用卡

Next Python: 如何做矩陣乘法? numpy.matmul (ary1, ary2) 或 ary1 @ ary2 或 numpy.dot (ary1, ary2) »

Previous « Python: 字串(string)的函式.rfind() .replace() 切片與串接; 如何尋找直欄中,含有特定關鍵字的列數? pandas.Series.str.contains("Hz") ;如何將Series中的內容去掉首末的空格並小寫? pandas.Series .str.strip() .str.lower() #需要兩次.str

Python-docx 進階密技：突破限制，在 Word 任意位置插入段落的三種流派: insert_paragraph_before ; body.insert(0, p_elem) #像操作 List 一樣操作文件; target_xml_node.addnext(p_new)

在使用 python-docx...

2 天 ago

攝影或3C

Python-docx 進階解析：為什麼你的程式讀不到表格裡的文字？ —— 深入比較 doc.paragraphs、doc.tables 與底層 doc.element.body; from docx.text.paragraph import Paragraph #將 XML CT_P轉為 Paragraph 物件; from docx.table import Table #將 XML CT_Tbl轉為 Table 物件

在自動化處理 Word 文件時...

3 天 ago

攝影或3C

Python 網頁解析入門：BeautifulSoup 的 find vs select_one; find_all vs select ; Python 風格 vs CSS selector (支援 #id, .class, 層級選擇); from bs4 import BeautifulSoup

在使用 BeautifulSo...

4 天 ago

攝影或3C

Python 現代化路徑管理：用 pathlib 一行搞定 os.mkdir 與 os.makedirs; log_dir.mkdir(parents=True, exist_ok=True)

在 Python 的舊時代，我...

2 週 ago

攝影或3C

Python 現代化路徑管理：用 pathlib 優雅搞定檔案「更名」與「移動」from pathlib import Path; Path.with_name(“新檔名.副檔名”) #更改basename; Path.with_suffix(“.新副檔名”) #更改副檔名

在過去，處理檔案路徑和更名時，...

2 週 ago

攝影或3C

Python 檔案搜尋實戰：glob.glob() vs Path.glob() 誰更好用？

在 Python 自動化腳本中...

2 週 ago

Python機器學習: 決策樹 (DecisionTreeClassifier) ; from sklearn.tree import DecisionTreeClassifier ; tree = DecisionTreeClassifier(criterion = “gini”) #criterion = “entropy” #criterion: 標準，準則

Related Post

Recent Posts

Python-docx 進階密技：突破限制，在 Word 任意位置插入段落的三種流派: insert_paragraph_before ; body.insert(0, p_elem) #像操作 List 一樣操作文件; target_xml_node.addnext(p_new)

Python-docx 進階解析：為什麼你的程式讀不到表格裡的文字？ —— 深入比較 doc.paragraphs、doc.tables 與底層 doc.element.body; from docx.text.paragraph import Paragraph #將 XML CT_P轉為 Paragraph 物件; from docx.table import Table #將 XML CT_Tbl轉為 Table 物件

Python 網頁解析入門：BeautifulSoup 的 find vs select_one; find_all vs select ; Python 風格 vs CSS selector (支援 #id, .class, 層級選擇); from bs4 import BeautifulSoup

Python 現代化路徑管理：用 pathlib 一行搞定 os.mkdir 與 os.makedirs; log_dir.mkdir(parents=True, exist_ok=True)

Python 現代化路徑管理：用 pathlib 優雅搞定檔案「更名」與「移動」from pathlib import Path; Path.with_name(“新檔名.副檔名”) #更改basename; Path.with_suffix(“.新副檔名”) #更改副檔名

Python 檔案搜尋實戰：glob.glob() vs Path.glob() 誰更好用？