Python: pandas.DataFrame (df) 的取值: df [單一字串] 或df [list_of_strings] 選取一個或多個columns; df [切片] 或 df [bool_Series] 選取多個rows #bool_Series長度同rows, index也需要同df.index ,可以使用.equals() 確認: df.index.equals(mask.index)

by 儲蓄保險王 · 2025-04-20

1. `df[]` 行為的邏輯

當 `[]` 裡是單一字串或list of strings

代表你選擇一個或多個欄位（columns）

df['A']          # 取出名為'A'的column（得到Series）
df[['A', 'B']]   # 取出名為'A','B'的columns（得到DataFrame）

輸出結果:

Python: pandas.DataFrame (df) 的取值: df [單一字串] 或df [list_of_strings] 選取一個或多個columns; df [切片] 或 df [bool_Series] 選取多個rows #bool_Series長度同rows, index也需要同df.index ,可以使用.equals() 確認: df.index.equals(mask.index) - 儲蓄保險王

df[[“A”,”B”]] 等效:
pd.DataFrame(dic,columns=[“A”,”B”])

dict轉為DataFrame
不要誤以為columns 參數是
更改column name的意思,
將會取得NaN:

要更改column name,
請建立好 DataFrame 後
賦值給 DataFrame.columns:

當 `[]` 裡是切片（slice）或 bool Series/array

代表你選擇列（rows）

df[0:2]                # 取出第0、1兩行（用切片選row）
df[df['A'] > = 3]        # 用布林篩選row

輸出結果:

注意: bool Series 的長度同 rows ,
index 也必須同DataFrame
若index不一致,將觸發IndexingError:
Unalignable boolean Series provided as indexer
(index of the boolean Series and of the indexed object do not match).
可以使用.equals() 確認 :
df.index.equals(mask.index)
類似:
(df.index == mask.index).all()
主要區別

equals() 通常效能更佳，特別是對較大的索引

處理 NaN 值：

equals() 會將兩個 NaN 視為相等

== 運算符會將 NaN 與任何值（包括另一個 NaN）比較時返回 False

索引類型檢查：

equals() 會檢查索引的類型是否相同

== 只比較值，不嚴格檢查類型

效能表現：equals() 通常效能更佳，特別是對較大的索引

2. 為什麼會這樣設計？

Pandas 設計哲學

DataFrame 是「以欄為主」的結構
df[] 這個語法，最原始的設計，就是像dict一樣去取「欄」。
- df['A'] 就像 d['A']
- df[['A','B']] 就像從dict裡挑兩個key
但為了方便，也允許用切片和布林遮罩直接過濾row
這是為了讓 DataFrame 有點像 Numpy array 的行為：
- df[0:3] 直覺上就是要「取某幾rows」
- df[條件] 就像 array 的「masking」

類比字典、array 的混合行為

像字典：「key」代表欄位
像陣列：「切片」或「布林陣列」代表row的篩選

3. 行為差異的合理解釋

欄位（columns）取法：

單一字串或list => 你是在指定欄位(key)，所以是「選欄」。

列（rows）取法：

切片(slice) => 行的index切片（跟Numpy array一致）
布林Series/array => 哪些row要保留（跟Numpy array一致）

總結：

df[欄位名或list] → 取欄位（columns）
df[切片或布林遮罩] → 取列（rows）

4. 更嚴謹的說法（官方文件）

The behavior of df[] depends on the type of the argument:

If the argument is a string or list of strings: select columns.

If the argument is a slice or boolean array/Series: select rows.

參考：Pandas 官方文件 – Indexing and selecting data

5. 建議

想選row/col都明確的話，建議用 .loc、.iloc，可避免混淆。

6. 記憶口訣

df[欄位key] → 選column
df[布林/切片] → 選row

df[0,"A"]
會觸發KeyError

這個錯誤訊息是因為用了 df[0, "A"] 這種寫法，
但在 Pandas 裡，這不是正確的取值方式。

原因說明

df[] 只接受一個參數，這個參數是 column 名稱或 column 名稱的 list，或是 row 的 slice、boolean mask。
你傳入了一個tuple (0, "A")，Pandas 會把它當成欄位名稱去查找，結果找不到這個名為 (0, "A") 的欄位，所以報 KeyError。

正確的做法

如果你想「取第0列的’A’欄」：

請使用 .loc 或 .iloc：

.loc：用標籤（index 和 column 名稱）
.iloc：用數字位置（row 和 column 的整數 index）

延伸總結

df[0, 'A'] 錯誤，因為只接受一個參數
用 df.iloc[row_idx, col_idx] 或 df.loc[row_label, col_label] 才能同時指定 row 和 column

官方文件參考

儲蓄保險王

儲蓄險是板主最喜愛的儲蓄工具,最喜愛的投資理財工具則是ETF,最喜愛的省錢工具則是信用卡

一	二	三	四	五	六	日
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30	31

Python: pandas.DataFrame (df) 的取值: df [單一字串] 或df [list_of_strings] 選取一個或多個columns; df [切片] 或 df [bool_Series] 選取多個rows #bool_Series長度同rows, index也需要同df.index ,可以使用.equals() 確認: df.index.equals(mask.index)

1. `df[]` 行為的邏輯

當 `[]` 裡是單一字串或list of strings

當 `[]` 裡是切片（slice）或 bool Series/array

2. 為什麼會這樣設計？

Pandas 設計哲學

類比字典、array 的混合行為

3. 行為差異的合理解釋

欄位（columns）取法：

列（rows）取法：

4. 更嚴謹的說法（官方文件）

5. 建議

6. 記憶口訣

原因說明

正確的做法

如果你想「取第0列的’A’欄」：

延伸總結

官方文件參考

You may also like...

發佈留言取消回覆

hahow

近期文章

分類

近期留言

熱門討論

FB粉絲團

瀏覽量

月曆

Python: pandas.DataFrame (df) 的取值: df [單一字串] 或df [list_of_strings] 選取一個或多個columns; df [切片] 或 df [bool_Series] 選取多個rows #bool_Series長度同rows, index也需要同df.index ,可以使用.equals() 確認: df.index.equals(mask.index)

1. df[] 行為的邏輯

當 [] 裡是單一字串或list of strings

當 [] 裡是切片（slice） 或 bool Series/array

2. 為什麼會這樣設計？

Pandas 設計哲學

類比字典、array 的混合行為

3. 行為差異的合理解釋

欄位（columns）取法：

列（rows）取法：

4. 更嚴謹的說法（官方文件）

5. 建議

6. 記憶口訣

原因說明

正確的做法

如果你想「取第0列的’A’欄」：

延伸總結

官方文件參考

You may also like...

Python:圖片轉文字pip install pytesseract ; pytesseract. pytesseract. tesseract_cmd

Python: matplotlib繪圖 如何用 bbox_to_anchor 控制legend (圖例)位置? ax.legend( bbox_to_anchor = (1, 1), borderaxespad=0)

WORD合併列印:TQC考題510,三天二夜自由行,Skip Record If : 地址 不等於 A*,SHIFT F9展開功能變數,選取A再用交互參照(書籤:市區)取代

發佈留言 取消回覆

hahow

近期文章

分類

近期留言

熱門討論

FB粉絲團

瀏覽量

月曆

1. `df[]` 行為的邏輯

當 `[]` 裡是單一字串或list of strings

當 `[]` 裡是切片（slice）或 bool Series/array

Python: matplotlib繪圖如何用 bbox_to_anchor 控制legend (圖例)位置? ax.legend( bbox_to_anchor = (1, 1), borderaxespad=0)

WORD合併列印:TQC考題510,三天二夜自由行,Skip Record If : 地址不等於 A*,SHIFT F9展開功能變數,選取A再用交互參照(書籤:市區)取代

發佈留言取消回覆