How do you build an Excel-style pivot table in Python? DataFrame.pivot_table(values=None, index=None, columns=None, aggfunc='mean') ; df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().unstack('C')

DataFrame.pivot_table(values=None, index=None, columns=None, aggfunc='mean', fill_value=None, margins=False, dropna=True, margins_name='All', observed=False, sort=True)

The example from the official pandas documentation:

import pandas as pd

df = pd.DataFrame({"A": ["foo", "foo", "foo", "foo", "foo",
                          "bar", "bar", "bar", "bar"],
                    "B": ["one", "one", "one", "two", "two",
                          "one", "one", "two", "two"],
                    "C": ["small", "large", "large", "small",
                          "small", "large", "small", "small",
                          "large"],
                    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
                    "E": [2, 4, 5, 5, 6, 6, 8, 9, 9]})

Pivoting is easier to grasp from an example than to explain in words:

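# Sum column D for every (A, B) row and every value of C.
# The string 'sum' avoids the FutureWarning that newer pandas raises for np.sum.
table = pd.pivot_table(df, values='D', index=['A', 'B'],
                       columns=['C'], aggfunc='sum')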

A Stack Overflow answer gets the same values with the following syntax:

df.groupby(['A', 'B', 'C'], sort=False)['D'].sum().unstack('C')

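This code builds a pivot table with groupby and unstack; sort=False tells groupby not to sort the group keys. First the DataFrame is grouped by columns 'A', 'B', and 'C' and column 'D' is summed, which gives a Series with a three-level MultiIndex. Then unstack('C') moves the 'C' level out of the row index, so its unique values become the column labels while 'A' and 'B' stay as the row index. The result is a DataFrame in which each cell holds the sum of 'D' for one ('A', 'B') pair and one value of 'C'. Combinations that never occur in the data come out as NaN.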
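To make the two steps concrete, here is a minimal sketch that runs the groupby/unstack version next to pivot_table on the same data and checks that the cell values agree. The variable names (summed, by_groupby, by_pivot, aligned) are only illustrative, and the rows are realigned before comparing because sort=False may leave them in a different order.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "B": ["one", "one", "one", "two", "two", "one", "one", "two", "two"],
    "C": ["small", "large", "large", "small", "small",
          "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
})

# Step 1: a Series indexed by (A, B, C) holding the sum of D per group.
summed = df.groupby(["A", "B", "C"], sort=False)["D"].sum()

# Step 2: move level C from the row index to the columns.
by_groupby = summed.unstack("C")

# The pivot_table version, for comparison.
by_pivot = pd.pivot_table(df, values="D", index=["A", "B"],
                          columns="C", aggfunc="sum")

# Put both results in the same row/column order, then compare cell by cell.
aligned = by_groupby.reindex(index=by_pivot.index, columns=by_pivot.columns)
same = np.allclose(aligned.to_numpy(dtype=float),
                   by_pivot.to_numpy(dtype=float), equal_nan=True)
print(same)
```

If you need the two results to match in order as well as in values, dropping sort=False (or calling sort_index() afterwards) is probably the simplest route.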

aggfunc:

The simplest option is a single function.

It can also be a list of functions,

or a dict:

{column name: function name}

{column name: list of functions}
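The forms listed above can be sketched side by side as follows. This uses a small made-up DataFrame (not the one from the official docs) and string function names such as 'sum'; the variable names single, several, and per_column are just for illustration.

```python
import pandas as pd

# Toy data, invented for this sketch.
df = pd.DataFrame({
    "A": ["foo", "foo", "bar", "bar"],
    "D": [1, 2, 3, 4],
    "E": [5, 6, 7, 8],
})

# One function: one result column per value column.
single = pd.pivot_table(df, values="D", index="A", aggfunc="sum")

# A list of functions: the columns gain an extra level named after each function.
several = pd.pivot_table(df, values="D", index="A", aggfunc=["sum", "mean"])

# A dict: a different function, or list of functions, per column.
per_column = pd.pivot_table(df, values=["D", "E"], index="A",
                            aggfunc={"D": "sum", "E": ["min", "max"]})

print(single)
print(several)
print(per_column)
```

With a list or a dict containing lists, the result has MultiIndex columns, so a single column is addressed with a tuple such as per_column[("E", "max")].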

The official docs' example, with a list of functions inside a dict:

# Mean of D, and min / max / mean of E, for every (A, C) row.
table = pd.pivot_table(df, values=['D', 'E'],
                       index=['A', 'C'],
                       aggfunc={'D': 'mean',
                                'E': ['min', 'max', 'mean']})

The list given for 'E' is ordered: min, max, mean.

But in the returned table the column names come out as max, mean, min (alphabetical order).

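pivot_table does have a sort parameter, and you can set sort=False, but in my tests it did not reliably change the column order; it only affected the index.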
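A minimal sketch of the effect on the index, using a small made-up DataFrame. It assumes a pandas version that has the sort parameter on pivot_table (1.3 or later); the variable names are illustrative only.

```python
import pandas as pd

# Toy data, invented for this sketch: "foo" appears before "bar".
df = pd.DataFrame({"A": ["foo", "foo", "bar", "bar"],
                   "D": [1, 2, 3, 4]})

# Default sort=True: row labels are sorted.
sorted_rows = pd.pivot_table(df, values="D", index="A", aggfunc="sum")

# sort=False: row labels keep their order of first appearance.
unsorted_rows = pd.pivot_table(df, values="D", index="A",
                               aggfunc="sum", sort=False)

print(sorted_rows.index.tolist())
print(unsorted_rows.index.tolist())
```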

To get the order you want, you can rearrange the columns with reindex(), as covered in the previous post:

table.reindex( columns=[ lisCol[0],lisCol[-1],lisCol[1],lisCol[2] ] )
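The reindex() line above does not show where lisCol comes from. The sketch below assumes it is simply the list of the table's column labels (lisCol = list(table.columns)); that is a guess, not something stated in this post. With that assumption, the call puts the 'E' columns back in min, max, mean order.

```python
import pandas as pd

df = pd.DataFrame({
    "A": ["foo", "foo", "foo", "foo", "foo", "bar", "bar", "bar", "bar"],
    "C": ["small", "large", "large", "small", "small",
          "large", "small", "small", "large"],
    "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
    "E": [2, 4, 5, 5, 6, 6, 8, 9, 9],
})

table = pd.pivot_table(df, values=["D", "E"], index=["A", "C"],
                       aggfunc={"D": "mean", "E": ["min", "max", "mean"]})

# Assumption: lisCol is the list of column labels (tuples, because the
# columns are a MultiIndex), here in pandas' alphabetical order:
# (D, mean), (E, max), (E, mean), (E, min)
lisCol = list(table.columns)

# Pick the labels in the order we want: D mean, then E min, E max, E mean.
reordered = table.reindex(columns=[lisCol[0], lisCol[-1], lisCol[1], lisCol[2]])

print(list(reordered.columns))
```

Indexing by position is fragile if the set of functions changes, so writing the tuples out explicitly, for example ("E", "min"), may be easier to maintain.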

Using margins=True:

import pandas as pd

data = {
'class': ['A', 'A', 'B', 'B', 'C', 'C'],
'subject': ['Math', 'Science', 'Math', 'Science', 'Math', 'Science'],
'score': [80, 90, 85, 95, 70, 80]
}

df = pd.DataFrame(data)

result = pd.pivot_table(df, index='class', columns='subject', values='score',
aggfunc='mean', margins=True, margins_name='Total')

print(result)

result1 = pd.pivot_table(df,
index='class',
values='score',
aggfunc=['mean', 'sum', 'count'],
margins=True,
margins_name='Total')
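To show what the extra 'Total' row and column hold, here is a small sketch that rebuilds the first table above and reads three of its margin cells. With aggfunc='mean', each margin cell is the mean of the underlying scores for that row, that column, or the whole table.

```python
import pandas as pd

df = pd.DataFrame({
    "class": ["A", "A", "B", "B", "C", "C"],
    "subject": ["Math", "Science", "Math", "Science", "Math", "Science"],
    "score": [80, 90, 85, 95, 70, 80],
})

result = pd.pivot_table(df, index="class", columns="subject", values="score",
                        aggfunc="mean", margins=True, margins_name="Total")

# Margin column: mean of class A's scores across subjects.
print(result.loc["A", "Total"])

# Margin row: mean of all Math scores across classes.
print(result.loc["Total", "Math"])

# Bottom-right corner: mean of every score in the data.
print(result.loc["Total", "Total"])
```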
