代码之家 › 专栏 › 技术社区 › OptimusPrime

如何在for循环中创建多个新数据帧?

pandas python

OptimusPrime · 技术社区 · 6 年前

我想创建一个不覆盖现有数据帧的for\u循环?

for df in 2011, 2012, 2013:
       df = pd.pivot_table(df, index=["income"], columns=["area"], values=["id"], aggfunc='count')

2011_pivot, 2012_pivot, 2013_pivot

3 回复 | 直到 6 年前

Sven Harris 6 年前

我通常不鼓励您创建大量具有相关名称的变量,这在Python中是一种危险的设计模式(尽管在SAS中很常见)。一个更好的选择是创建一个dataframes字典,其中key作为“变量名”

df_dict = dict()
for df in 2011, 2012, 2013:
   df_dict["pivot_"+df.name] = pd.pivot_table(df, index=["income"], columns=["area"], values=["id"], aggfunc='count')

Colonder 6 年前

除了创建数据帧列表或字典之外,我看不到其他方法,您必须手动命名它们。

df_list = [pd.pivot_table(df, index=["income"], columns=["area"], values=["id"], aggfunc='count') for df in 2011, 2012, 2013]

here .

jpp 6 年前

dict 或 list 相反,例如通过字典或列表理解。

MultiIndex 列和单个 pd.pivot_table 电话:

dfs = {2011: df_2011, 2012: df_2012, 2013: df_2013}
comb = pd.concat([v.assign(year=k) for k, v in dfs.items()], ignore_index=True)

df = pd.pivot_table(comb, index='income', columns=['year', 'area'],
                    values='id', aggfunc='count')

pivot_2011 = df.iloc[:, df.columns.get_level_values(0).eq(2011)]

推荐文章

Mainland · Python数据帧规范化值错误:列的长度必须与键相同

1 年前

user026 · 如何根据特定窗口的平均值(行数)创建新列?

1 年前

rpn · 如何在列[1]中连续第二次出现“0”时返回列[0]的值

1 年前

asmgx · 为什么合并数据帧不能按照python中的预期方式工作

1 年前

Gtoth · 如何分割Pandas DataFrame中包含多个日期的两个时间戳之间的差异

1 年前

Domarius · 使用loc为多行设置多列值

1 年前

Swastik Bhattacharyya · 如何在同一类别类型的多列上运行get_dummies()函数?

1 年前

DrZoidberg09 · 如何在字典列表中创建一个新关键字,该关键字是另一个关键字的总和?

1 年前

armstrong3701 · 如何有效地处理熊猫数据框中缺失的数据并计算条件统计?

1 年前

msts1906 · 大熊猫向乳胶的适当多品种出口

1 年前