代码之家 › 专栏 › 技术社区 › Rodolphe LAMPE

当只有一个值groupedby时,pandas groupby

pandas-groupby pandas

Rodolphe LAMPE · 技术社区 · 6 年前

我要计算给定变量的累积计数。所以我希望下面的代码可以工作

import pandas as pd
import numpy as np

df = pd.DataFrame.from_records({'x': [0, 1, 0, 1, 1]})
df2 = pd.DataFrame.from_records({'x': [0, 0, 0, 0, 0]})

result = df.groupby('x').apply(lambda x: pd.Series(np.arange(len(x)), index=x.index)).reset_index(level=0, drop=True).sort_index()
assert (result == [0, 0, 1, 1, 2]).all()

result2 = df2.groupby('x').apply(lambda x: pd.Series(np.arange(len(x)))).reset_index(level=0, drop=True).sort_index()
assert (result2 == [0, 1, 2, 3, 4]).all()

第一个断言为真,但不是第二个断言。为什么?

1 回复 | 直到 6 年前

harvpan 6 年前

这似乎是一个悬而未决的问题。

参见 BUG: inconsistent return format of Dataframe group apply function .

解决方法可以是:

assert (result2.values == [0, 1, 2, 3, 4]).all()

推荐文章

Joan · 基于多个panda列的唯一值进行分组

2 年前

d_frEak · 具有装箱条件的dataframe groupby聚合计数函数

2 年前

Andre Nevares sj95126 · 如何在Pandas中为特定键的唯一值添加新列(问题agregate)

2 年前

T_Ner · 如何筛选最后一行中的任何组是负数还是正数,只需显示该组即可。熊猫

2 年前

The Great · Pandas groupby并计算多列中NA值的比率

2 年前

yurnero · 熊猫groupby:当前组的坐标

2 年前

EugLP · Groupby multiple columns&Sum-使用添加的If条件创建新列

2 年前

R Shriya · 基于python中另一列中的AND条件在一列中获取值

2 年前

Anakin Skywalker · 修复列名并在将数据框按两列分组后重命名

2 年前

deppep · Pandas根据另一列的值创建一个包含索引的新列

2 年前