代码之家 › 专栏 › 技术社区 › Boosted_d16

排序数据框,但从排序中排除前两列

sorting pandas

Boosted_d16 · 技术社区 · 5 年前

我想对df排序,但我想从排序中排除前两列。

我可以在排除前1列的同时成功排序,但我想更新下面的代码以排除前2列。

from pandas import DataFrame
Cars = {'Dimensions': [0.48,0.44,0.4,0.6],
        'Price': [0.3,0.25,0.74,0.5],
        'Year': [0.41,0.38,0.64,0.65],
        'Range': [0.95,0.98,0.24,0.42],
        'Height': [0.75,0.88,0.84,0.95],
        }

df = DataFrame(Cars, columns= ['Dimensions', 'Price','Year', 'Range', 'Height'], 
               index=['Honda Civic','Toyota Corolla','Ford Focus','Audi A4'])

s = df.iloc[0]
df = df.iloc[:, ((-s[1:]).argsort() + 1).reindex(df.columns, fill_value=0)]

我原以为这能行,但没用

s = df.iloc[0:1]
df = df.iloc[:, ((-s[1:]).argsort() + 1).reindex(df.columns, fill_value=0)]

错误:

AttributeError: 'DataFrame' object has no attribute 'argsort'

预期产量:

                Dimensions  Price  Range  Height  Year  
Honda Civic           0.48   0.30   0.95    0.75  0.41  
Toyota Corolla        0.44   0.25   0.98    0.88  0.38 
Ford Focus            0.40   0.74   0.24    0.84  0.64  
Audi A4               0.60   0.50   0.42    0.95  0.65

1 回复 | 直到 5 年前

jezrael 5 年前

问题是 Series 索引返回一行 DataFrame ,因为 argsort 系列 引发错误:

s = df.iloc[0:1]
print (s)
             Dimensions  Price  Year  Range  Height
Honda Civic        0.48    0.3  0.41   0.95    0.75

对于排除前两列,添加助手前两列由 0,1 并添加 s[2:]

s = df.iloc[0]

new = pd.Series([0,1], index=df.columns[:2])
df = df.iloc[:, new.append( ((-s[2:]).argsort() + 2))]
print (df)
                Dimensions  Price  Range  Height  Year
Honda Civic           0.48   0.30   0.95    0.75  0.41
Toyota Corolla        0.44   0.25   0.98    0.88  0.38
Ford Focus            0.40   0.74   0.24    0.84  0.64
Audi A4               0.60   0.50   0.42    0.95  0.65

推荐文章

Mainland · Python数据帧规范化值错误:列的长度必须与键相同

1 年前

user026 · 如何根据特定窗口的平均值(行数)创建新列?

1 年前

rpn · 如何在列[1]中连续第二次出现“0”时返回列[0]的值

1 年前