Fill a matrix with data from a pandas DataFrame, skipping NaN

nan pandas python-2.7 python

Jan · Tech Community · 7 years ago

I want to fill the matrix ref using the pd.DataFrame xxx, but skip NaN.

print xxx
OUT >> 
   intensity name  rowtype1  rowtype2
0        100    A         1       4.0
1        200    A         2       NaN
2        300    B         3       5.0

I then fill the matrix as ref[rowtype, col] = intensity, where col is the DataFrame index. I have 2 rowtype columns.

ref = np.zeros(shape=(7,4))
for idx, inte, name, r1, r2 in xxx.itertuples():
    ref[r1,idx] = inte
    ref[r2,idx] = inte # error because of NaN in rowtype2

print ref

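How can I skip the NaN here? I know one way is to use dropna(), but that requires creating a new DataFrame with just rowtype2 and intensity. I would like a quick, simple way to just skip the NaN that goes with intensity = 200 and move on to the next rowtype2 = 5 with intensity = 300.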

Additional information:

1) Here is how xxx is created:

import numpy as np
import pandas as pd

prot = ['A','A','B']
calc_m = [1,2,3]
calc_m2 = [4, np.nan,5]
inte = [100,200,300]
xxx = pd.DataFrame({'name' : pd.Series(prot),
                    'rowtype1': pd.Series(calc_m),
                    'rowtype2': pd.Series(calc_m2),
                    'intensity': pd.Series(inte)
                    })

2 replies | as of 7 years ago

DJK · 7 years ago

You can use melt and then set ref with NumPy indexing instead of a for loop:

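# One record per (index, rowtype value) pair; dropna() removes the NaN entries.
# Named "melted" rather than "set" to avoid shadowing the built-in.
melted = xxx.reset_index().melt(id_vars=['intensity','index'], value_vars=['rowtype1','rowtype2']).dropna()

ref[melted['value'].astype(int).values, melted['index'].values] = melted['intensity'].values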

This gives you:

array([[   0.,    0.,    0.,    0.],
       [ 100.,    0.,    0.,    0.],
       [   0.,  200.,    0.,    0.],
       [   0.,    0.,  300.,    0.],
       [ 100.,    0.,    0.,    0.],
       [   0.,    0.,  300.,    0.],
       [   0.,    0.,    0.,    0.]])
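
For comparison, the loop from the question can also skip missing entries directly. The following is only a minimal sketch, not part of the original answer: it assumes a pandas version that provides pd.notna, and it reads fields by name so that it does not depend on the column order of the DataFrame.

```python
import numpy as np
import pandas as pd

# Same data as in the question.
xxx = pd.DataFrame({
    'name': ['A', 'A', 'B'],
    'rowtype1': [1, 2, 3],
    'rowtype2': [4, np.nan, 5],
    'intensity': [100, 200, 300],
})

ref = np.zeros(shape=(7, 4))
for row in xxx.itertuples():
    # Check both row-index columns; write only where a value is present.
    for r in (row.rowtype1, row.rowtype2):
        if pd.notna(r):
            # rowtype2 is stored as float because of the NaN, so cast to int.
            ref[int(r), row.Index] = row.intensity
```

This keeps the original loop structure and is probably fine for small frames; for large ones the vectorized indexing above is likely faster.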

n3utrino · 7 years ago

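I'm not sure I fully understand the behavior you're looking for, but the pandas dropna() method has a subset parameter. For example, to drop all rows that have NaN in the rowtype2 column you can use: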

xxx.dropna(subset=['rowtype2'],inplace=True)

This way, only the rows with NaN in the rowtype2 column are dropped.
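
One way this could be applied to the matrix fill is sketched below. It is an illustration under assumptions, not the answerer's code: it calls dropna without inplace=True so that xxx keeps all of its rows, because dropping the row in place would also discard its rowtype1 entry (intensity = 200 at row 2). The name valid is introduced here only for the example.

```python
import numpy as np
import pandas as pd

# Same data as in the question.
xxx = pd.DataFrame({
    'name': ['A', 'A', 'B'],
    'rowtype1': [1, 2, 3],
    'rowtype2': [4, np.nan, 5],
    'intensity': [100, 200, 300],
})

ref = np.zeros(shape=(7, 4))

# rowtype1 has no NaN, so it can be filled from the full frame.
ref[xxx['rowtype1'].values, xxx.index.values] = xxx['intensity'].values

# For rowtype2, filter out only the rows where that column is NaN.
# dropna returns a new frame here; xxx itself is left unchanged.
valid = xxx.dropna(subset=['rowtype2'])
ref[valid['rowtype2'].astype(int).values, valid.index.values] = valid['intensity'].values
```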
