代码之家 › 专栏 › 技术社区 › Tim Wilcox

如何最好地在R中执行此特定行操作?

pivot data.table dplyr r

-1

Tim Wilcox · 技术社区 · 1 年前

所以对于这个任务,在我的真实数据集中。我有18行,indcode=000000,ownership code=10。区分因素是面积。同样,我有18行,indcode=4911,ownership code=10。下面的示例数据将其缩小到4,以便于计算。一些上下文。。在我的真实数据集中,我有从1月2日到6月23日的年度(02)和月份(1月)的月度数据。910是新的indcode。。它代表了特定地区和时间内联邦政府的总就业人数。联邦就业定义为indcode=000000减去indcode=4911。indcode=55只是为了使其更加现实。

附言,我对“02 Jan”有一些困难,所以可以随意将其重命名为Jan。只是想让它与真正的产品保持一致。

 indcode <- c("000000","000000","000000","000000", "55", "4911","4911","4911","4911")
 ownership <- c("10","10","10","10","10","10","10","10","10")
 area <- c("000000","031","029","017","029","000000","031","029","017")
 "02-Jan" <- c(1000,600,300,100,50,100,50,40,10)
 "02-Feb" <- c(1003,601,301,101,51,101,51,41,11)

  first <- data.frame(indcode, ownership, area, `02-Jan`, `02-Feb`)

对于每个区域,这里都有一个例子。实际的02值不是1000-100,而是900,但我认为这会让它更清楚。

    indcode    ownership    area     02-Jan    02-Feb
      910          10        000000    1000-100     1003-101  
      910          10        031       600-50       601-51

1 回复 | 直到 1 年前

Jon Spring 1 年前

library(dplyr)
first |>
  summarize(across(3:4, ~max(.)-min(.)), 
  # OLD: summarize(across(3:4, ~paste(rev(range(.)), collapse = "-")), 
            .by = area) |>
    #"3:4" refers to the 3rd and 4th column once we set aside the area grouping
    # We could alternated specify the columns by name, e.g. X02.Jan:X02.Feb
  mutate(indcode = 910, ownership = 10, .before = 1)

后果

  indcode ownership   area X02.Jan X02.Feb
1     910        10 000000     900     902
2     910        10    031     550     550
3     910        10    029     260     260
4     910        10    017      90      90

推荐文章

Marc B. · 使用ggplot2创建条形图时“缺少值”

1 年前

deschen · tidyverse与外部向量发生突变,该外部向量的元素是数据帧中的列值

1 年前

Laura · 在Shiny中使用可排序的包拖放名称,这些名称将成为图表

1 年前

Mallikarjun M · 如何使用随机森林进行时间序列预测?

1 年前

ly li · 模型摘要:当表格形状改变时,拟合优度消失

1 年前

C.Robin · 将marginaffects::predictions()的结果连接回main df?

1 年前

monotonic · 如何将格式为“col1+col3+col4”的数据帧的行名转换为一列数字向量“c(1,3,4)”?

2 年前

Shawn Hemelstrand · 为什么我的自定义errorbar函数不能在R中工作?

2 年前

RoyBatty · 统计每个字符在整个数据集中出现的次数

2 年前

stats_noob · R: 记录某个“行为”发生的循环的索引?

2 年前