订购一个“混合”的vector(数字与字母)
我如何订购一个像
c("7","10a","10b","10c","8","9","11c","11b","11a","12") -> alph
在
alph [1] "7","8","9","10a","10b","10c","11a","11b","11c","12"
并用它来sortingdata.frame,就像
V1 <- c("A","A","B","B","C","C","D","D","E","E") V2 <- 2:1 V3 <- alph df <- data.frame(V1,V2,V3)
并命令行获得(命令V2然后V3)
V1 V2 V3 C 1 9 A 1 10a B 1 10c D 1 11b E 1 12 A 2 7 C 2 8 B 2 10b E 2 11a D 2 11c
> library(gtools) > mixedsort(alph) [1] "7" "8" "9" "10a" "10b" "10c" "11a" "11b" "11c" "12"
要sorting一个data.frame,你可以使用mixedorder
> mydf <- data.frame(alph, USArrests[seq_along(alph),]) > mydf[mixedorder(mydf$alph),] alph Murder Assault UrbanPop Rape Alabama 7 13.2 236 58 21.2 California 8 9.0 276 91 40.6 Colorado 9 7.9 204 78 38.7 Alaska 10a 10.0 263 48 44.5 Arizona 10b 8.1 294 80 31.0 Arkansas 10c 8.8 190 50 19.5 Florida 11a 15.4 335 80 31.9 Delaware 11b 5.9 238 72 15.8 Connecticut 11c 3.3 110 77 11.1 Georgia 12 17.4 211 60 25.8
多个向量上的mixedorder
(列)
显然mixedorder
不能处理多个向量。 我已经做了一个函数,通过将所有字符向量转换为具有混合sorting的级别的因子,并将所有向量传递给标准order
函数来避免这种情况。
multi.mixedorder <- function(..., na.last = TRUE, decreasing = FALSE){ do.call(order, c( lapply(list(...), function(l){ if(is.character(l)){ factor(l, levels=mixedsort(unique(l))) } else { l } }), list(na.last = na.last, decreasing = decreasing) )) }
然而,在你的特定情况下, multi.mixedorder
会得到与标准order
相同的结果,因为V2
是数字。
df <- data.frame( V1 = c("A","A","B","B","C","C","D","D","E","E"), V2 = 19:10, V3 = alph, stringsAsFactors = FALSE) df[multi.mixedorder(df$V2, df$V3),] V1 V2 V3 10 E 10 12 9 E 11 11a 8 D 12 11b 7 D 13 11c 6 C 14 9 5 C 15 8 4 B 16 10c 3 B 17 10b 2 A 18 10a 1 A 19 7
注意到
-
19:10
相当于c(19:10)
。c
意味着concat ,就是使得一个长向量不是很多,但是在你的情况下你只有一个向量(19:10
),所以不需要连接任何东西。 然而,在V1
的情况下,你有10个vector长度为1,所以你需要连接,就像你已经做的那样。 - 您需要
stringsAsFactors=FALSE
不将V1
和V3
转换为(错误sorting)因子(默认值)。