2017-12-16 5 views
0

の複数の列にリストを分割すると、(jsonファイルの)リストがあります。ここでは、列のウェブサイトのサンプルです:私のデータセットのWEBSITE列のR

> dataset$WEBSITE[[1]]) 
[1] "list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10, TopicsCount = 3), Data = list(ItemNum = 0, Item = \"https://mywebsite.com/\", ResultCode = \"OK\", Status = \"Found\", ExtBackLinks = 1398, RefDomains = 452, AnalysisResUnitsCost = 1398, ACRank = 4, ItemType = 3, IndexedURLs = 1, GetTopBackLinksAnalysisResUnitsCost = 5000, DownloadBacklinksAnalysisResUnitsCost = 25000, DownloadRefDomainBacklinksAnalysisResUnitsCost = 25000, RefIPs = 323, \n RefSubNets = 273, RefDomainsEDU = 0, ExtBackLinksEDU = 0, RefDomainsGOV = 0, ExtBackLinksGOV = 0, RefDomainsEDU_Exact = 0, ExtBackLinksEDU_Exact = 0, RefDomainsGOV_Exact = 0, ExtBackLinksGOV_Exact = 0, CrawledFlag = \"True\", LastCrawlDate = \"2017-10-05\", LastCrawlResult = \"HTTP_404_NotFound\", RedirectFlag = \"False\", FinalRedirectResult = \"\", OutDomainsExternal = \"5\", OutLinksExternal = \"11\", OutLinksInternal = \"162\", OutLinksPages = \"1\", LastSeen = \"\"... <truncated> 

> dataset$WEBSITE[[2]]) 
[2] "list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10, TopicsCount = 3), Data = list(ItemNum = 0, Item = \"http://www.website.uk\", ResultCode = \"OK\", Status = \"Found\", ExtBackLinks = 254, RefDomains = 76, AnalysisResUnitsCost = 254, ACRank = 9, ItemType = 3, IndexedURLs = 1, GetTopBackLinksAnalysisResUnitsCost = 5000, DownloadBacklinksAnalysisResUnitsCost = 25000, DownloadRefDomainBacklinksAnalysisResUnitsCost = 25000, RefIPs = 75, RefSubNets = 56, \n RefDomainsEDU = 0, ExtBackLinksEDU = 0, RefDomainsGOV = 0, ExtBackLinksGOV = 0, RefDomainsEDU_Exact = 0, ExtBackLinksEDU_Exact = 0, RefDomainsGOV_Exact = 0, ExtBackLinksGOV_Exact = 0, CrawledFlag = \"True\", LastCrawlDate = \"2017-12-14\", LastCrawlResult = \"DownloadedSuccessfully\", RedirectFlag = \"False\", FinalRedirectResult = \"\", OutDomainsExternal = \"2\", OutLinksExternal = \"2\", OutLinksInternal = \"19\", OutLinksPages = \"1\", LastSeen = \"\", Title = \"Dedic... <truncated> 

> dataset$WEBSITE[[3]]) 
[3] "list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10, TopicsCount = 3), Data = list(ItemNum = 0, Item = \"http://www.website.uk\", ResultCode = \"OK\", Status = \"Found\", ExtBackLinks = 254, RefDomains = 76, AnalysisResUnitsCost = 254, ACRank = 9, ItemType = 3, IndexedURLs = 1, GetTopBackLinksAnalysisResUnitsCost = 5000, DownloadBacklinksAnalysisResUnitsCost = 25000, DownloadRefDomainBacklinksAnalysisResUnitsCost = 25000, RefIPs = 75, RefSubNets = 56, \n RefDomainsEDU = 0, ExtBackLinksEDU = 0, RefDomainsGOV = 0, ExtBackLinksGOV = 0, RefDomainsEDU_Exact = 0, ExtBackLinksEDU_Exact = 0, RefDomainsGOV_Exact = 0, ExtBackLinksGOV_Exact = 0, CrawledFlag = \"True\", LastCrawlDate = \"2017-12-14\", LastCrawlResult = \"DownloadedSuccessfully\", RedirectFlag = \"False\", FinalRedirectResult = \"\", OutDomainsExternal = \"2\", OutLinksExternal = \"2\", OutLinksInternal = \"19\", OutLinksPages = \"1\", LastSeen = \"\", Title = \"Dedic... <truncated> 

私のデータセットには、次のようになります。

COLOR  | SIZE  | WEBSITE 
Blue  | 13456  | list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10 
Green  | 17487  | list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10, 
Red   | 65438  | list(Headers = list(MaxTopicsRootDomain = 30, MaxTopicsSubDomain = 20, MaxTopicsURL = 10, To 

私の目標は、私のデータセットは、このように見えるようにするために、専用の列に各JSONノードを有効にすることです:

COLOR  | SIZE  | MaxTopicsRootDomain | MaxTopicsSubDomain | MaxTopicsURL 
Blue  | 13456  | 30     | 20     | 10 
Green  | 17487  | 30     | 20     | 10 
Red   | 65438  | 30     | 20     | 10 

...私はこの方法を試してみましたが、私はわからない私は、正しい方法でmは
dataset$WEBSITE <- as.character(dataset$WEBSITE) #character needed for a strsplit() 
hello <- strsplit(dataset$WEBSITE, split = ",") 
hello <- data.frame(COLOR = rep(dataset$Color, 
          sapply(hello, length)), 
          WEBSITE = unlist(hello)) 

ご協力いただきありがとうございます!

+0

再現可能な例を投稿すると、ヘルプが表示される可能性が高くなります。つまり、 'jsonlite :: flatten'関数を見てみるといいでしょう。 – A5C1D2H2I1M1N2O1R2T1

答えて

0

私は最後にanwserを見つける。

おそらく完璧ではありませんが、機能します。

dataset_2 <- do.call(rbind, dataset$WEBSITE) 
dataset_2 <- cbind(dataset[c("COLOR")], dataset_2) 
dataset <- merge(dataset,dataset_2,by="COLOR") 
dataset <- unique (dataset) 
-1

purrrとmap_dfを使用するとうまくいくはずです。しかし、私は今私のノートパソコンではない

関連する問題