I'm trying to scrape xml from this url: data.gov.in/sites/default/files/Potato_2013.xml using R (version : 3.1.0)
Tried
library(XML)
url<- "data.gov.in/sites/default/files/Potato_2013.xml"
doc<- xmlParse(url,useInternalNodes=TRUE)
But am getting an error saying
Error: The XML content does not seem to be xml:
Any way to fix this??
typing just
doc<-xmlParse(url)
gives a document is empty error.
I am looking to extract the values for the nodes State, Commodity, Arrival Date etc.
Thanks!