NCIB中pubmed文献数据的抓取
Pubmed数据库中含有大量的文献相关信息,但是抓取这些数据的时候会有很多的问题和困难,但是有了pubmed自己的工具就可以随心所欲的抓取了!!!!
http://www.ncbi.nlm.nih.gov/books/NBK25499/
这里面有各种工具和参数介绍!!
这里是EFetch的介绍:
Base URL
http://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi
Functions
Returns formatted data records for a list of input UIDs Returns formatted data records for a set of UIDs stored on the Entrez History server Required Parameters
db
Database from which to retrieve records. The value must be a valid (default = pubmed). Currently EFetch does not support all Entrez databases. Please see in Chapter 2 for a list of available databases.
Required Parameter – Used only when input is from a UID list
id
UID list. Either a single UID or a comma-delimited list of UIDs may be provided. All of the UIDs must be from the database specified by db. There is no set maximum for the number of UIDs that can be passed to EFetch, but if more than about 200 UIDs are to be provided, the request should be made using the HTTP POST method.
efetch.fcgi?db=protein&id=15718680,157427902,119703751
Required Parameters – Used only when input is from the Entrez History server query_key