Procedure to collect proteins and genes based on protein domains

 

1.    Find the InterPro ID (IPRxxxxxx) of the domain of interest by searching the InterPro database http://www.ebi.ac.uk/interpro/index.html or Pfam http://pfam.sanger.ac.uk/search?tab=searchKeywordBlock

 

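2.    Use the InterPro ID to list the proteins that belong to that protein family on the EBI website http://www.ebi.ac.uk/ . Note the gene name and accession number of each protein, since the following steps use them.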

 

 

3.    Translate the gene symbols to Gene IDs and RefSeq IDs using the batch search in the PANTHER database http://www.pantherdb.org/genes/batchIdSearch.jsp . For proteins that don't have gene names, use their accession numbers (for example, Q9UBT2) to find the Gene IDs in the NCBI Gene database http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene&cmd=search&term=

 

 

4.    For gene names that have no match in the PANTHER database, search for them in the NCBI Gene database http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene&cmd=search&term=
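
The NCBI lookups in steps 3 and 4 can be prepared in bulk with a small script. The following is only a sketch in Python, not part of the original procedure: it assumes the search term is simply appended to the NCBI URL given above, and the helper names (is_interpro_id, ncbi_gene_search_url, split_for_lookup) and the placeholder entry "EXAMPLE_ACC" / "EXAMPLE1" are made up for illustration. It builds URLs only and does not contact any server.

```python
import re
from urllib.parse import quote

# Base URL copied from steps 3 and 4; the search term is appended to it.
NCBI_GENE_SEARCH = "http://www.ncbi.nlm.nih.gov/sites/entrez?db=gene&cmd=search&term="

# InterPro IDs have the form IPRxxxxxx: "IPR" followed by six digits.
INTERPRO_ID = re.compile(r"IPR\d{6}")


def is_interpro_id(text):
    """Return True if text looks like an InterPro ID such as IPR000001."""
    return INTERPRO_ID.fullmatch(text) is not None


def ncbi_gene_search_url(term):
    """Build the NCBI Gene search URL for a gene name or accession number."""
    return NCBI_GENE_SEARCH + quote(term.strip())


def split_for_lookup(proteins):
    """Split (accession, gene_name) pairs by lookup route.

    Gene names are collected for the PANTHER batch search (step 3).
    Proteins without a gene name get an NCBI URL built from the accession.
    """
    gene_names = []
    accession_urls = {}
    for accession, gene_name in proteins:
        if gene_name:
            gene_names.append(gene_name)
        else:
            accession_urls[accession] = ncbi_gene_search_url(accession)
    return gene_names, accession_urls


if __name__ == "__main__":
    # Hypothetical protein list; only Q9UBT2 comes from the procedure text.
    proteins = [("EXAMPLE_ACC", "EXAMPLE1"), ("Q9UBT2", None)]
    names, urls = split_for_lookup(proteins)
    print("Paste into the PANTHER batch search:", ", ".join(names))
    for accession, url in urls.items():
        print(accession, "->", url)
```

Gene names that PANTHER fails to match (step 4) can be passed to ncbi_gene_search_url in the same way to get their NCBI search links.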