Parsing BLASTP output with Biopython

Question:

I need help parsing the following BLASTP output, which aligns two sequences, using Biopython:

BLASTP 2.2.28+ 


Query= 
Length=237 

Subject= 
Length=268 


Score = 429 bits (1104), Expect = 2e-157, Method: Compositional matrix adjust. 
Identities = 237/268 (88%), Positives = 237/268 (88%), Gaps = 31/268 (12%) 

Query 1 MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVP------- 53 
      MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVP  
Sbjct 1 MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVPFSDPLAD 60 

Query 54 --TIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ 111 
       TIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ 
Sbjct 61 GPTIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ 120 

Query 112 VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYL---- 167 
      VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYL  
Sbjct 121 VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYLLSRS 180 

Query 168 ---------------LIEKLKEYHAAPALQG-GISSPEQVSAAVRAGAAGAISGSAIVKI 211 
          LIEKLKEYHAAPALQG GISSPEQVSAAVRAGAAGAISGSAIVKI 
Sbjct 181 GVTGAENRGALPLHHLIEKLKEYHAAPALQGFGISSPEQVSAAVRAGAAGAISGSAIVKI 240 

Query 212 IEKNLASP--MLAELRSFVSAMKAASRA 237 
      IEKNLASP MLAELRSFVSAMKAASRA 
Sbjct 241 IEKNLASPKQMLAELRSFVSAMKAASRA 268 



Lambda  K  H  a   alpha 
    0.320 0.136 0.386 0.792  4.96 

Gapped 
Lambda  K  H  a   alpha sigma 
    0.267 0.0410 0.140  1.90  42.6  43.6 

Effective search space used: 51972 




Matrix: BLOSUM62 
Gap Penalties: Existence: 11, Extension: 1 
Neighboring words threshold: 11 
Window for multiple hits: 40 

What have you tried? What exactly is the problem? Have you checked the Biopython tutorial? – Llopis 2014-10-06 10:08:47

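You should read this section of the tutorial: http://biopython.org/DIST/docs/tutorial/Tutorial.html#sec100

There you will learn how to run a local BLAST and get XML output. Once you have an XML file (rather than the plain text you pasted in your question), you can parse it: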

from Bio.Blast import NCBIXML
blast_records = NCBIXML.parse(xml_results)  # xml_results: an open handle to the XML file
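To make that concrete, here is a minimal sketch of the whole round trip, under a few assumptions: BLAST+ is installed and `blastp` is on your PATH, and the file names (`query.fasta`, `subject.fasta`, `result.xml`) and the helper names `blastp_xml_command`, `summarize_hsps` and `summarize_blast_xml` are hypothetical, chosen only for illustration. It rebuilds the command with `-outfmt 5` (XML), then walks the records returned by `NCBIXML.parse` and collects the same numbers shown in the text report (bit score, E-value, identities, positives, gaps).

```python
def blastp_xml_command(query_fasta, subject_fasta, xml_out):
    """Build a blastp command that aligns two sequences and writes XML.

    -outfmt 5 selects XML output. The file names are placeholders.
    """
    return [
        "blastp",
        "-query", query_fasta,
        "-subject", subject_fasta,
        "-outfmt", "5",
        "-out", xml_out,
    ]


def summarize_hsps(blast_records):
    """Yield one dict per HSP from an iterable of parsed BLAST records.

    Expects objects shaped like those produced by NCBIXML.parse:
    record.alignments -> alignment.hsps -> hsp attributes.
    """
    for record in blast_records:
        for alignment in record.alignments:
            for hsp in alignment.hsps:
                yield {
                    "query": record.query,
                    "subject": alignment.hit_def,
                    "bits": hsp.bits,
                    "expect": hsp.expect,
                    "identities": hsp.identities,
                    "positives": hsp.positives,
                    "gaps": hsp.gaps,
                    "align_length": hsp.align_length,
                    "percent_identity": 100.0 * hsp.identities / hsp.align_length,
                }


def summarize_blast_xml(path):
    """Parse a BLAST XML file and return a list of HSP summaries."""
    # Imported here so the helpers above can be used without Biopython.
    from Bio.Blast import NCBIXML

    with open(path) as handle:
        return list(summarize_hsps(NCBIXML.parse(handle)))
```

A possible way to use it: run `subprocess.run(blastp_xml_command("query.fasta", "subject.fasta", "result.xml"), check=True)` once, then loop over `summarize_blast_xml("result.xml")` and print each row. The aligned strings themselves are available on each HSP as `hsp.query`, `hsp.match` and `hsp.sbjct` if you need more than the summary numbers.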