解析使用biopython
问题描述:
我需要帮助解析以下BLASTP输出BLASTP输出对准两个序列:解析使用biopython
BLASTP 2.2.28+
Query=
Length=237
Subject=
Length=268
Score = 429 bits (1104), Expect = 2e-157, Method: Compositional matrix adjust.
Identities = 237/268 (88%), Positives = 237/268 (88%), Gaps = 31/268 (12%)
Query 1 MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVP------- 53
MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVP
Sbjct 1 MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVPFSDPLAD 60
Query 54 --TIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ 111
TIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ
Sbjct 61 GPTIQNANLRAFAAGVTPAQCFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ 120
Query 112 VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYL---- 167
VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYL
Sbjct 121 VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYLLSRS 180
Query 168 ---------------LIEKLKEYHAAPALQG-GISSPEQVSAAVRAGAAGAISGSAIVKI 211
LIEKLKEYHAAPALQG GISSPEQVSAAVRAGAAGAISGSAIVKI
Sbjct 181 GVTGAENRGALPLHHLIEKLKEYHAAPALQGFGISSPEQVSAAVRAGAAGAISGSAIVKI 240
Query 212 IEKNLASP--MLAELRSFVSAMKAASRA 237
IEKNLASP MLAELRSFVSAMKAASRA
Sbjct 241 IEKNLASPKQMLAELRSFVSAMKAASRA 268
Lambda K H a alpha
0.320 0.136 0.386 0.792 4.96
Gapped
Lambda K H a alpha sigma
0.267 0.0410 0.140 1.90 42.6 43.6
Effective search space used: 51972
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 11
Window for multiple hits: 40
答
你必须阅读教程此项:http://biopython.org/DIST/docs/tutorial/Tutorial.html#sec100
您将学习如何启动本地BLAST并获取XML输出。一旦你有你的XML文件(而不是你粘贴在你的问题的文本),你可以解析它:
NCBIXML.parse(xml_results)
你试了一下?你有什么问题?你检查了biopython教程吗? – Llopis 2014-10-06 10:08:47