Protein sequence pattern-matching python

I'm working on a matching algorithm for protein sequences. I'm starting with an aligned protein sequence, and I am attempting to convert a mis-aligned sequence into the correctly aligned one. Here is an example:

original aligned sequence: ----AB--C-D-----

mis-aligned sequence: --A--BC---D-

The expected output should look like this:

original aligned sequence: ----AB--C-D-----

mis-aligned sequence: ----AB--C-D----- (both are now the same)

I'm told to be very specific about my problem, but the sequences I'm trying to match are >4000 characters long, and look ridiculous when pasted here. I'll post two sequences representative of my problem, though, and that should do.

seq="---A-A--AA---A--"newseq="AA---A--A-----A-----"seq=list(seq) #changing maaster sequence from string to listnewseq=list(newseq) #changing new sequence from string to listn=len(seq) #obtaining length of master sequencenewseq.extend('.') #adding a tag to end of new sequence to account for terminal gapsprint(seq, newseq,n) #verification of sequences in list form and lengthfor i in range(n)    if seq[i]!=newseq[i]:        if seq[i] != '-': #gap deletion            del newseq[i]        elif newseq[i] != '-':            newseq.insert(i,'-') #gap insertion        elif newseq[i] == '-':            del newseq[i]old=''.join(seq) #changing list to stringnew=''.join(newseq) #changing list to stringnew=new.strip('.') #removing tagprint(old) #verification of master-sequence fidelityprint(new) #verification of matching sequence

The output I get from this particular code and set of sequences is:

---A-A--AA---A--

---A-A--A----A-----A-----

I can't seem to get the loop to correctly delete unwanted dashes in between the letters more than once, because the rest of the loop iterations are used in an add dash/delete dash pair.
This is a good start to the problems here.

How can I write this loop successfully to obtain my desired output (two identical sequences)

Protein sequence pattern-matching python

Trending Articles

Moondru Mudichu 27-05-2016 – Polimer tv Serial

Snes4Sym emulator for nokia s60v3

the range cannot be deleted (6028) in microsoft word

GTA 5 PPSSPP Zip File Download For Android Mediafire 382 MB

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

Man to stand trial on three charges of money laundering

DRP MAKER WITH CHEMICALS 9491234553

Chai Status, Funny Tea Quotes in Hindi, चाय पर शायरी

Gulabi kallu Lyrics and translation | GAV / Govindhudu andhari vadele (2014)

Revised GDS Gratuity, Severance Amount and SDBS contribution - Social...

Password Reset on SX6036?

Throw Back: Samini — Where My Baby Dey (Prod by Kaywa)

Name Of Parts Of The Day In hindi And English-List Of Part Of Days In Hindi

VMOU RSCIT Result 2017, RSCIT Result VMOU rkcl.vmou.ac.in Name Wise

Joshua Pigden from Bristol faces trial over rape and Diazepam...

Samuel Llewellyn Richards

Practice Sheet of Right form of verbs for HSC Students

SPY CAMERAS: Bus lane clampdown will be running in Derby by the end of November

Cheriyal Mandal Sarpanch Mobile Numbers List Warangal District in Telangana...