
Method for the fermentative production of sulfur-containing fine chemicals using coryneform bacteria encoding metH

2023-07-06 13:13:21

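Patent title: Method for the fermentative production of sulfur-containing fine chemicals using coryneform bacteria encoding metH
Technical field:
The invention relates to a method for the fermentative production of sulfur-containing fine chemicals, in particular L-methionine, using bacteria that express a nucleotide sequence coding for a methionine synthase (metH) gene.
Background art: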
含硫精細化學品如,甲硫氨酸、高半胱氨酸、S-腺苷甲硫氨酸、穀胱甘肽、半胱氨酸、生物素、硫胺、硫辛酸在細胞中通過天然代謝過程產生並用於許多的工業分支中,包括食品、動物飼料、化妝品和製藥工業。這些總稱為「含硫精細化學品」的物質包括有機酸、蛋白原性(Proteinogenic)胺基酸和非蛋白原性胺基酸、維生素和輔因子。在每種情況下通過培養被開發以產生並分泌大量所需物質的細菌的方法,可以大規模、最便利地生產這些含硫精細化學品。尤其適於該目標的生物體為棒桿菌細菌,一種革蘭氏陽性非致病性細菌。
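It is known that amino acids can be produced by fermentation of coryneform bacteria, in particular strains of Corynebacterium glutamicum. Because of their great importance, the production methods are continuously being improved. Process improvements may concern fermentation technology measures such as stirring and oxygen supply, the composition of the nutrient medium such as the sugar concentration during fermentation, the work-up to the product form, for example by ion-exchange chromatography, or the intrinsic performance properties of the microorganism itself.
Strain selection has yielded a number of mutant strains that produce an assortment of desired compounds from the group of sulfur-containing fine chemicals. The performance properties of these microorganisms with respect to the production of a particular molecule are improved by mutagenesis, selection and mutant selection. This, however, is a time-consuming and difficult procedure. Strains obtained in this way are, for example, resistant to antimetabolites or inhibitors, such as the methionine analogs α-methylmethionine, ethionine, norleucine, N-acetylnorleucine, S-trifluoromethylhomocysteine, 2-amino-5-heprenoic acid, selenomethionine, methionine sulfoximine, methoxine and 1-aminocyclopentanecarboxylic acid, or are auxotrophic for metabolites that are important for regulation, and produce sulfur-containing fine chemicals such as L-methionine.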
重組DNA技術也被使用了一些年以通過擴增各胺基酸生物合成基因和研究對胺基酸生產的作用來改善生產L-胺基酸的棒狀桿菌菌株。
WO-A-02/10209描述了使用生產L-甲硫氨酸的棒桿菌細菌發酵生產L-甲硫氨酸的方法,該細菌中至少metH基因被過表達並且此metH編碼序列來自穀氨酸棒狀桿菌ATCC 13032。
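Summary of the invention:
It is an object of the present invention to provide a novel, improved method for the fermentative production of sulfur-containing fine chemicals, in particular L-methionine.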
我們已經發現通過提供一種含硫精細化學品的發酵生產方法可以實現該目的,該方法包括在棒桿菌細菌中表達編碼具有metH活性的蛋白的異源核苷酸序列。
本發明首先涉及至少一種含硫精細化學品的發酵生產方法,該方法包括下面的步驟a)發酵產生期望的含硫精細化學品的棒桿菌細菌培養物,該棒桿菌細菌表達至少一種編碼具有甲硫氨酸合酶(metH)活性的蛋白的異源核苷酸序列;b)在培養基或細菌細胞中富集含硫精細化學品,和c)分離優選含有L-甲硫氨酸的含硫精細化學品。
上面的metH-編碼核苷酸序列與來自穀氨酸棒狀桿菌ATCC 13032的metH-編碼序列的同源性優選小於70%。metH-編碼序列優選來自下列表I的生物體中的任一種
列表I

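ATCC: American Type Culture Collection, Rockville, MD, USA
PCC: Pasteur Culture Collection of Cyanobacteria, Paris, France

The metH-encoding sequence used according to the invention preferably comprises a coding sequence according to SEQ ID NO 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49 or 51, or a nucleotide sequence that is homologous to one of these sequences and codes for a protein with metH activity.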
此外,根據本發明使用的metH編碼序列優選編碼具有metH活性的蛋白,所述蛋白含有根據SEQ ID NO2、4、6、8、10、12、14、16、18、20、22、24、26、28、30、32、34、36、38、40、42、44、46、48、50和52的胺基酸序列或者代表具有metH活性的蛋白的與上面序列同源的胺基酸序列。
metH編碼序列優選為可以在棒桿菌細菌中複製或者穩定整合到染色體的DNA或RNA。
根據優選的實施方案,本發明方法通過如下方式實施a)使用質粒載體轉化的細菌菌株,其中所述質粒載體攜帶至少一份處於調節序列控制下的metH編碼序列的拷貝或者b)使用一種菌株,該菌株中編碼metH的序列已經被整合到細菌染色體中。
此外,優選為發酵而過表達編碼metH的序列。
此外,還可能希望發酵這樣的細菌,在該細菌中期望的含硫精細化學品的生物合成途徑或者與之相關的生物合成途徑或其他代謝途徑中的至少另一基因也已經被擴增;和/或者至少一種減少期望的含硫精細化學品的生產的代謝途徑至少部分地被關閉。
還可能希望發酵這樣細菌,其中期望的含硫精細化學品的生物合成途徑中的至少另一種基因的活性不受到代謝物的不希望的影響。
因此,根據本發明方法的另一個實施方案,發酵棒桿菌細菌,其中同時,至少一種選自a)天冬氨酸激酶-編碼基因lysC,b)天冬氨酸-半醛脫氫酶-編碼基因asd,c)甘油醛-3-磷酸脫氫酶-編碼基因gap,
d)3-磷酸甘油酸激酶-編碼基因pgk,e)丙酮酸羧化酶-編碼基因pyc,f)丙糖磷酸異構酶-編碼基因tpi,g)高絲氨酸O-乙醯轉移酶-編碼基因metA,h)胱硫醚γ-合酶-編碼基因metB,i)胱硫醚γ-裂合酶-編碼基因metC,j)絲氨酸羥甲基轉移酶-編碼基因glyA,k)O-乙醯高絲氨酸硫化氫解酶-編碼基因metY,l)亞甲基四氫葉酸還原酶-編碼基因metF,m)磷酸絲氨酸氨基轉移酶-編碼基因serC,n)磷酸絲氨酸磷酸酶-編碼基因serB,o)絲氨酸乙醯基轉移酶-編碼基因cysE,p)高絲氨酸脫氫酶-編碼基因hom的基因被過表達。
根據本發明的另一個實施方案,發酵棒桿菌,其中,同時,至少一種選自上面提到的a)到p)的基因被突變,從而尤其使得與未突變的蛋白相比,相應突變的蛋白的活性受代謝物的影響程度較小(如果存在),並且尤其是精細化學品的本發明生產沒有受到不利影響。由於突變,該蛋白也可能具有更高活性(底物轉化)和/或底物特異性,從而增強期望的精細化學品的生產。
根據本發明的另一個實施方案,發酵棒桿菌,其中,同時,至少一種選自q)高絲氨酸激酶-編碼基因thrB,r)蘇氨酸脫水酶-編碼基因ilvA,s)蘇氨酸合酶-編碼基因thrC,t)內消旋-二氨基庚二酸D-脫氫酶-編碼基因ddh,u)磷酸烯醇丙酮酸羧基激酶-編碼基因pck,v)葡萄糖-6-磷酸6-異構酶-編碼基因pgi,
w)丙酮酸氧化酶-編碼基因poxB,x)二氫吡啶二羧酸合酶-編碼基因dapA,y)二氫吡啶二羧酸還原酶-編碼基因dapB;或z)二氨基吡啶甲酸脫羧酶-編碼基因lysA的基因被弱化,尤其是通過減小相應基因的表達速率,或者通過表達具有較低活性(底物轉化)的蛋白來實現此目的。
根據本發明方法的另一個實施方案,發酵棒桿菌細菌,其中,同時,至少一種選自上面q)到z)的基因被突變,從而使得相應蛋白的酶活性被部分或完全減小。
優選在本發明方法中使用微生物物種穀氨酸棒狀桿菌。
在該方法的另一個實施方案中,使用抗至少一種甲硫氨酸生物合成抑制劑的微生物。這些抑制劑為甲硫氨酸類似物,如α-甲基甲硫氨酸、乙硫氨酸、正亮氨酸、N-乙醯基正亮氨酸、S-三氟甲基高半胱氨酸、2-氨基-5-heprenoic acid、硒代甲硫氨酸、甲硫氨酸磺醯亞胺、methoxine、1-氨基環戊烷甲酸。
本發明還涉及從發酵液生產含L-甲硫氨酸的動物飼料添加劑的方法,其包括下面的步驟a)在發酵培養基中培養和發酵產生L-甲硫氨酸的微生物;b)從含有L-甲硫氨酸的發酵液除去水;c)除去按重量計發酵過程中形成的生物量的0到100%;和d)乾燥根據b)和/或c)得到的發酵液,以便得到期望的粉末或顆粒形式的動物飼料添加劑。
本發明還涉及第一次從上面的微生物分離的編碼metH的序列,涉及該序列編碼的甲硫氨酸合酶,以及這些多核苷酸和蛋白質的相應功能同系物。
尤其,本發明還涉及實施上面的方法所需要的表達構建體和微生物。
因此,本發明還涉及如下方面-編碼lysC thr311ile的質粒pCIS lysC thr311ile或其功能等價物,即具有比野生型更大的相應天冬氨酸激酶活性的lysC突變體;-用質粒pCIS lysC thr311ile轉化的,尤其是選自棒狀桿菌屬微生物,尤其是穀氨酸棒狀桿菌種的宿主生物體,如轉化菌株LU 1479 lysC 311ile;-編碼天藍色鏈黴菌(Streptomyces coelicolor)metH的質粒pC PhsdhmetH;-如上定義的宿主生物體,其被編碼外源metH的質粒轉化;尤其是用質粒pC Phsdh metH Sc轉化;-具有抗至少一種甲硫氨酸生物合成抑制劑的如上定義的宿主生物體,如轉化菌株LU 1479 lysC 311ile ET-16,其任選用外源編碼metH的序列轉化,如轉化菌株LU 1479 lysC 311ile ET-16 pC Phsdh metH Sc。
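Detailed description of the invention:
a) General terms
Proteins with the biological activity of methionine synthase, metH for short (systematic name: 5-methyltetrahydrofolate-homocysteine S-methyltransferase; EC 2.1.1.13), are proteins capable of converting homocysteine to methionine and tetrahydrofolate using the cofactors 5-methyltetrahydrofolate (MTHF), cobalamin (vitamin B12) and S-adenosylmethionine. Whereas the cofactor 5-methyltetrahydrofolate enters the reaction stoichiometrically (1 mol of MTHF per 1 mol of methionine formed), S-adenosylmethionine is converted substoichiometrically, as described in the literature. Cobalamin, on the other hand, acts catalytically in the conversion. Further details of the metH protein are known to the skilled worker (Banerjee R.V., Matthews R.G., FASEB J. 4:1450-1459, 1990; Ludwig M.L., Matthews R.G., Annual Review of Biochemistry 66:269-313, 1997; Drennan C.L., Matthews R.G., Ludwig M.L., Current Opinion in Structural Biology 4:919-29, 1994). The skilled worker can distinguish the activity of the cobalamin-dependent 5-methyltetrahydrofolate-homocysteine S-methyltransferase from that of the cobalamin-independent 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase (EC 2.1.1.14), also referred to as metE. The enzymic activity of metH can be detected by enzyme assays; a protocol is given in Jarrett J.T., Goulding C.W., Fluhr K., Huang S., Matthews R.G., Methods in Enzymology 281:196-213, 1997.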
在本發明的範圍內,術語「含硫精細化學品」包括含有至少一個共價結合的硫原子並且可通過本發明的發酵方法得到的任何化學化合物。其非限制性實例為甲硫氨酸、高半胱氨酸、S-腺苷甲硫氨酸,尤其是甲硫氨酸和S-腺苷甲硫氨酸。
在本發明的範圍內,術語「L-甲硫氨酸」、「甲硫氨酸」、高半胱氨酸和S-腺苷甲硫氨酸也包括相應的鹽如,甲硫氨酸鹽酸鹽或甲硫氨酸硫酸鹽。
「多核苷酸」通常指多聚核糖核苷酸(RNA)和多聚脫氧核糖核苷酸(DNA),其可以分別是未修飾的RNA和DNA,或者分別是修飾的RNA和DNA。
根據本發明,「多肽」指含有通過肽鍵相連的兩個或多個胺基酸的肽或蛋白質。
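The term "metabolite" refers to chemical compounds that occur in the metabolism of organisms as intermediates or end products and that, apart from serving as chemical building blocks, may also have a modulating effect on enzymes and their catalytic activity. It is known from the literature that such metabolites can act on enzyme activity in both an inhibiting and a stimulating manner (Biochemistry, Stryer, Lubert, 1995, W.H. Freeman and Company, New York, New York). The literature also reports that enzymes whose susceptibility to metabolites has been altered can be produced in organisms by mutating the genomic DNA, for example with UV radiation, ionizing radiation or mutagenic substances, and subsequently selecting for particular phenotypes (Sahm H., Eggeling L., de Graaf A.A., Biological Chemistry 381(9-10):899-910, 2000; Eikmanns B.J., Eggeling L., Sahm H., Antonie van Leeuwenhoek 64:145-63, 1993-94). These altered properties can also be achieved by specific measures. The skilled worker knows how to modify particular nucleotides of the protein-coding DNA in enzyme genes so that the protein obtained from the expressed DNA sequence has certain new properties, for example an altered modulating effect of metabolites compared with the unmodified protein.
Enzyme activity may be influenced in such a way that the reaction rate is reduced, the affinity for the substrate is changed, or several reaction rates are altered.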
術語「表達」和「擴增」或「過表達」在本發明的上下文中描述微生物中相應DNA編碼的一種或多種酶的產生或其胞內活性的增加。為此,例如,可以向生物體中導入基因、將現有基因替代為另一個基因、增加一種或多種基因的拷貝數,使用強啟動子或者使用編碼具有高活性的相應酶的基因,並且適宜時,這些措施可以組合使用。
b)本發明的metH蛋白本發明還包括上面列表I中特別公開的生物體的metH酶的「功能等價物」。
在本發明範圍內,具體公開的多肽的「功能等價物」或類似物為與具體公開的多肽不同的多肽,這些多肽還具有期望的生物學活性如,底物特異性。
根據本發明,「功能等價物」尤其指在上面提到的序列位置的至少一個位置具有不同於具體提到的胺基酸的胺基酸,但是仍然具有一種上面提到的生物學活性的突變體。「功能等價物」從而還包括可以通過一個或多個胺基酸添加、置換、缺失和/或倒位得到的突變體,所述修飾可以在序列的任何位置發生,只要它們導致具有本發明的性質譜的突變體即可。尤其是當突變的和未修飾的多肽的反應模式在質上相匹配,即例如,相同的底物以不同速度被轉化時,存在功能等價物。
「功能等價物」自然還包括可以從其他生物體得到的多肽,以及天然發生的變體。例如,可通過序列比較發現同源序列區,並且可以按照本發明的具體指導方案確立等價酶。
「功能等價物」還包括例如具有期望的生物學功能的本發明多肽的片段,優選各結構域或序列基序。
「功能等價物」還包括融合蛋白,其具有一個上面提到的多肽序列或衍生自該多肽序列的功能等價物和以功能性方式(即融合蛋白各部分的功能受到可忽略的功能削弱)N-或C-連接的至少一個功能上不同的異源序列。這種異源序列的非限制性實例為,例如,信號肽、酶、免疫球蛋白、表面抗原、受體或受體配體。
根據本發明,「功能等價物」包括具體公開的蛋白的同系物。這些同系物具有,例如在全長上,與具體公開的序列之一至少30%,或者約40%、50%,優選至少約60%、65%、70%或75%,尤其是至少85%,例如,90%、95%或99%的同源性,該同源性通過Pearson和Lipman,Proc.Natl.Acad.,Sci.(USA)85(8),1988,2444-2448的算法計算。同源性程度尤其反映了修飾的和未修飾的序列之間的同一性程度。
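The homology thresholds quoted above (at least 30%, preferably 60% or more) can be illustrated with a minimal sketch that scores two sequences which have already been aligned. This is not the Pearson and Lipman algorithm cited in the text; it only counts identical columns in a given alignment, and the function names and the example peptides are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of two pre-aligned sequences.

    Gaps are written as '-'. Columns in which both sequences carry a gap
    are ignored; a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_homology_threshold(aligned_a: str, aligned_b: str,
                             threshold_percent: float = 30.0) -> bool:
    """True if the identity reaches the chosen threshold (30% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent
```

For real sequences the alignment itself would come from a dedicated tool; the sketch only makes the percentage bookkeeping explicit.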
本發明的蛋白或多肽的同系物可通過誘變,例如,通過蛋白的點突變或截短產生。如此處所用的術語「同系物」,也涉及蛋白的變體形式,其可以作為蛋白活性的激動劑或拮抗劑。
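Homologs of the protein of the invention can be identified by screening combinatorial libraries of mutants, for example truncation mutants. A variegated library of protein variants can be generated, for example, by combinatorial mutagenesis at the nucleic acid level, such as by enzymatic ligation of a mixture of synthetic oligonucleotides. Many methods can be used to prepare libraries of potential homologs from a degenerate oligonucleotide sequence. A degenerate gene sequence can be synthesized chemically in an automated DNA synthesizer, and the synthetic gene can then be ligated into a suitable expression vector. Using a degenerate set of genes makes it possible to provide, in one mixture, all sequences encoding the desired set of potential protein sequences. Methods for synthesizing degenerate oligonucleotides are known to the skilled worker (for example Narang, S.A. (1983) Tetrahedron 39:3; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acids Res. 11:477).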
此外,具有蛋白密碼子的片段的文庫可用於產生蛋白質片段的多樣化群體以備篩選和隨後選擇本發明蛋白的同系物。在一個實施方案中,可通過將編碼序列的雙鏈PCR片段用核酸酶在每個分子僅發生約1次切割的條件下處理,變性雙鏈DNA,再次退火DNA以形成可含有不同帶切口產物的有義/反義對的雙鏈DNA,用S1核酸酶處理從新形成的雙鏈體除去單鏈部分並將所得片段文庫連接到表達載體,從而產生編碼序列片段的文庫。可以通過此方法設計編碼本發明蛋白質的不同大小的N-末端、C-末端和內部片段的表達文庫。
在現有技術中公知一些技術可用於從通過點突變或截短產生的組合文庫篩選基因產物和從cDNA文庫篩選具有選擇的性質的基因產物。這些技術可經改變以適於快速篩選通過本發明的同系物的組合誘變產生的基因文庫。高通量分析篩選大基因文庫最經常使用的技術包括將基因文庫克隆到可複製的表達載體中,用所得載體文庫轉化適宜的細胞並在一定條件下表達組合基因,在該條件下期望活性的檢測方便了編碼該基因(其產物已經被檢測)的載體的分離。遞歸整體誘變(REM)——一種增加文庫中功能突變體的頻率的技術——可與篩選試驗組合使用以鑑定同系物(Arkin undYourvan(1992)PNAS 887811-7815;Delgrave等(1993)Protein Engineering6(3)327-331)。
c)本發明的多核苷酸本發明還涉及編碼上面的metH酶和可通過例如使用人工核苷酸類似物得到的該metH酶的功能等價物的核酸序列(單-和雙鏈DNA和RNA序列,例如cDNA和mRNA)。
本發明不僅涉及編碼本發明多肽或蛋白質或者其生物活性部分的分離的核酸分子,還涉及可用作例如,用以鑑定或擴增本發明的編碼核酸的雜交探針或引物的核酸片段。
此外,本發明的核酸分子可含有來自該基因的編碼區的3』和/或5』末端的非翻譯序列。
「分離的」核酸分子與存在於該核酸的天然來源中的其他核酸分子分離並且如果其通過重組技術製備那麼還可以基本上沒有其他細胞物質或培養基,或者如果其通過化學合成那麼還可以基本上沒有化學前體或其他化學物質。
本發明還包括與具體描述的核苷酸序列或其部分互補的核酸分子。
本發明的核苷酸序列使得可以產生可用於鑑定和/或克隆其他細胞類型和生物體中的同源序列的探針和引物。這些探針和引物通常互補於一個核苷酸序列區,該序列區在嚴緊條件下雜交本發明核酸序列的有義鏈或相應的反義鏈的至少約12個,優選至少約25個,例如40、50或75個連續核苷酸。
本發明的其他核酸序列衍生自SEQ ID NO1、3、5、7、9、11、13、15、17、19、21、23、25、27、29、31、33、35、37、39、41、43、45、47、49或51並且通過加入、置換、插入或缺失一個或多個核苷酸而不同,但是仍然編碼具有期望的性質譜的多肽。這些可以是與上面序列,例如在全長範圍內,在至少約50%、55%、60%、65%、70%、80%或90%,優選至少約95%、96%、97%、98%或99%的序列位置相同的多核苷酸。
本發明還包括按照特定來源或宿主生物體的密碼子使用與具體提到的序列相比含有「沉默」突變或被修飾的那些核酸序列,以及天然發生的變體,例如,剪接變體或等位基因變體。本發明還涉及可通過保守核苷酸置換(即相關胺基酸被相同電荷、大小、極性和/或溶解性的胺基酸代替)得到的序列。
本發明還涉及通過序列多態性從具體公開的核酸衍生的分子。因為群體內個體間的天然變異而可能存在這些遺傳多態性。這些天然變異通常導致基因的核苷酸序列中1到5%的差異。
本發明還包括與上面提到的編碼序列雜交或與其互補的核酸序列。這些多核苷酸可以在基因組或cDNA文庫的篩選時被發現,並且適宜時,可通過PCR,使用適宜的引物從這些文庫擴增,然後,例如,用適宜的探針分離。另一個可能方案是用本發明的多核苷酸或載體轉化適宜的微生物,繁殖微生物從而擴增多核苷酸,然後分離它們。再一個可能方案是通過化學途徑合成本發明的多核苷酸。
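The property of being able to "hybridize" to polynucleotides means the ability of a polynucleotide or oligonucleotide to bind to a nearly complementary sequence under stringent conditions, while nonspecific binding between noncomplementary partners does not occur under these conditions. For this purpose, the sequences should be 70-100%, preferably 90-100%, complementary. The ability of complementary sequences to bind specifically to one another is exploited, for example, in the Northern or Southern blot technique or, for primer binding, in PCR or RT-PCR. Oligonucleotides with a length of 30 base pairs or more are usually used for this purpose. Stringent conditions mean, for example in the Northern blot technique, the use of a washing solution at 50-70°C, preferably 60-65°C, for example 0.1x SSC buffer containing 0.1% SDS (20x SSC: 3 M NaCl, 0.3 M sodium citrate, pH 7.0), for eluting nonspecifically hybridized cDNA probes or oligonucleotides. In this case, as mentioned above, only highly complementary nucleic acids remain bound to one another. The setting of stringent conditions is known to the skilled worker and is described, for example, in Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, N.Y. (1989), 6.3.1-6.3.6.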
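The wash buffer arithmetic can be made explicit. The sketch below assumes only the 20x SSC composition stated in the text (3 M NaCl, 0.3 M sodium citrate) and derives the salt concentrations of a diluted buffer and the stock volume needed for it; the function names are hypothetical.

```python
# Composition of the 20x SSC stock as given in the text (mol/l).
SSC_20X_MOLAR = {"NaCl": 3.0, "sodium citrate": 0.3}


def ssc_concentrations(fold: float) -> dict:
    """Molar salt concentrations of an n-fold SSC buffer diluted from 20x stock."""
    return {name: conc * fold / 20.0 for name, conc in SSC_20X_MOLAR.items()}


def stock_volume_ml(final_fold: float, final_volume_ml: float,
                    stock_fold: float = 20.0) -> float:
    """Volume of stock needed (C1 * V1 = C2 * V2) for the requested final buffer."""
    return final_volume_ml * final_fold / stock_fold


wash = ssc_concentrations(0.1)          # the 0.1x SSC wash named in the text
stock_for_one_litre = stock_volume_ml(0.1, 1000.0)
```

Under these assumptions a 0.1x SSC wash contains 15 mM NaCl and 1.5 mM sodium citrate, and one litre of it requires 5 ml of the 20x stock; the 0.1% SDS is added separately.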
d)metH編碼基因的分離編碼酶甲硫氨酸合酶(EC 2.1.1.13)的metH基因可以以本身已知的方式從上面列表I的生物體分離。
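To isolate the metH gene or other genes of the organisms of List I above, a gene library of the organism is first generated in Escherichia coli (E. coli). The generation of gene libraries is described in detail in generally known textbooks and manuals. Examples that may be mentioned are the textbook by Winnacker, Gene und Klone, Eine Einführung in die Gentechnologie (Verlag Chemie, Weinheim, Germany, 1990), and the manual by Sambrook et al., Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1989). A very well-known gene library is that of E. coli K-12 strain W3110, which was generated in λ vectors by Kohara et al. (Cell 50, 495-508 (198)).
To produce a gene library of the organisms of List I in E. coli, cosmids such as the cosmid vector SuperCos1 (Wahl et al., 1987, Proceedings of the National Academy of Sciences USA 84:2160-2164) may be used, or else plasmids such as pBR322 (Bolivar; Life Sciences 25, 807-818 (1979)) or pUC9 (Vieira et al., 1982, Gene 19:259-268). Suitable hosts are in particular E. coli strains that are restriction- and recombination-defective. An example is the strain DH5αmcr described by Grant et al. (Proceedings of the National Academy of Sciences USA 87 (1990) 4645-4649). The long DNA fragments cloned with the aid of cosmids can in turn be subcloned into common vectors suitable for sequencing and then sequenced, as described, for example, in Sanger et al. (Proceedings of the National Academy of Sciences of the United States of America 74:5463-5467, 1977).
The DNA sequences obtained can then be studied using known algorithms or sequence analysis programs, for example that of Staden (Nucleic Acids Research 14, 217-232 (1986)), that of Marck (Nucleic Acids Research 16, 1829-1836 (1988)) or the GCG program of Butler (Methods of Biochemical Analysis 39, 74-97 (1998)).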
發現了來自根據上面的表I的生物體的編碼metH的DNA序列。具體地,發現了根據SEQ ID NO1、3、5、7、9、11、13、15、17、19、21、23、25、27、29、31、33、35、37、39、41、43、45、47、49和51的DNA序列。此外,使用上面描述的方法,從存在的所述DNA序列得到相應蛋白質的胺基酸序列。SEQ ID NO2、4、6、8、10、12、14、16、18、20、22、24、26、28、30、32、34、36、38、40、42、44、46、48、50和52描述了得到的metH基因產物的胺基酸序列。
由於遺傳密碼簡併性從根據SEQ ID NO1、3、5、7、9、11、13、15、17、19、21、23、25、27、29、31、33、35、37、39、41、43、45、47、49和51的序列得到的DNA編碼序列也是本發明的主題。同樣,本發明也涉及與所述序列或者來自於其的序列部分雜交的DNA序列。
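Instructions for identifying DNA sequences by hybridization can be found by the skilled worker, inter alia, in the manual "The DIG System Users Guide für Filter Hybridization" from Boehringer Mannheim GmbH (Mannheim, Germany, 1993) and in Liebl et al. (International Journal of Systematic Bacteriology (1991) 41:255-260). Instructions for amplifying DNA sequences with the aid of the polymerase chain reaction (PCR) can be found by the skilled worker, inter alia, in the manual by Gait, Oligonucleotide Synthesis: A Practical Approach (IRL Press, Oxford, UK, 1984) and in Newton and Graham, PCR (Spektrum Akademischer Verlag, Heidelberg, Germany, 1994).
It is further known that changes at the N- and/or C-terminus of a protein do not substantially impair its function or may even stabilize it. Information on this can be found by the skilled worker, inter alia, in Ben-Bassat et al. (Journal of Bacteriology 169:751-757 (1987)), O'Regan et al. (Gene 77:237-251 (1989)), Sahin-Toth et al. (Protein Sciences 3:240-247 (1994)), Hochuli et al. (Biotechnology 6:1321-1325 (1988)) and in known textbooks of genetics and molecular biology.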
相應從SEQ ID NO2、4、6、8、10、12、14、16、18、20、22、24、26、28、30、32、34、36、38、40、42、44、46、48、50和52得到的胺基酸序列也同樣是本發明的部分。
e)根據本發明使用的宿主細胞本發明還涉及作為宿主細胞的微生物,尤其是棒桿菌細菌,其中,該微生物含有載體,尤其是穿梭載體或質粒載體,載體上攜帶至少一個如本發明所定義的metH基因,或者在該微生物中本發明的metH基因被表達或擴增。
這些微生物可以從葡萄糖、蔗糖、乳糖、果糖、麥芽糖、糖蜜、澱粉、纖維素或從甘油和乙醇產生含硫精細化學品,尤其是L-甲硫氨酸。所述微生物優選棒桿菌細菌,尤其是棒狀桿菌屬。對於棒狀桿菌屬,必須提到尤其是現有技術中公知能夠產生L-胺基酸的穀氨酸棒狀桿菌菌株。
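Examples of suitable strains of coryneform bacteria that may be mentioned are those of the genus Corynebacterium, in particular of the species Corynebacterium glutamicum, such as Corynebacterium glutamicum ATCC 13032, Corynebacterium acetoglutamicum ATCC 15806, Corynebacterium acetoacidophilum ATCC 13870, Corynebacterium thermoaminogenes FERM BP-1539 and Corynebacterium melassecola ATCC 17965; or of the genus Brevibacterium, such as Brevibacterium flavum ATCC 14067, Brevibacterium lactofermentum ATCC 13869 and Brevibacterium divaricatum ATCC 14020; or strains derived from them that likewise produce the desired fine chemical or its precursors, such as Corynebacterium glutamicum KFCC 10065 and Corynebacterium glutamicum ATCC 21608.
The abbreviation KFCC means Korean Federation of Culture Collection, the abbreviation ATCC means American Type Culture Collection, and the abbreviation FERM means the collection of the National Institute of Bioscience and Human Technology, Agency of Industrial Science and Technology, Japan.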
f)實施本發明的發酵根據本發明,發現棒桿菌細菌在過表達來自表I的生物體的metH基因後,以有利的方式產生含硫精細化學品,尤其是L-甲硫氨酸。
為了實現過表達,技術人員可以採取單獨的或組合的不同措施。因此,可以增加合適的基因的拷貝數或者突變位於結構基因上遊的啟動子和調節區或者核糖體結合位點。整合到結構基因的上遊的表達盒以相同方式起作用。誘導型啟動子使得還可以在發酵的L-甲硫氨酸生產過程中增加表達。表達還可以通過延長mRNA的壽命來提高。此外,通過防止酶蛋白的降解還可以增強酶活性。基因或基因構建體可以以變化的拷貝數存在於質粒中或者被整合到染色體上並在其中擴增。另一個可能的備選方案是通過改變培養基組分和培養的操作來實現相關基因的過表達。
Instructions for this can be found by the skilled worker, inter alia, in Martin et al. (Biotechnology 5, 137-146 (1987)), Guerrero et al. (Gene 138, 35-41 (1994)), Tsuchiya and Morinaga (Bio/Technology 6, 428-430 (1988)), Eikmanns et al. (Gene 102, 93-98 (1991)), European patent 0472869, US patent 4,601,893, Schwarzer and Pühler (Biotechnology 9, 84-87 (1991)), Remscheid et al. (Applied and Environmental Microbiology 60, 126-132 (1994)), LaBarre et al. (Journal of Bacteriology 175, 1001-1007 (1993)), patent application WO 96/15246, Malumbres et al. (Gene 134, 15-24 (1993)), Japanese published specification JP-A-10-229891, Jensen and Hammer (Biotechnology and Bioengineering 58, 191-195 (1998)), Makrides (Microbiological Reviews 60:512-538 (1996)) and in known textbooks of genetics and molecular biology.
本發明因此還涉及含有處於調節核酸序列的遺傳控制下的編碼本發明多肽的核酸序列的表達構建體;涉及含有至少一個所述表達構建體的載體。本發明的這種構建體優選包括特定編碼序列的5』上遊的啟動子和3』下遊的終止子以及,適宜時,其它調節元件,在每種情況下這些調節元件可操作地連接到編碼序列上。「可操作地連接」指啟動子、編碼序列、終止子和適宜時,其它調節元件的順序排列,從而每個調節元件可在編碼序列表達中正確執行其功能。可操作連接的序列的實例為活化序列和增強子,等等。其他調節元件包括可選擇標記、擴增信號、複製起點,等等。適宜的調節序列在例如Goeddel,基因表達技術酶學方法185,Academic Press,SanDiego,CA(1990)中描述。
除了人工調節序列,天然調節序列仍然可存在於實際結構基因的上遊。遺傳修飾可以,適宜時,關閉該天然調節並增加或減少基因的表達。然而,基因構建體也可以具有更簡單的設計,即沒有額外的調節信號被插入到結構基因的上遊並且天然啟動子及其調節作用沒有被除去。相反,可以突變天然調節序列使調節作用不再發生並且基因表達得到增加或減少。基因構建體可含有核酸序列的一份或多份拷貝。
有用的啟動子的實例為來自穀氨酸棒狀桿菌啟動子的ddh、amy、lysC、dapA、lysA,以及革蘭氏陽性細菌啟動子SPO2(見「枯草芽孢桿菌及其近親」,Sonenshein,Abraham L.,Hoch,James A.,Losick,Richard;ASM Press,華盛頓哥倫比亞特區,和Patek M.Eikmanns BJ.,Patek J.,Sahm H.,Microbiology.142 1297-309,1996)或者優選在革蘭氏陰性細菌中應用的cos、tac、trp、tet、trp-tet、lpp、lac、lpp-lac、laclq、T7、T5、T3、gal、trc、ara、SP6、λ-PR和λ-PL啟動子。還優選使用誘導型啟動子,例如光可誘導的啟動子以及尤其是溫度可誘導的啟動子,如PrPl啟動子。原則上可以使用所有天然啟動子及其調節序列。此外,還可以有利地使用合成啟動子。
所提及的調節序列旨在使得核酸序列可以特異表達。取決於宿主生物,這可能意味著,例如,基因僅僅在誘導後表達或過表達,或者其立即表達和/或過表達。
關於這一點,調節序列和因子可以優選對表達具有有益作用,從而增加或減少表達。因此,可以有利地通過使用強轉錄信號如啟動子和/或增強子在轉錄水平增強調節元件。然而,除此之外還可以通過例如提高mRNA的穩定性增強翻譯。
通過將適宜的啟動子、適宜的Shine-Dalgarno序列融合到metH核苷酸序列和適宜的終止信號上製備表達盒。為此,使用一般重組和克隆技術,例如在Current Protocols in Molecular Biology,1993,John Wiley Sons,Incorporated,New York,New York,PCR Methods,Gelfand,David H.,Innis,Michael A.,Sninsky,John J.,1999,Academic Press,Incorporated,California,San Diego,PCR Cloning Protocols,Methods in MolecularBiology,Ser.,Vol.192,第二版,Humana Press,New Jersey,Totowa.T.Maniatis,E.F.Fritsch和J.Sambrook,分子克隆實驗室指南,冷泉港實驗室,冷泉港,NY(1989)和T.J.Silhavy,M.L.Berman und L.W.Enquist,基因融合實驗,冷泉港實驗室,冷泉港,NY(1984)和Ausubel,F.M.等,Current Protocols in Molecular Biology,Greene Publishing Assoc.andWiley Interscience(1987)中描述的技術。
重組核酸構建體或基因構建體在適宜的宿主生物體中表達,這可通過將構建體有利地插入宿主特異性載體中實現,該載體使得這些基因在宿主中的最佳表達成為可能。載體是技術人員熟知的並且可在例如,「克隆載體」(Pouwels P.H.等,Hrsg,Elsevier,Amsterdam-New York-Oxford,1985)中發現。術語「載體」指質粒和技術人員公知的所有其他載體,例如,噬菌體、轉座子、IS元件、質粒、粘粒和線性或環狀DNA。這些載體可在宿主生物體中自主複製或者隨染色體複製。
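The metH genes of the invention are amplified by overexpressing them, for example with the aid of episomal plasmids. Suitable plasmids are those that are replicated in coryneform bacteria. Numerous known plasmid vectors, such as pZ1 (Menkel et al., Applied and Environmental Microbiology (1989) 64:549-554), pEKEx1 (Eikmanns et al., Gene 102:93-98 (1991)) or pHS2-1 (Sonnen et al., Gene 107:69-74 (1991)), are based on the cryptic plasmids pHM1519, pBL1 or pGA1. Other plasmid vectors, such as pCLiK5MCS or those based on pCG4 (US-A 4,489,160), pNG2 (Serwold-Davis et al., FEMS Microbiology Letters 66, 119-124 (1990)) or pAG1 (US-A 5,158,891), can be used in the same way.
Suitable plasmid vectors are also those with the aid of which the method of gene amplification by integration into the chromosome can be applied, as described, for example, by Remscheid et al. (Applied and Environmental Microbiology 60, 126-132 (1994)) for duplicating and amplifying the hom-thrB operon. In this method, the complete gene is cloned into a plasmid vector that can replicate in a host (typically E. coli) but not in C. glutamicum. Suitable vectors are, for example, pSUP301 (Simon et al., Bio/Technology 1, 784-791 (1983)), pK18mob or pK19mob (Schäfer et al., Gene 145, 69-73 (1994)), Bernard et al., Journal of Molecular Biology 234:534-541 (1993), pEM1 (Schrumpf et al., 1991, Journal of Bacteriology 173:4510-4516) or pBGS8 (Spratt et al., 1986, Gene 41:337-342). The plasmid vector containing the gene to be amplified is then transferred into the desired C. glutamicum strain by transformation. Transformation methods are described, for example, in Thierbach et al. (Applied Microbiology and Biotechnology 29, 356-362 (1988)), Dunican and Shivnan (Biotechnology 7, 1067-1070 (1989)) and Tauch et al. (FEMS Microbiological Letters 123, 343-347 (1994)).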
酶的活性可以通過相應基因中的突變來影響,從而使酶反應的速度部分或完全地減小。這種突變的實例是技術人員公知的(Motoyama H.,YanoH.,Terasaki Y.,Anazawa H.,Applied Environmental Microbiology.673064-70,2001,Eikmanns BJ.,Eggeling L.,Sahm H.,Antonie vanLeeuwenhoek.64145-63,1993-94)。
此外,對於生產含硫精細化學品,尤其是L-甲硫氨酸,除了表達和擴增本發明的metH基因,可能有利的是,還擴增參與甲硫氨酸生物合成途徑或者與其相關(即與其功能上相聯繫)的生物合成途徑或其他代謝途徑,如半胱氨酸、賴氨酸或蘇氨酸代謝途徑,如尤其是天冬氨酸半醛合成、糖酵解、糖回補、磷酸戊糖代謝、檸檬酸循環或胺基酸輸出的一種或多種酶。
從而,可以擴增一種或多種下面的基因以產生含硫精細化學品,尤其是L-甲硫氨酸(即,例如,以更高拷貝數存在或者編碼具有更高活性或特異性的酶)-天冬氨酸激酶-編碼基因lysC(EP 1 108 790 A2;DNA-SEQ NO.281),-天冬氨酸-半醛脫氫酶-編碼基因asd(EP 1 108 790 A2;DNA-SEQ NO.282),-甘油醛-3-磷酸脫氫酶-編碼基因gap(Eikmanns(1992),Journalof Bacteriology 1746076-6086),-3-磷酸甘油酸激酶-編碼基因pgk(Eikmanns(1992),Journal ofBacteriology 1746076-6086),-丙酮酸羧化酶-編碼基因pyc(Eikmanns(1992),Journal ofBacteriology 1746076-6086),-丙糖磷酸異構酶-編碼基因tpi(Eikmanns(1992),Journal ofBacteriology 1746076-6086),-高絲氨酸O-乙醯轉移酶-編碼基因metA(EP 1 108 790 A2;DNA-SEQ NO.725),-胱硫醚γ-合酶-編碼基因metB(EP 1 108 790 A2;DNA-SEQ NO.3491),-胱硫醚γ-裂合酶-編碼基因metC(EP 1 108 790 A2;DNA-SEQNO.3061),-絲氨酸羥甲基轉移酶-編碼基因glyA(EP 1 108 790 A2;DNA-SEQ NO.1110),-O-乙醯高絲氨酸硫化氫解酶-編碼基因metY(EP 1 108 790 A2;DNA-SEQ NO.726),-亞甲基四氫葉酸還原酶-編碼基因metF(EP 1 108 790 A2;DNA-SEQ NO.2379),-磷酸絲氨酸氨基轉移酶-編碼基因serC(EP 1 108 790 A2;DNA-SEQ NO.928),-磷酸絲氨酸磷酸酶-編碼基因serB(EP 1 108 790 A2;DNA-SEQNO.334,DNA-SEQ NO.467,DNA-SEQ NO.2767),-絲氨酸乙醯基轉移酶-編碼基因cysE(EP 1 108 790 A2;DNA-SEQ NO.2818),-高絲氨酸脫氫酶-編碼基因hom(EP 1 108 790 A2;DNA-SEQNO.1306)。
從而,對棒桿菌中含硫精細化學品,尤其是L-甲硫氨酸的生產,可能有利的是同時突變至少一種下面的基因,尤其是使得相應蛋白的活性與未突變蛋白的相比受到代謝物的影響較小或者根本不受影響-天冬氨酸激酶-編碼基因lysC(EP 1 108 790 A2;DNA-SEQ NO.281),-丙酮酸羧化酶-編碼基因pyc(Eikmanns(1992),Journal ofBacteriology 1746076-6086),-高絲氨酸O-乙醯轉移酶-編碼基因metA(EP 1 108 790 A2;DNA-SEQ NO.725),-胱硫醚γ-合酶-編碼基因metB(EP 1 108 790 A2;DNA-SEQ NO.3491),-胱硫醚γ-裂合酶-編碼基因metC(EP 1 108 790 A2;DNA-SEQNO.3061),-絲氨酸羥甲基轉移酶-編碼基因glyA(EP 1 108 790 A2;DNA-SEQ NO.1110),-O-乙醯高絲氨酸硫化氫解酶-編碼基因metY(EP 1 108 790 A2;DNA-SEQ NO.726),-亞甲基四氫葉酸還原酶-編碼基因metF(EP 1 108 790 A2;DNA-SEQ NO.2379),
-磷酸絲氨酸氨基轉移酶-編碼基因serC(EP 1 108 790 A2;DNA-SEQ NO.928),-磷酸絲氨酸磷酸酶-編碼基因serB(EP 1 108 790 A2;DNA-SEQNO.334,DNA-SEQ NO.467,DNA-SEQ NO.2767),-絲氨酸乙醯基轉移酶-編碼基因cysE(EP 1 108 790 A2;DNA-SEQ NO.2818),-高絲氨酸脫氫酶-編碼基因hom(EP 1 108 790 A2;DNA-SEQNO.1306)。
對含硫精細化學品,尤其是L-甲硫氨酸生產,可能還有利的是除了表達和擴增本發明的metH基因之一,還弱化一種或多種下面的基因,尤其是減少其表達,或者將其關閉-高絲氨酸激酶-編碼基因thrB(EP 1 108 790 A2;DNA-SEQ NO.3453),-蘇氨酸脫水酶-編碼基因ilvA(EP 1 108 790 A2;DNA-SEQ NO.2328),-蘇氨酸合酶-編碼基因thrC(EP 1 108 790 A2;DNA-SEQ NO.3486),-內消旋-二氨基庚二酸D-脫氫酶-編碼基因ddh(EP 1 108 790 A2;DNA-SEQ NO.3494),-磷酸烯醇丙酮酸羧基激酶-編碼基因pck(EP 1 108 790 A2;DNA-SEQ NO.3157),-葡萄糖-6-磷酸6-異構酶-編碼基因pgi(EP 1 108 790 A2;DNA-SEQ NO.950),-丙酮酸氧化酶-編碼基因poxB(EP 1 108 790 A2;DNA-SEQ NO.2873),-二氫吡啶二羧酸合酶-編碼基因dapA(EP 1 108 790 A2;DNA-SEQ NO.3476),-二氫吡啶二羧酸還原酶-編碼基因dapB(EP 1 108 790 A2;DNA-SEQ NO.3477);-二氨基吡啶甲酸脫羧酶-編碼基因lysA(EP 1 108 790 A2;DNA-SEQ NO.3451)。
對含硫精細化學品,尤其是L-甲硫氨酸生產還有利的是除了在棒桿菌中表達和擴增本發明的metH基因之一,同時還突變至少一種下面的基因使得相應蛋白質的酶促活性部分或全部減小-高絲氨酸激酶-編碼基因thrB(EP 1 108 790 A2;DNA-SEQ NO.3453),-蘇氨酸脫水酶-編碼基因ilvA(EP 1 108 790 A2;DNA-SEQ NO.2328),-蘇氨酸合酶-編碼基因thrC(EP 1 108 790 A2;DNA-SEQ NO.3486),-內消旋-二氨基庚二酸D-脫氫酶-編碼基因ddh(EP 1 108 790 A2;DNA-SEQ NO.3494),-磷酸烯醇丙酮酸羧基激酶-編碼基因pck(EP 1 108 790 A2;DNA-SEQ NO.3157),-葡萄糖-6-磷酸6-異構酶-編碼基因pgi(EP 1 108 790 A2;DNA-SEQ NO.950),-丙酮酸氧化酶-編碼基因poxB(EP 1 108 790 A2;DNA-SEQ NO.2873),-二氫吡啶二羧酸合酶-編碼基因dapA(EP 1 108 790 A2;DNA-SEQ NO.3476),-二氫吡啶二羧酸還原酶-編碼基因dapB(EP 1 108 790 A2;DNA-SEQ NO.3477);-二氨基吡啶甲酸脫羧酶-編碼基因lysA(EP 1 108 790 A2;DNA-SEQ NO.3451)。
對含硫精細化學品,尤其是L-甲硫氨酸生產還有利的是除了表達和擴增本發明的metH基因,還消除不想要的例如減少精細化學品產率的次級反應(Nakayama「胺基酸生產微生物的培養」,微生物產品的過量生產(Overproduction of Microbial Products),Krumphanzl,Sikyta,Vanek(編者),Academic Press,倫敦,英國,1982)。
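The microorganisms produced according to the invention can be cultured continuously or batchwise, in a fed-batch or repeated fed-batch process, to produce sulfur-containing fine chemicals, in particular L-methionine. A review of known culturing methods can be found in the textbook by Chmiel (Bioprozeßtechnik 1. Einführung in die Bioverfahrenstechnik (Gustav Fischer Verlag, Stuttgart, 1991)) or in the textbook by Storhas (Bioreaktoren und periphere Einrichtungen (Vieweg Verlag, Braunschweig/Wiesbaden, 1994)).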
使用的培養基必須以適宜的方式滿足具體菌株的要求。美國細菌學學會的教科書「Manual of Methods for General Bacteriology」(WashingtonD.C.,USA,1981)包含各種微生物培養基的描述。
可根據本發明使用的所述培養基通常含有一種或多種碳源、氮源、無機鹽、維生素和/或微量元素。
優選的碳源為糖,如單糖、二糖或多糖。非常好的碳源的實例為葡萄糖、果糖、甘露糖、半乳糖、核糖、山梨糖、核酮糖、乳糖、麥芽糖、蔗糖、棉籽糖、澱粉和纖維素。也可以通過複雜化合物如糖蜜或糖精練的其他副產物將糖加入培養基。也可能有利的是加入不同碳源的混合物。其他可能的碳源為油和脂肪,例如,大豆油、向日葵油、花生油和椰油,脂肪酸,例如,棕櫚酸、硬脂酸和亞油酸,醇,例如,甘油、甲醇和乙醇,以及有機酸,例如,乙酸和乳酸。
氮源通常為有機或無機氮化合物或含有所述化合物的物質。氮源的實例包括氨氣和銨鹽,如硫酸銨、氯化銨、磷酸銨、碳酸銨和硝酸銨、硝酸鹽、尿素、胺基酸和複雜氮源如玉米漿、大豆粉、大豆蛋白、酵母提取物、肉膏和其他。這些氮源可單獨地或者作為混合物使用。
培養基中可包含的無機鹽化合物包括鈣、鎂、鈉、鈷、鉬、鉀、錳、鋅、銅和鐵的鹽酸鹽、磷酸鹽或硫酸鹽。
含硫無機化合物,例如,硫酸鹽、亞硫酸鹽、連二亞硫酸鹽、連四硫酸鹽、硫代硫酸鹽、硫化物或其他有機硫化合物如硫醇和巰基類化合物可用作硫源以生產含硫精細化學品,尤其是甲硫氨酸。
磷酸、磷酸二氫鉀或磷酸氫二鉀或相應的含鈉鹽可用作磷源。
可向培養基中加入螯合劑以將金屬離子保留在溶液中。尤其適宜的螯合劑包括二羥基酚類,如兒茶酚或原兒茶酚和有機酸如檸檬酸。
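The fermentation media used according to the invention usually also contain other growth factors such as vitamins or growth promoters, which include, for example, biotin, riboflavin, thiamine, folic acid, nicotinic acid, pantothenate and pyridoxine. Growth factors and salts frequently come from complex media components such as yeast extract, molasses, corn steep liquor and the like. Suitable precursors can also be added to the culture medium. The exact composition of the medium depends heavily on the particular experiment and is decided individually for each case. Information on media optimization can be found in the textbook "Applied Microbiol. Physiology, A Practical Approach" (editors P.M. Rhodes, P.F. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0 19 963577 3). Growth media can also be obtained from commercial suppliers, for example Standard 1 (Merck) or BHI (brain heart infusion, DIFCO) and the like.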
所有培養基組分都通過加熱(1.5巴及121℃20分鐘)或者通過無菌過濾除菌。各組分可一起或者,如果需要,分開滅菌。所有培養基組分可以在培養開始時加入或者按需要連續地或者分批加入。
培養溫度通常為15℃到45℃,優選25℃到40℃,並且在實驗期間可保持恆定或者可以變化。培養基的pH應該為5到8.5,優選約7.0。可以在培養過程中通過加入鹼性化合物如氫氧化鈉、氫氧化鉀、氨和氨水或酸性化合物如磷酸或硫酸控制培養的pH。可通過使用消泡劑,例如,脂肪酸聚乙二醇酯控制起泡。為了保持質粒穩定,可以向培養基中加入具有選擇作用的適宜物質,例如抗生素。通過導入氧氣或含有氧氣的氣體混合物,例如,空氣到培養物中保持有氧條件。培養的溫度通常為20到45℃。持續培養直到期望產物的形成最大。該目標通常在10到160小時內實現。
以這種方法得到的發酵液,尤其是含有L-甲硫氨酸的發酵液通常含有按重量計7.5到25%的幹生物量。
此外,有利的是至少在發酵最後,但是尤其是在發酵期的至少30%後在糖限制下進行發酵。這意味著在該時間內發酵培養基中的可利用糖的濃度保持在或者減少到≥0到3g/l。
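The fermentation broth is then processed further. Depending on requirements, the biomass can be removed completely or partially from the fermentation broth by separation methods such as centrifugation, filtration, decanting or a combination of these methods, or can be left completely in the broth.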
然後,使用公知的方法,例如,通過旋轉蒸發器、薄膜蒸發器、降膜蒸發器、反滲透或者通過納過濾將發酵液變稠或濃縮。
然而,也可以進一步純化含硫精細化學品,尤其是L-甲硫氨酸。為此,含有產物的發酵液在除去生物質後使用適宜樹脂進行層析,期望產物或汙染物被完全或部分保留在層析樹脂上。如果需要,可以使用相同或不同的層析樹脂重複這些層析步驟。技術人員熟悉適宜的層析樹脂的選擇和它們最有效的應用方式。純化的產物可通過過濾或超濾濃縮並保存在產物的穩定性最大的溫度。
分離的化合物的身份和純度可通過本領域技術確定。這些技術包括高效液相層析(HPLC)、光譜方法、染色方法、薄層層析、NIRS、酶測定法或微生物學測定法。這些分析方法概述於Patek等(1994)Appl.Environ.Microbiol.60133-140;Malakhova等(1996)Biotekhnologiya 11 27-32;和Schmidt等(1998)Bioprocess Engineer.1967-70;Ulmann’s Encyclopediaof Industrial Chemistry(1996)Bd.A27,VCHWeinheim,pp.89-90.pp.521-540,pp.540-547,pp.559-566,575-581和pp.581-587;Michal,G.,(1999)生物化學途徑生物化學和分子生物學手冊(Biochemical PathwaysAnAtlas of Biochemistry and Molecular Biology),John Wiley and Sons;Fallon,A.等(1987)HPLC在生物化學中的應用,《生物化學和分子生物學實驗技術》(Laboratory Techniques in Biochemistry and MolecularBiology),17卷。
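The following non-limiting examples describe the invention in more detail.
Example 1: Construction of pCLiK5MCS
First, the ampicillin resistance and the origin of replication of the vector pBR322 were amplified by the polymerase chain reaction (PCR) using the oligonucleotides p1.3 (SEQ ID NO 53) and p2.3 (SEQ ID NO 54).
p1.3 (SEQ ID NO 53)
5'-CCCGGGATCCGCTAGCGGCGCGCCGGCCGGCCCGGTGTGAAATACCGCACAG-3'
p2.3 (SEQ ID NO 54)
5'-TCTAGACTCGAGCGGCCGCGGCCGGCCTTTAAATTGAAGACGAAAGGGCCTCG-3'
Apart from the sequences complementary to pBR322, the oligonucleotide p1.3 (SEQ ID NO 53) contains, in the 5'-3' direction, the cleavage sites for the restriction endonucleases SmaI, BamHI, NheI and AscI, and the oligonucleotide p2.3 (SEQ ID NO 54) contains, in the 5'-3' direction, the cleavage sites for the restriction endonucleases XbaI, XhoI, NotI and DraI. The PCR reaction was carried out by a standard method such as that of Innis et al. (PCR Protocols. A Guide to Methods and Applications, Academic Press (1990)) using PfuTurbo polymerase (Stratagene, La Jolla, USA). The resulting DNA fragment of approximately 2.1 kb was purified using the GFX™ PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) according to the manufacturer's instructions. The blunt ends of the DNA fragment were ligated to one another using the Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) according to the manufacturer's instructions, and the ligation mixture was transformed into competent E. coli XL-1 Blue (Stratagene, La Jolla, USA) by standard methods as described in Sambrook et al. (Molecular Cloning. A Laboratory Manual, Cold Spring Harbor (1989)). Plasmid-carrying cells were selected by plating on LB agar (Lennox, 1955, Virology 1:190) containing ampicillin (50 μg/ml).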
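The order of the cleavage sites stated for primer p1.3 can be checked directly against its sequence. The sketch below uses the primer sequence from the text and the standard recognition sequences of the four enzymes; the helper names are hypothetical, and only the first occurrence of each site is reported.

```python
# Standard recognition sequences of the enzymes named for primer p1.3.
RECOGNITION_SITES = {
    "SmaI": "CCCGGG",
    "BamHI": "GGATCC",
    "NheI": "GCTAGC",
    "AscI": "GGCGCGCC",
}

# Primer p1.3 (SEQ ID NO 53), written 5'->3' as in the text.
P1_3 = "CCCGGGATCCGCTAGCGGCGCGCCGGCCGGCCCGGTGTGAAATACCGCACAG"


def first_site_positions(seq: str, sites: dict) -> dict:
    """0-based index of the first occurrence of each site, or -1 if absent."""
    return {enzyme: seq.find(site) for enzyme, site in sites.items()}


def enzymes_in_order(seq: str, sites: dict) -> list:
    """Enzymes whose site occurs in seq, sorted by first position (5'->3')."""
    positions = first_site_positions(seq, sites)
    found = [(pos, enzyme) for enzyme, pos in positions.items() if pos >= 0]
    return [enzyme for pos, enzyme in sorted(found)]


site_positions = first_site_positions(P1_3, RECOGNITION_SITES)
site_order = enzymes_in_order(P1_3, RECOGNITION_SITES)
```

Note that the SmaI and BamHI sites overlap in this primer (they share the bases GG), which a plain substring search handles without special treatment.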
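The plasmid DNA of an individual clone was isolated using the Qiaprep spin miniprep kit (Qiagen, Hilden) according to the manufacturer's instructions and checked by restriction digests. The plasmid obtained in this way is referred to as pCLiK1.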
從用作PCR反應的模板的質粒pWLT1(Liebl等,1992)開始,使用寡核苷酸neo1(SEQ ID NO55)和neo2(SEQ ID NO56)擴增卡那黴素抗性盒。
neo1(SEQ ID NO55)
5』-GAGATCTAGACCCGGGGATCCGCTAGCGGGCTGCTAAAGGAAGCGGA-3』neo2(SEQ ID NO56)5』-GAGAGGCGCGCCGCTAGCGTGGGCGAAGAACTCCAGCA-3』,除了與pWLT1互補的序列,寡核苷酸neo1還以5』-3』方向含有限制性內切核酸酶XbaI、SmaI、BamHI、NheI的切割位點,寡核苷酸neo2(SEQID NO56)以5』-3』方向含有限制性內切核酸酶AscI和NheI的切割位點。根據標準方法如Innis等的方法(PCR Protocols.A Guide to Methods andApplications,Academic Press(1990))使用PfuTurbo聚合酶(Stratagen,LaJolla,USA)進行PCR反應。使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書純化所得大小約為1.3kb的DNA片段。DNA片段用限制性內切核酸酶XbaI和AscI(NewEngland Biolabs,Beverly,USA)切割,之後再次使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書純化。載體pCLiK1同樣用限制性核酸內切酶XbaI和AscI切割並使用鹼性磷酸酶(Roche Diagnostics,Mannheim)根據生產商的使用說明書去磷酸化。在濃度0.8%的瓊脂糖凝膠中電泳後,使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書分離線性化載體(約2.1kb)。使用快速DNA連接試劑盒(RocheDiagnostics,Mannheim)根據生產商的使用說明書將切割的PCR片段連接到載體片段上,並根據標準方法,如在Sambrook等(分子克隆實驗室手冊,冷泉港,(1989))中描述的方法將連接混合物轉化到感受態大腸桿菌XL-1Blue(Stratagene,La Jolla,USA)中,通過塗含有氨苄青黴素(50μg/ml)和卡那黴素(20μg/ml)的LB瓊脂選擇攜帶質粒的細胞(Lennox,1955,Virology,1190)。
各單克隆的質粒DNA使用Qiaprep spin miniprep試劑盒(Qiagen,Hilden)根據生產商的使用說明書進行分離並通過限制性消化檢查。以這種方法得到的質粒被稱為pCLik2。
載體pCLiK2用限制性內切核酸酶DraI(New England Biolabs,Beverly,USA)切割。在濃度0.8%的瓊脂糖凝膠中電泳後,使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書分離到約2.3kb載體片段。該載體片段使用快速DNA連接試劑盒(Roche Diagnostics,Mannheim)根據生產商的使用說明書重新連接,並根據標準方法,如在Sambrook等(分子克隆實驗室手冊,冷泉港,(1989))中描述的方法將連接混合物轉化到感受態大腸桿菌XL-1Blue(Stratagene,La Jolla,USA)中,通過塗含有卡那黴素(20μg/ml)的LB瓊脂選擇攜帶質粒的細胞(Lennox,1955,Virology,1190)。
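The plasmid DNA of an individual clone was isolated using the Qiaprep spin miniprep kit (Qiagen, Hilden) according to the manufacturer's instructions and checked by restriction digests. The plasmid obtained in this way is referred to as pCLiK3.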
從用作PCR反應的模板的質粒pWLQ2(Liebl等,1992)開始,使用寡核苷酸cg1(SEQ ID NO57)和cg2(SEQ ID NO58)擴增pHM1519的複製起點。
cg1(SEQ ID NO57)5』-GAGAGGGCGGCCGCGCAAAGTCCCGCTTCGTGAA-3』cg2(SEQ ID NO58)5』-GAGAGGGCGGCCGCTCAAGTCGGTCAAGCCACGC-3』除了與pWLQ2互補的序列,寡核苷酸cg1(SEQ ID NO57)和cg2(SEQID NO58)還含有限制性核酸酶NotI的切割位點。根據標準方法如Innis等的方法(PCR Protocols.A Guide to Methods and Applications,AcademicPress(1990))使用PfuTurbo聚合酶(Stratagene,La Jolla,USA)進行PCR反應。使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書純化所得大小約為2.7kb的DNA片段。DNA片段用限制性內切核酸酶NotI(New England Biolabs,Beverly,USA)切割,之後再次使用GFXTMPCR、DNA和凝膠帶純化試劑盒(AmershamPharmacia,Freiburg)根據生產商的使用說明書純化。載體pCLiK3同樣用限制性核酸內切酶NotI切割並使用鹼性磷酸酶(Roche Diagnostics,Mannheim)根據生產商的使用說明書進行去磷酸化。在濃度0.8%的瓊脂糖凝膠中電泳後,使用GFXTMPCR、DNA和凝膠帶純化試劑盒(AmershamPharmacia,Freiburg)根據生產商的使用說明書分離線性化載體(約2.3kb)。使用快速DNA連接試劑盒(Roche Diagnostics,Mannheim)根據生產商的使用說明書將切割的PCR片段連接到此載體片段上,並根據標準方法,如在Sambrook等(分子克隆實驗室手冊,冷泉港,(1989))中描述的方法將連接混合物轉化到感受態大腸桿菌XL-1Blue(Stratagene,La Jolla,USA)中,通過塗含有卡那黴素(20μg/ml)的LB瓊脂選擇攜帶質粒的細胞(Lennox,1955,Virology,1190)。
單克隆的質粒DNA使用Qiaprep spin miniprep試劑盒(Qiagen,Hilden)根據生產商的使用說明書進行分離並通過限制性消化檢查。以這種方法得到的質粒被稱為pCLik5。
通過混合兩條合成的基本互補的含有限制性內切核酸酶SwaI、XhoI、AatI、ApaI、Asp718、MluI、NdeI、SpeI、EcoRV、SalI、ClaI、BamHI、XbaI和SmaI的切割位點的HS445((SEQ ID NO59)和HS446(SEQ IDNO60))寡核苷酸,通過將它們一起加熱到95℃然後緩慢冷卻得到雙鏈DNA片段,從而用多克隆位點(MCS)延伸pCLik5。
HS445(SEQ ID NO59)5′-TCGAATTTAAATCTCGAGAGGCCTGACGTCGGGCCCGGTACCACGCGTCATATGACTAGTTCGGACCTAGGGATATCGTCGACATCGATGCTCTTCTGCGTTAATTAACAATTGGGATCCTCTAGACCCGGGATTTAAAT-3『HS446(SEQ ID NO60)5′-GATCATTTAAATCCCGGGTCTAGAGGATCCCAATTGTTAATTAACGCAGAAGAGCATCGATGTCGACGATATCCCTAGGTCCGAACTAGTCATATGACGCGTGGTACCGGGCCCGACGTCAGGCCTCTCGAGATTTAAAT-3『載體pCLiK5用限制性核酸內切酶XhoI和BamHI(New EnglandBiolabs,Beverly,USA)切割並使用鹼性磷酸酶(Roche Diagnostics,Mannheim)根據生產商的使用說明書進行去磷酸化。在濃度0.8%的瓊脂糖凝膠中電泳後,使用GFXTMPCR、DNA和凝膠帶純化試劑盒(AmershamPharmacia,Freiburg)根據生產商的使用說明書分離線性化載體(約5.0kb)。使用快速DNA連接試劑盒(Roche Diagnostics,Mannheim)根據生產商的使用說明書將合成的雙鏈DNA片段連接到此載體片段上,並根據標準方法,如在Sambrook等(分子克隆實驗室手冊,冷泉港,(1989))中描述的方法將連接混合物轉化到感受態大腸桿菌XL-1Blue(Stratagene,LaJolla,USA)中,通過塗含有卡那黴素(20μg/ml)的LB瓊脂選擇攜帶質粒的細胞(Lennox,1955,Virology,1190)。
單克隆的質粒DNA使用Qiaprep spin miniprep試劑盒(Qiagen,Hilden)根據生產商的使用說明書進行分離並通過限制性消化檢查。以這種方法得到的質粒被稱為pCLik5MCS。
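The fragment sizes quoted in Example 1 can be cross-checked with simple bookkeeping: the pCLiK3 backbone of about 2.3 kb plus the pHM1519 origin fragment of about 2.7 kb should give the roughly 5.0 kb linearized pCLiK5 vector mentioned in the text. The sketch below only adds the approximate sizes from the text; the variable and function names are hypothetical, and the pCLiK2 size is a derived estimate that the text does not state.

```python
def ligated_size_kb(*fragment_sizes_kb: float) -> float:
    """Approximate size of a construct made by ligating the given fragments."""
    return sum(fragment_sizes_kb)


# Approximate sizes quoted in the text (kb).
pclik1_kb = 2.1             # amplified pBR322 resistance/origin fragment
kan_cassette_kb = 1.3       # kanamycin resistance cassette from pWLT1
pclik3_backbone_kb = 2.3    # DraI vector fragment of pCLiK2, religated
phm1519_origin_kb = 2.7     # origin of replication amplified from pWLQ2

pclik2_estimate_kb = ligated_size_kb(pclik1_kb, kan_cassette_kb)   # derived, not stated
pclik5_estimate_kb = ligated_size_kb(pclik3_backbone_kb, phm1519_origin_kb)
```

The estimate for pCLiK5 agrees with the approximately 5.0 kb linearized vector isolated before insertion of the multiple cloning site.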
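Sequencing reactions were carried out according to Sanger et al. (1977) Proceedings of the National Academy of Sciences USA 74:5463-5467. The sequencing reactions were fractionated and analyzed using an ABI Prism 377 (PE Applied Biosystems, Weiterstadt).
The resulting plasmid pCLiK5MCS is listed as SEQ ID NO 63.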
實施例2pCLik5MCS integrative sacB的構建從作為PCR反應模板的質粒pK 19mob(Sch_fer等,Gene 145,69-73(1994))開始,使用寡核苷酸BK1732和BK1733擴增枯草芽孢桿菌sacB基因(編碼果聚糖蔗糖酶)。
BK1732(SEQ ID NO61)5』-GAGAGCGGCCGCCGATCCTTTTTAACCCATCAC-3』BK1733(SEQ ID NO62)5』-AGGAGCGGCCGCCATCGGCATTTTCTTTTGCG-3』除了與pEK19mobsac互補的序列,寡核苷酸BK1732和BK1733還含有限制性核酸內切酶NotI的切割位點。根據標準方法如Innis等的方法(PCR Protocols. A Guide to Methods and Applications,AcademicPress(1990))使用PfuTurbo聚合酶(Stratagen,La Jolla,USA)進行PCR反應。使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書純化所得大小約為1.9kb的DNA片段。DNA片段用限制性內切核酸酶NotI(New England Biolabs,Beverly,USA)切割,之後再次使用GFXTMPCR、DNA和凝膠帶純化試劑盒(AmershamPharmacia,Freiburg)根據生產商的使用說明書純化。
載體pCLiK5MCS(根據實施例1製備)同樣用限制性核酸內切酶NotI切割並使用鹼性磷酸酶(Roche Diagnostics,Mannheim)根據生產商的使用說明書進行去磷酸化。在濃度0.8%的瓊脂糖凝膠中電泳後,使用GFXTMPCR、DNA和凝膠帶純化試劑盒(Amersham Pharmacia,Freiburg)根據生產商的使用說明書分離大小約2.4kb的載體片段。使用快速DNA連接試劑盒(Roche Diagnostics,Mannheim)根據生產商的使用說明書將切割的PCR片段連接到此載體片段上,並根據標準方法,如在Sambrook等(分子克隆實驗室手冊,冷泉港,(1989))中描述的方法將連接混合物轉化到感受態大腸桿菌XL-1Blue(Stratgagene,La Jolla,USA)中,通過塗含有卡那黴素(20μg/ml)的LB瓊脂選擇攜帶質粒的細胞(Lennox,1955,Virology,1190)。
單克隆的質粒DNA使用Qiaprep spin miniprep試劑盒(Qiagen,Hilden)根據生產商的使用說明書進行分離並通過限制性消化檢查。以這種方法得到的質粒被稱為pCLik5MCS integrative sacB。
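Sequencing reactions were carried out according to Sanger et al. (1977) Proceedings of the National Academy of Sciences USA 74:5463-5467. The sequencing reactions were fractionated and analyzed using an ABI Prism 377 (PE Applied Biosystems, Weiterstadt).
The resulting plasmid pCLiK5MCS integrative sacB is listed as SEQ ID NO 64.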
可以以類似方法製備適於metH基因的本發明表達或過量生產的其它載體。
下面的實施例3到8描述了被稱為LU 1479 lysC 311ile ET-16 pCPhsdh metH Sc的改進的甲硫氨酸生產菌株的逐步構建。
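Example 3: Isolation of the lysC gene from C. glutamicum strain LU1479
In the first step of strain construction, an allelic exchange of the lysC wild-type gene coding for the enzyme aspartate kinase is carried out in Corynebacterium glutamicum ATCC 13032, referred to below as LU1479. A nucleotide exchange is made in the lysC gene so that the amino acid Thr at position 311 of the resulting protein is replaced by the amino acid Ile.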
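Starting from the chromosomal DNA of LU1479 as template for a PCR reaction, lysC was amplified with the oligonucleotide primers SEQ ID NO 65 and SEQ ID NO 66 using the Pfu-Turbo PCR System (Stratagene, USA) according to the manufacturer's instructions. Chromosomal DNA from C. glutamicum ATCC 13032 was prepared according to Tauch et al. (1995) Plasmid 33:168-179 or Eikmanns et al. (1994) Microbiology 140:1817-1828. The amplified fragment is flanked at its 5' end by a SalI restriction site and at its 3' end by a MluI restriction site. Before cloning, the amplified fragment was digested with these two restriction enzymes and purified using the GFX™ PCR, DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg).
SEQ ID NO 65
5'-GAGAGAGAGACGCGTCCCAGTGGCTGAGACGCATC-3'
SEQ ID NO 66
5'-CTCTCTCTGTCGACGAATTCAATCTTACGGCCTG-3'
The resulting polynucleotide was cleaved via the SalI and MluI sites, cloned into pCLiK5MCS integrative sacB (referred to below as pCIS; SEQ ID NO 64 of Example 2) and transformed into E. coli XL-1 Blue. Plasmid-carrying cells were selected by plating on LB agar (Lennox, 1955, Virology 1:190) containing kanamycin (20 μg/ml). The plasmid was isolated and the expected nucleotide sequence was confirmed by sequencing. Plasmid DNA was prepared using materials and methods from Qiagen.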
根據Sanger等(1977)Proceedings of the National Acedemy of ScienceUSA 745463-5467進行測序反應。將測序反應物通過ABI Prism 377(PEApplied Biosystems,Weiterstadt)分離並評價。所得質粒pCIS lysC被列為SEQ ID NO77。
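The sequence SEQ ID NO 77 comprises the following essential regions:

1) coding sequence
2) on the complementary strand

Example 4: Mutagenesis of the C. glutamicum lysC gene
Directed mutagenesis of the C. glutamicum lysC gene (Example 3) was carried out using the QuickChange kit (Stratagene, USA) according to the manufacturer's instructions. The mutagenesis was performed in the plasmid pCIS lysC, SEQ ID NO 77. To exchange Thr311 for Ile311 by the QuickChange method (Stratagene), the following oligonucleotide primers were synthesized:
SEQ ID NO 67
5'-CGGCACCACCGACATCATCTTCACCTGCCCTCGTTCCG-3'
SEQ ID NO 68
5'-CGGAACGAGGGCAGGTGAAGATGATGTCGGTGGTGCCG-3'
Using these oligonucleotides in the QuickChange reaction leads to an exchange of the nucleotide at position 932 of the lysC gene (C replaced by T; see SEQ ID NO 75) and to the amino acid substitution Thr to Ile at position 311 of the corresponding enzyme (see SEQ ID NO 76). After transformation into E. coli XL-1 Blue and plasmid preparation, the resulting amino acid exchange Thr311Ile in the lysC gene was confirmed by sequencing.
The plasmid was named pCIS lysC thr311ile and is listed as SEQ ID NO 78.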
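The relation between nucleotide position 932 and amino acid position 311 follows from the triplet code and can be sketched as below. The codon table is a small subset of the standard genetic code; the wild-type codon ACC is an assumption chosen for illustration, since the sequence listing is not reproduced here, and the function names are hypothetical.

```python
# Subset of the standard genetic code relevant to a Thr -> Ile exchange.
CODON_TABLE_SUBSET = {
    "ACT": "Thr", "ACC": "Thr", "ACA": "Thr", "ACG": "Thr",
    "ATT": "Ile", "ATC": "Ile", "ATA": "Ile", "ATG": "Met",
}


def codon_location(nt_position: int) -> tuple:
    """Map a 1-based CDS nucleotide position to (codon number, position in codon)."""
    if nt_position < 1:
        raise ValueError("nucleotide positions are 1-based")
    return (nt_position - 1) // 3 + 1, (nt_position - 1) % 3 + 1


def substitute(codon: str, position_in_codon: int, new_base: str) -> str:
    """Return the codon with the base at the 1-based position replaced."""
    i = position_in_codon - 1
    return codon[:i] + new_base + codon[i + 1:]


codon_number, position_in_codon = codon_location(932)
wild_type_codon = "ACC"   # assumed for illustration only
mutant_codon = substitute(wild_type_codon, position_in_codon, "T")
```

Position 932 is the second base of codon 311, and a C to T exchange at the second base turns a threonine codon of the form ACN into an ATN codon; for N = T, C or A this encodes isoleucine, consistent with the Thr311Ile substitution described in the text.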
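The sequence SEQ ID NO 78 comprises the following essential regions:

1) coding sequence
2) on the complementary strand

The plasmid pCIS lysC thr311ile was transformed into C. glutamicum LU1479 as described by Liebl et al. (1989) FEMS Microbiology Letters 53:299-303. Modifications of the protocol are described in DE-A-10046870. The chromosomal arrangement of the lysC locus of individual transformants was checked by Southern blotting and hybridization using standard methods as described in Sambrook et al. (1989), Molecular Cloning. A Laboratory Manual, Cold Spring Harbor. This ensured that the transformants were those that had integrated the transformed plasmid at the lysC locus by homologous recombination. These colonies were grown overnight in medium without antibiotic, and the cells were then plated on sucrose CM agar medium (10% sucrose) and incubated at 30°C for 24 hours.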
由於sacB基因(其存在於載體pCIS lysC thr311ile中)將蔗糖轉化成毒性產物,因此只有通過野生型lysC基因和突變基因lysC thr311ile之間的第二同源重組步驟缺失掉sacB基因的那些菌落才能夠建立生長。在此同源重組步驟過程,野生型基因或突變基因可以與sacB基因一起缺失。如果sacB基因與野生型基因一起被除去,那麼導致突變的轉化體。
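Colonies with established growth were picked and tested for a kanamycin-sensitive phenotype. Colonies that have lost the sacB gene must at the same time show kanamycin-sensitive growth. Such kanamycin-sensitive clones were tested for their lysine productivity in shake flasks (see Example 6). For comparison, the untreated strain LU1479 was grown. Clones whose lysine production exceeded that of the control were selected, their chromosomal DNA was obtained, and the corresponding region of the lysC gene was amplified by PCR and sequenced. A clone of this kind, with increased lysine synthesis and a confirmed mutation at position 932 of lysC, is referred to as LU1479 lysC 311ile.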
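Example 5: Generation of ethionine-resistant C. glutamicum strains
In the second step of strain construction, the resulting strain LU1479 lysC 311ile (Example 4) was treated to induce ethionine resistance (Kase, H., Nakayama, K., Agr. Biol. Chem. 39, 153-106, 1975, L-methionine production by methionine analog-resistant mutants of Corynebacterium glutamicum). An overnight culture in BHI medium (Difco) was washed with citrate buffer (50 mM, pH 5.5) and treated with N-methylnitrosoguanidine (10 mg/ml in 50 mM citrate, pH 5.5) at 30°C for 20 minutes. After treatment with the chemical mutagen N-methylnitrosoguanidine, the cells were washed (citrate buffer, 50 mM, pH 5.5) and plated on a medium with the following composition, based on 500 ml: 10 g (NH4)2SO4, 0.5 g KH2PO4, 0.5 g K2HPO4, 0.125 g MgSO4·7H2O, 21 g MOPS, 50 mg CaCl2, 15 mg protocatechuate, 0.5 mg biotin, 1 mg thiamine, 5 g/l D,L-ethionine (Sigma Chemicals, Germany), pH 7.0. In addition, the medium contained 0.5 ml of a trace salt solution of 10 g/l FeSO4·7H2O, 1 g/l MnSO4·H2O, 0.1 g/l ZnSO4·7H2O, 0.02 g/l CuSO4 and 0.002 g/l NiCl2·6H2O, all salts dissolved in 0.1 M HCl. The finished medium was sterilized by filtration, 40 ml of sterile 50% glucose solution were added, liquid sterile agar was added to a final concentration of 1.5%, and the mixture was poured into Petri dishes.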
將已經經歷誘變處理的細胞塗布於含有上述培養基的平板中並在30℃下孵育3-7天。分離所得克隆,在選擇培養基上分離至少一次然後在搖瓶中培養基II中檢驗它們的甲硫氨酸生產力(見實施例6)。
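Example 6: Production of methionine using strain LU1479 lysC 311ile ET-16
The strains generated in Example 5 were grown on agar plates containing CM medium at 30°C for 2 days.
CM agar: 10.0 g/l D-glucose, 2.5 g/l NaCl, 2.0 g/l urea, 10.0 g/l Bacto peptone (Difco), 5.0 g/l yeast extract (Difco), 5.0 g/l beef extract (Difco), 22.0 g/l agar (Difco), autoclaved (20 min, 121°C).
The cells were then scraped from the plates and resuspended in saline. For the main culture, 10 ml of medium II and 0.5 g of autoclaved CaCO3 (Riedel de Haen) in a 100 ml Erlenmeyer flask were inoculated with the cell suspension to an OD600nm of 1.5 and incubated on an orbital shaker at 200 rpm and 30°C for 72 hours.
Medium II:
40 g/l sucrose
60 g/l molasses (based on 100% sugar content)
10 g/l (NH4)2SO4
0.4 g/l MgSO4·7H2O
0.6 g/l KH2PO4
0.3 mg/l thiamine·HCl
1 mg/l biotin (from a 1 mg/ml sterile-filtered stock solution adjusted to pH 8.0 with NH4OH)
2 mg/l FeSO4
2 mg/l MnSO4
The medium was adjusted to pH 7.8 with NH4OH and autoclaved (121°C, 20 min). In addition, vitamin B12 (hydroxycobalamin, Sigma Chemicals) from a stock solution (200 μg/ml, sterile-filtered) was added to a final concentration of 100 μg/l.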
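The stock additions for the 10 ml shake-flask cultures follow from C1·V1 = C2·V2. The sketch below computes the stock volumes for vitamin B12 and biotin from the concentrations given in the text; the function name is hypothetical, and the 10 ml culture volume is taken from the main-culture description.

```python
def stock_volume_ul(final_conc_ug_per_l: float, culture_volume_ml: float,
                    stock_conc_ug_per_ml: float) -> float:
    """Microlitres of stock needed to reach the final concentration (C1*V1 = C2*V2)."""
    required_mass_ug = final_conc_ug_per_l * culture_volume_ml / 1000.0
    return required_mass_ug / stock_conc_ug_per_ml * 1000.0


CULTURE_VOLUME_ML = 10.0  # medium II per flask, as stated in the text

# Vitamin B12: 200 ug/ml stock, 100 ug/l final concentration.
b12_ul = stock_volume_ul(100.0, CULTURE_VOLUME_ML, 200.0)

# Biotin: 1 mg/ml (1000 ug/ml) stock, 1 mg/l (1000 ug/l) final concentration.
biotin_ul = stock_volume_ul(1000.0, CULTURE_VOLUME_ML, 1000.0)
```

Under these figures a 10 ml culture needs about 5 μl of the vitamin B12 stock and about 10 μl of the biotin stock; in practice such small volumes are usually added to a larger batch of medium before dispensing.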
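The methionine and other amino acids formed in the culture broth were determined on an Agilent 1100 Series LC System HPLC using Agilent's amino acid determination method. Pre-column derivatization with ortho-phthalaldehyde allowed the amino acids formed to be quantified. The amino acid mixture was separated on a Hypersil AA column (Agilent).
Clones whose methionine productivity was at least twice that of the starting strain LU1479 lysC 311ile were isolated. One such clone was used in the subsequent experiments and designated LU1479 lysC 311ile ET-16.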
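Example 7: Cloning of metH from Streptomyces coelicolor and cloning into the plasmid pC Phsdh metH Sc

a) Chromosomal DNA was isolated from Streptomyces coelicolor strain ATCC BAA-471 (from the American Type Culture Collection (ATCC), Atlanta, USA, available under catalog number BAA-471D). Chromosomal DNA of C. glutamicum ATCC 13032 was prepared by the method of Tauch et al. (1995) Plasmid 33:168-179 or Eikmanns et al. (1994) Microbiology 140:1817-1828.
A DNA fragment of about 180 base pairs was amplified from the non-coding 5' region (promoter region) of homoserine dehydrogenase (HsDH) by the polymerase chain reaction (PCR) using standard methods (e.g. Innis et al. (1990) PCR Protocols: A Guide to Methods and Applications, Academic Press), with C. glutamicum DNA as template, Pfu Turbo polymerase (Stratagene), and the oligonucleotide primers SEQ ID NO: 69 and SEQ ID NO: 70. The amplified fragment is flanked at its 5' end by an XhoI restriction site and at its 3' end by a region, introduced via the oligo, that is homologous to Streptomyces coelicolor metH.
SEQ ID NO: 69
5'-GAGACTCGAGGGAAGGTGAATCGAATTTCGG-3'
and SEQ ID NO: 70
5'-GTCCCGGGGAGAACGCACGATTCTCCAAAAATAATCGC-3'
The resulting DNA fragment was purified using the GFX™ PCR DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) according to the manufacturer's instructions.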
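b) Starting from Streptomyces coelicolor chromosomal DNA as template for a PCR reaction, a metH fragment was amplified with the oligonucleotide primers SEQ ID NO: 71 and SEQ ID NO: 72 using the GC-RICH PCR System (Roche Diagnostics, Mannheim) according to the manufacturer's instructions. The amplified fragment is flanked at its 5' end by a region, introduced via the oligo, that is homologous to the C. glutamicum HsDH promoter region.
SEQ ID NO: 71
5'-GAATCGTGCGTTCTCCCCGGGAC-3'
and SEQ ID NO: 72
5'-GTAGTTGACCGAGTTGATCACC-3'
The resulting DNA fragment of about 1.4 kb was purified using the GFX™ PCR DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) according to the manufacturer's instructions.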
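c) In a further PCR reaction, the two fragments obtained above were used together as template. Owing to the regions introduced with the oligonucleotide primers SEQ ID NO: 71 and SEQ ID NO: 72 (each homologous to the respective other fragment), the two fragments anneal to one another during the PCR reaction and are extended by the polymerase to form a continuous DNA strand. The standard method was modified so that the oligonucleotide primers used, SEQ ID NO: 69 and SEQ ID NO: 72, were added to the reaction mixture only in the second cycle.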
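The amplified DNA fragment of about 1.6 kb was purified using the GFX™ PCR DNA and Gel Band Purification Kit according to the manufacturer's instructions. It was then cut with the restriction enzymes XhoI and NotI (Roche Diagnostics, Mannheim) and separated by gel electrophoresis. The approximately 1.6 kb DNA fragment was subsequently isolated from the agarose using the GFX™ PCR DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg).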
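The fusion in step c) relies on the promoter fragment and the metH fragment sharing a complementary end. A minimal sketch of how such an overlap can be checked from primer sequences is given below. The primer strings are copied from SEQ ID NO: 70 and SEQ ID NO: 71 as listed in this example; the helper names `reverse_complement` and `overlap_length` are hypothetical and are not part of the described method.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def overlap_length(primer_a: str, primer_b: str) -> int:
    """Length of the longest 5' stretch of primer_a that can base-pair
    with the 5' stretch of primer_b of the same length (0 if none)."""
    a, b = primer_a.upper(), primer_b.upper()
    for n in range(min(len(a), len(b)), 0, -1):
        if a[:n] == reverse_complement(b[:n]):
            return n
    return 0


SEQ_ID_NO_70 = "GTCCCGGGGAGAACGCACGATTCTCCAAAAATAATCGC"  # reverse primer, promoter fragment
SEQ_ID_NO_71 = "GAATCGTGCGTTCTCCCCGGGAC"                 # forward primer, metH fragment

if __name__ == "__main__":
    n = overlap_length(SEQ_ID_NO_70, SEQ_ID_NO_71)
    print(f"Complementary 5' overlap: {n} nt")
    print(reverse_complement(SEQ_ID_NO_71))
```

With the sequences as printed, the first 23 nucleotides of SEQ ID NO: 70 are the exact reverse complement of SEQ ID NO: 71, so the 3' end of the promoter fragment and the 5' end of the metH fragment share a 23 bp region through which they can prime one another.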
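d) The metH 3' region (the part still missing) was amplified from Streptomyces coelicolor chromosomal DNA as template with the oligonucleotide primers SEQ ID NO: 73 and SEQ ID NO: 74 using the GC-RICH PCR System (Roche Diagnostics, Mannheim) according to the manufacturer's instructions. The amplified fragment is flanked at its 3' end by an EcoRV restriction site introduced via the oligo.
SEQ ID NO: 73
5'-CCGGCCTGGAGAAGCTCG-3'
and SEQ ID NO: 74
5'-GAGAGATATCCCTCAGCGGGCGTTGAAG-3'
The resulting DNA fragment of about 2.2 kb was purified using the GFX™ PCR DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg) according to the manufacturer's instructions. It was then cut with the restriction enzymes NotI and EcoRV (Roche Diagnostics, Mannheim) and separated by gel electrophoresis. The approximately 2.2 kb DNA fragment was subsequently isolated from the agarose using the GFX™ PCR DNA and Gel Band Purification Kit (Amersham Pharmacia, Freiburg).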
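The cloning steps above depend on XhoI, NotI and EcoRV sites being present in the primers or in the metH sequence. The following sketch locates those recognition sequences by plain string search. The primer strings are those given in this example and the short metH excerpt is copied from the coding sequence of SEQ ID NO: 1 (around nucleotides 1393-1419); the function name `find_sites` and the dictionary layout are illustrative assumptions only.

```python
# Recognition sequences of the three enzymes used in Example 7.
# All three are palindromic, so searching one strand is sufficient.
SITES = {"XhoI": "CTCGAG", "NotI": "GCGGCCGC", "EcoRV": "GATATC"}

SEQUENCES = {
    "SEQ ID NO: 69": "GAGACTCGAGGGAAGGTGAATCGAATTTCGG",
    "SEQ ID NO: 74": "GAGAGATATCCCTCAGCGGGCGTTGAAG",
    # Excerpt of the metH coding sequence (SEQ ID NO: 1), not a primer.
    "metH excerpt": "GAGAAGCTCGGCGGCCGCGCGGTGATC",
}


def find_sites(seq: str) -> dict:
    """Map enzyme name to the 0-based index of its first site in seq.
    Enzymes whose site does not occur are left out of the result."""
    seq = seq.upper()
    hits = {}
    for enzyme, site in SITES.items():
        pos = seq.find(site)
        if pos != -1:
            hits[enzyme] = pos
    return hits


if __name__ == "__main__":
    for name, seq in SEQUENCES.items():
        print(name, find_sites(seq))
```

On these inputs the search finds the XhoI site in SEQ ID NO: 69 and the EcoRV site in SEQ ID NO: 74, each directly after the four-nucleotide 5' overhang, and a NotI site within the metH excerpt, which is consistent with NotI being used as the internal junction between the two metH fragments.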
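e) The vector pClik5MCS (SEQ ID NO: 63; Example 1) was cut with the restriction enzymes NotI and EcoRV (Roche Diagnostics, Mannheim), and after electrophoretic separation the 5 kb fragment was purified using the GFX™ PCR DNA and Gel Band Purification Kit.
The vector fragment and the two cut and purified PCR fragments were ligated using the Rapid DNA Ligation Kit (Roche Diagnostics, Mannheim) according to the manufacturer's instructions, and the ligation mixture was transformed into competent E. coli XL-1 Blue (Stratagene, La Jolla, USA) as described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, (1989)). Plasmid-carrying cells were selected by plating on LB agar (Lennox, 1955, Virology 1:190) containing kanamycin (20 μg/ml).
Plasmid DNA was prepared using materials and methods from Qiagen. Sequencing reactions were carried out by the method described by Sanger et al. (1977) Proceedings of the National Academy of Sciences USA 74:5463-5467. The sequencing reactions were separated and evaluated on an ABI Prism 377 (PE Applied Biosystems, Weiterstadt).
The resulting plasmid pC Phsdh metH Sc (Streptomyces coelicolor) is listed as SEQ ID NO: 79.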
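SEQ ID NO: 79 contains the following essential parts/regions:

1) coding sequence 2) on the complementary strand

Example 8: Transformation of strain LU1479 lysC 311ile ET-16 with the plasmid pC Phsdh metH Sc

The strain LU1479 lysC 311ile ET-16 (Example 5) was transformed with the plasmid pC Phsdh metH Sc (Example 7) by the method described (Liebl et al. (1989) FEMS Microbiology Letters 53:299-303). The transformation mixture was plated on CM plates supplemented with 20 mg/l kanamycin in order to select for plasmid-containing cells. The resulting kanamycin-resistant clones were picked and isolated. The methionine productivity of the clones was studied in a shake-flask experiment (see Example 6). Compared with LU1479 lysC 311ile ET-16, the strain LU1479 lysC 311ile ET-16 pC Phsdh metH Sc produced markedly more methionine.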
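Sequence Listing
<110> BASF Aktiengesellschaft
<120> Method for the fermentative production of sulfur-containing fine chemicals using Corynebacterium bacteria encoding metH
<130> M/43120
<140>
<141>
<160> 79
<210> 1
<211> 3597
<212> DNA
<213> Streptomyces coelicolor
<220>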
221CDS222(1)..(3594)223RSX142544001gtg cgt tct ccc cgg gac gtc cca cga cgg gcg gca ccg ggc aga ggc 48Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly1 5 10 15aaa gcc gac agc cgt cgc atc cta ggg agc cct ttc atg gcc tcg tcg 96Lys Ala Asp Ser Arg Arg Ile Leu Gly Ser Pro Phe Met Ala Ser Ser20 25 30cca tcc acc ccg ccc gcc gac acc cgc acc cgc gtg tcc gcc ctc cga144Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg35 40 45gag gcc ctc gcc acc cgc gtg gtg gtc gcc gac ggc gcc atg ggc acc192Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr50 55 60atg ctc cag gcc cag aac ccc acg ctg gac gac ttc cag cag ctc gaa240Met Leu Gln Ala Gln Asn Pro Thr Leu Asp Asp Phe Gln Gln Leu Glu65 70 75 80ggg tgc aac gag gtc ctg aac ctc acc cgg ccc gac atc gtc cgc tcg288Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp Ile Val Arg Ser85 90 95gtg cac gag gag tac ttc gcg gcc ggc gtc gac tgc gtc gag acc aac336Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn100 105 110acc ttc ggc gcc aac cac tcc gcc ctg ggc gag tac gac atc ccc gag384Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp Ile Pro Glu115 120 125cgc gtc cac gaa ctg tcc gag gcc ggc gcc cgc gtc gcc cgc gag gtc432Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val130 135 140
gcc gac gag ttc ggc gcc cgc gac ggc cgg cag cgc tgg gtg ctg ggc 480Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gln Arg Trp Val Leu Gly145 150 155 160tcc atg ggc ccc ggc acc aag ctc ccc acc ctc ggc cac gcc ccg tac 528Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr165 170 175acc gtc ctg cgc gac gcc tac cag cgc aac gcc gag gga ctg gtc gcg 576Thr Val Leu Arg Asp Ala Tyr Gln Arg Asn Ala Glu Gly Leu Val Ala180 185 190ggc ggc gcg gac gca ctg ctg gtg gag acc acg cag gac ctg ctc cag 624Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gln Asp Leu Leu Gln195 200 205acc aag gcc tcg gtg ctc ggc gcc cgg cgc gcc ctg gac gtc ctc ggc 672Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly210 215 220ctc gac ctg ccg ctc atc gtg tcc gtc acc gtc gag acc acc ggc acc 720Leu Asp Leu Pro Leu Ile Val Ser Val Thr Val Glu Thr Thr Gly Thr225 230 235 240atg ctg ctc ggc tcg gag atc ggc gcc gcg ctc acc gcg ctg gaa ccg 768Met Leu Leu Gly Ser Glu Ile Gly Ala Ala Leu Thr Ala Leu Glu Pro245 250 255ctc ggc atc gac atg atc ggc ctg aac tgc gcc acc ggc ccc gcc gag 816Leu Gly Ile Asp Met Ile Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu260 265 270atg agc gag cac ctg cgc tac ctc gcc cgg cac tcc cgc atc ccg ctg 864Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg Ile Pro Leu275 280 285acc tgc atg ccc aac gcc ggt ctg ccc gtc ctc ggc aag gac ggc gcc 912Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala290 295 300cac tac ccg ctg acc gcg ccc gag ctg gcc gac gca cac gag acc ttc 960His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe305 310 315 320gtg cgc gag tac ggc ctg tcc ctg gtc ggc ggc tgc tgc ggc acc acg1008Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr325 330 335ccc gag cac ctg cgc cag gtc gtc gag cgg gtc cgg gac acc gcc ccc1056Pro Glu His Leu Arg Gln Val Val Glu Arg Val Arg Asp Thr Ala Pro340 345 350acc gca cgc gac ccg cgc ccc gag ccc ggc gcc gcc tcg ctc tac cag1104Thr Ala Arg Asp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gln355 360 365acc gtg ccc 
ttc cgc cag gac acc tcc tac ctg gcc atc ggc gag cgc1152Thr Val Pro Phe Arg Gln Asp Thr Ser Tyr Leu Ala Ile Gly Glu Arg370 375 380acc aac gcc aac ggg tcc aag aag ttc cgc gag gcc atg ctg gac ggc1200Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly
385 390 395 400cgc tgg gac gac tgc gtc gag atg gcc cgc gac cag atc cgc gaa ggc1248Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gln Ile Arg Glu Gly405 410 415gcg cac atg ctc gac ctc tgc gtc gac tac gtc ggc cgg gac ggc gtc1296Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val420 425 430gcc gac atg gag gaa ctg gcc ggc cgg ttc gcc acc gcc tcc acg ctg1344Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Ser Thr Leu435 440 445ccg atc gtc ctc gac tcc acc gag gtc gac gtc atc cgg gcc ggc ctg1392Pro Ile Val Leu Asp Ser Thr Glu Val Asp Val Ile Arg Ala Gly Leu450 455 460gag aag ctc ggc ggc cgc gcg gtg atc aac tcg gtc aac tac gag gac1440Glu Lys Leu Gly Gly Arg Ala Val Ile Asn Ser Val Asn Tyr Glu Asp465 470 475 480ggc gcc ggc ccc gag tcc cgg ttc gcc cgc gtc acg aag ctc gcc cgg1488Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg485 490 495gag cac ggc gcc gcg ctg atc gcg ctg acc atc gac gag gtg gga cag1536Glu His Gly Ala Ala Leu Ile Ala Leu Thr Ile Asp Glu Val Gly Gln500 505 510gcc cgc acc gcc gag aag aag gtc gag atc gcc gaa cgg ctc atc gac1584Ala Arg Thr Ala Glu Lys Lys Val Glu Ile Ala Glu Arg Leu Ile Asp515 520 525gac ctc acc ggc aac tgg ggc atc cac gag tcc gac atc ctc gtc gac1632Asp Leu Thr Gly Asn Trp Gly Ile His Glu Ser Asp Ile Leu Val Asp530 535 540tgc ctg acc ttc acc atc tgc acc ggc cag gag gag tcc cgc aag gac1680Cys Leu Thr Phe Thr Ile Cys Thr Gly Gln Glu Glu Ser Arg Lys Asp545 550 555 560ggc ctg gcc acc atc gag ggc atc cgg gaa ctc aag cgg cgc cac ccg1728Gly Leu Ala Thr Ile Glu Gly Ile Arg Glu Leu Lys Arg Arg His Pro565 570 575gac gtg cag acc acg ctc ggc ctg tcg aac atc tcc ttc ggc ctc aac1776Asp Val Gln Thr Thr Leu Gly Leu Ser Asn Ile Ser Phe Gly Leu Asn580 585 590ccg gcc gcc cgc atc ctg ctc aac tcc gtc ttc ctc gac gaa tgc gtc1824Pro Ala Ala Arg Ile Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val595 600 605aag gcc ggc ctg gac tcg gcc atc gtg cac gcg agc aag atc ctg ccg1872Lys Ala Gly Leu Asp Ser Ala Ile Val His Ala Ser Lys Ile Leu Pro610 615 
620atc gcc cgc ttc gac gag gag cag gtc acc acc gcc ctc gac ttg atc1920Ile Ala Arg Phe Asp Glu Glu Gln Val Thr Thr Ala Leu Asp Leu Ile625 630 635 640
tac gac cgc cgc cgc gag ggc tac gac ccc ctg caa aag ctc atg cag1968Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gln Lys Leu Met Gln645 650 655ctc ttc gag ggc gcc acc gcc aag tcg ctg aag gcc tcc aag gcc gag2016Leu Phe Glu Gly Ala Thr Ala Lys Ser Leu Lys Ala Ser Lys Ala Glu660 665 670gaa ctg gcc gcc ctc ccg ctg gag gag cgc ctc aag cgc cgc atc atc2064Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg Ile Ile675 680 685gac ggc gag aag aac ggc ctc gaa cag gac ctc gac gag gcc ctc cgg2112Asp Gly Glu Lys Asn Gly Leu Glu Gln Asp Leu Asp Glu Ala Leu Arg690 695 700gag cgc ccg gcc ctc gag atc gtc aac gac acc ctg ctc gac ggt atg2160Glu Arg Pro Ala Leu Glu Ile Val Asn Asp Thr Leu Leu Asp Gly Met705 710 715 720aag gtc gtc ggc gag ctg ttc ggc tcc ggc cag atg cag ctg ccg ttc2208Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gln Met Gln Leu Pro Phe725 730 735gtg ctc cag tcc gcc gag gtc atg aag acc gcg gtg gcc cac ctg gag2256Val Leu Gln Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu740 745 750ccg cac atg gag aag acc gac gac gac ggc aag ggc acg atc gtg ctg2304Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr Ile Val Leu755 760 765gcc acc gtc cgc ggc gac gtc cac gac atc ggc aag aac ctc gtc gac2352Ala Thr Val Arg Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Asp770 775 780atc atc ctg tcc aac aac ggc tac aac gtc gtc aac ctc ggc atc aag2400Ile Ile Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly Ile Lys785 790 795 800cag ccc gtc tcc gcg atc ctg gaa gcg gcc gac gag cac cgg gcc gac2448Gln Pro Val Ser Ala Ile Leu Glu Ala Ala Asp Glu His Arg Ala Asp805 810 815gtc atc ggc atg tcc ggc ctc ctc gtc aag tcc acg gtg atc atg aag2496Val Ile Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val Ile Met Lys820 825 830gag aac ctg gag gag ctg aac cag cgc aag ctg gcc gcc gac tac ccg2544Glu Asn Leu Glu Glu Leu Asn Gln Arg Lys Leu Ala Ala Asp Tyr Pro835 840 845gtc atc ctc ggc ggc gcc gcc ctc acc agg gcc tac gtc gaa cag gac2592Val Ile Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gln Asp850 855 860ctg cac gag atc 
tac gac ggc gag gtc cgc tac gcc cgc gac gcc ttc2640Leu His Glu Ile Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe865 870 875 880gag ggc ctg cgc ctc atg gac gcc ctc atc ggc atc aag cgc ggc gtg2688Glu Gly Leu Arg Leu Met Asp Ala Leu Ile Gly Ile Lys Arg Gly Val
885 890 895ccc ggc gcc aag ctg ccg gag ctg aag cag cgc cgg gtg cgg gcc gcc2736Pro Gly Ala Lys Leu Pro Glu Leu Lys Gln Arg Arg Val Arg Ala Ala900 905 910acc gtc gag atc gac gag cgc ccc gag gaa ggc cac gtc cgc tcc gac2784Thr Val Glu Ile Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp915 920 925gtc gcc acc gac aac ccg gtc ccg acc ccg ccc ttc cgc ggc acc cgc2832Val Ala Thr Asp Asn Pro Val Pro Thr Pro Pro Phe Arg Gly Thr Arg930 935 940gtc gtc aag ggc atc cag ctc aag gag tac gcc tcc tgg ctc gac gag2880Val Val Lys Gly Ile Gln Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu945 950 955 960ggc gcc ctc ttc aag ggc cag tgg ggc ctc aag cag gcc cgc acc ggc2928Gly Ala Leu Phe Lys Gly Gln Trp Gly Leu Lys Gln Ala Arg Thr Gly965 970 975gag gga ccc tcc tac gag gaa ctg gtc gag tcc gag ggc cgg ccg cgg2976Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg980 985 990ctg cgc ggc ctg ctc gac cgg ctc cag acg gac aac ctt ttg gag gcg3024Leu Arg Gly Leu Leu Asp Arg Leu Gln Thr Asp Asn Leu Leu Glu Ala99510001005gcc gtg gtc tac ggc tac ttc ccc tgc gtc tcc aag gac gac gac ctg3072Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu101010151020atc gtc ctc gac gac gac ggc aac gaa cgc acc cgc ttc acc ttc ccc3120Ile Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro1025 103010351040cgc cag cgc cgc ggc cgg cgc ctg tgc ctg gcc gac ttc ttc cgc ccg3168Arg Gln Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro104510501055gag gag tcc ggc gag acc gac gtg gtc ggc ttc cag gtc gtc acc gtc3216Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gln Val Val Thr Val106010651070ggc tcc cgc atc ggc gag gag acg gcc cgc atg ttc gag gcc aac gcc3264Gly Ser Arg Ile Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala107510801085tac cgc gac tat ctc gag ctg cac ggc ctg tcc gtg cag ctc gcc gag3312Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gln Leu Ala Glu109010951100gcc ctc gcc gag tac tgg cac gcg cgc gtg cgc tcg gaa ctc ggc ttc3360Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe1105 
111011151120gcc ggg gag gac ccg gcc gag atg gag gac atg ttc gcc ctg aag tac3408Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr112511301135
cgg ggt gcc cgc ttc tcc ctc ggc tac ggc gcc tgc ccc gac ctg gag3456Arg Gly Ala Arg Phe Ser Leu Gly Tyr Gly Ala Cys Pro Asp Leu Glu114011451150gac cgc gcc aag atc gcc gcc ctg ctg gag ccc gag cgc atc ggc gtc3504Asp Arg Ala Lys Ile Ala Ala Leu Leu Glu Pro Glu Arg Ile Gly Val115511601165cac cta tcc gag gag ttc cag ctc cac ccc gag cag tcc acc gac gcc3552His Leu Ser Glu Glu Phe Gln Leu His Pro Glu Gln Ser Thr Asp Ala117011751180atc gtc atc cac cac ccg gag gcc aag tac ttc aac gcc cgc3594Ile Val Ile His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg1185 11901195tga359721022111198212PRT 213天藍色鏈黴菌4002Val Arg Ser Pro Arg Asp Val Pro Arg Arg Ala Ala Pro Gly Arg Gly1 5 10 15Lys Ala Asp Ser Arg Arg Ile Leu Gly Ser Pro Phe Met Ala Ser Ser20 25 30Pro Ser Thr Pro Pro Ala Asp Thr Arg Thr Arg Val Ser Ala Leu Arg35 40 45Glu Ala Leu Ala Thr Arg Val Val Val Ala Asp Gly Ala Met Gly Thr50 55 60Met Leu Gln Ala Gln Asn Pro Thr Leu Asp Asp Phe Gln Gln Leu Glu65 70 75 80Gly Cys Asn Glu Val Leu Asn Leu Thr Arg Pro Asp Ile Val Arg Ser85 90 95Val His Glu Glu Tyr Phe Ala Ala Gly Val Asp Cys Val Glu Thr Asn100 105 110Thr Phe Gly Ala Asn His Ser Ala Leu Gly Glu Tyr Asp Ile Pro Glu115 120 125Arg Val His Glu Leu Ser Glu Ala Gly Ala Arg Val Ala Arg Glu Val130 135 140Ala Asp Glu Phe Gly Ala Arg Asp Gly Arg Gln Arg Trp Val Leu Gly145 150 155 160Ser Met Gly Pro Gly Thr Lys Leu Pro Thr Leu Gly His Ala Pro Tyr165 170 175Thr Val Leu Arg Asp Ala Tyr Gln Arg Asn Ala Glu Gly Leu Val Ala180 185 190Gly Gly Ala Asp Ala Leu Leu Val Glu Thr Thr Gln Asp Leu Leu Gln
195 200 205Thr Lys Ala Ser Val Leu Gly Ala Arg Arg Ala Leu Asp Val Leu Gly210 215 220Leu Asp Leu Pro Leu Ile Val Ser Val Thr Val Glu Thr Thr Gly Thr225 230 235 240Met Leu Leu Gly Ser Glu Ile Gly Ala Ala Leu Thr Ala Leu Glu Pro245 250 255Leu Gly Ile Asp Met Ile Gly Leu Asn Cys Ala Thr Gly Pro Ala Glu260 265 270Met Ser Glu His Leu Arg Tyr Leu Ala Arg His Ser Arg Ile Pro Leu275 280 285Thr Cys Met Pro Asn Ala Gly Leu Pro Val Leu Gly Lys Asp Gly Ala290 295 300His Tyr Pro Leu Thr Ala Pro Glu Leu Ala Asp Ala His Glu Thr Phe305 310 315 320Val Arg Glu Tyr Gly Leu Ser Leu Val Gly Gly Cys Cys Gly Thr Thr325 330 335Pro Glu His Leu Arg Gln Val Val Glu Arg Val Arg Asp Thr Ala Pro340 345 350Thr Ala Arg Asp Pro Arg Pro Glu Pro Gly Ala Ala Ser Leu Tyr Gln355 360 365Thr Val Pro Phe Arg Gln Asp Thr Ser Tyr Leu Ala Ile Gly Glu Arg370 375 380Thr Asn Ala Asn Gly Ser Lys Lys Phe Arg Glu Ala Met Leu Asp Gly385 390 395 400Arg Trp Asp Asp Cys Val Glu Met Ala Arg Asp Gln Ile Arg Glu Gly405 410 415Ala His Met Leu Asp Leu Cys Val Asp Tyr Val Gly Arg Asp Gly Val420 425 430Ala Asp Met Glu Glu Leu Ala Gly Arg Phe Ala Thr Ala Ser Thr Leu435 440 445Pro Ile Val Leu Asp Ser Thr Glu Val Asp Val Ile Arg Ala Gly Leu450 455 460Glu Lys Leu Gly Gly Arg Ala Val Ile Asn Ser Val Asn Tyr Glu Asp465 470 475 480Gly Ala Gly Pro Glu Ser Arg Phe Ala Arg Val Thr Lys Leu Ala Arg485 490 495Glu His Gly Ala Ala Leu Ile Ala Leu Thr Ile Asp Glu Val Gly Gln500 505 510Ala Arg Thr Ala Glu Lys Lys Val Glu Ile Ala Glu Arg Leu Ile Asp515 520 525
Asp Leu Thr Gly Asn Trp Gly Ile His Glu Ser Asp Ile Leu Val Asp530 535 540Cys Leu Thr Phe Thr Ile Cys Thr Gly Gln Glu Glu Ser Arg Lys Asp545 550 555 560Gly Leu Ala Thr Ile Glu Gly Ile Arg Glu Leu Lys Arg Arg His Pro565 570 575Asp Val Gln Thr Thr Leu Gly Leu Ser Asn Ile Ser Phe Gly Leu Asn580 585 590Pro Ala Ala Arg Ile Leu Leu Asn Ser Val Phe Leu Asp Glu Cys Val595 600 605Lys Ala Gly Leu Asp Ser Ala Ile Val His Ala Ser Lys Ile Leu Pro610 615 620Ile Ala Arg Phe Asp Glu Glu Gln Val Thr Thr Ala Leu Asp Leu Ile625 630 635 640Tyr Asp Arg Arg Arg Glu Gly Tyr Asp Pro Leu Gln Lys Leu Met Gln645 650 655Leu Phe Glu Gly Ala Thr Ala Lys Ser Leu Lys Ala Ser Lys Ala Glu660 665 670Glu Leu Ala Ala Leu Pro Leu Glu Glu Arg Leu Lys Arg Arg Ile Ile675 680 685Asp Gly Glu Lys Asn Gly Leu Glu Gln Asp Leu Asp Glu Ala Leu Arg690 695 700Glu Arg Pro Ala Leu Glu Ile Val Asn Asp Thr Leu Leu Asp Gly Met705 710 715 720Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gln Met Gln Leu Pro Phe725 730 735Val Leu Gln Ser Ala Glu Val Met Lys Thr Ala Val Ala His Leu Glu740 745 750Pro His Met Glu Lys Thr Asp Asp Asp Gly Lys Gly Thr Ile Val Leu755 760 765Ala Thr Val Arg Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Asp770 775 780Ile Ile Leu Ser Asn Asn Gly Tyr Asn Val Val Asn Leu Gly Ile Lys785 790 795 800Gln Pro Val Ser Ala Ile Leu Glu Ala Ala Asp Glu His Arg Ala Asp805 810 815Val Ile Gly Met Ser Gly Leu Leu Val Lys Ser Thr Val Ile Met Lys820 825 830Glu Asn Leu Glu Glu Leu Asn Gln Arg Lys Leu Ala Ala Asp Tyr Pro835 840 845Val Ile Leu Gly Gly Ala Ala Leu Thr Arg Ala Tyr Val Glu Gln Asp850 855 860
Leu His Glu Ile Tyr Asp Gly Glu Val Arg Tyr Ala Arg Asp Ala Phe865 870 875 880Glu Gly Leu Arg Leu Met Asp Ala Leu Ile Gly Ile Lys Arg Gly Val885 890 895Pro Gly Ala Lys Leu Pro Glu Leu Lys Gln Arg Arg Val Arg Ala Ala900 905 910Thr Val Glu Ile Asp Glu Arg Pro Glu Glu Gly His Val Arg Ser Asp915 920 925Val Ala Thr Asp Asn Pro Val Pro Thr Pro Pro Phe Arg Gly Thr Arg930 935 940Val Val Lys Gly Ile Gln Leu Lys Glu Tyr Ala Ser Trp Leu Asp Glu945 950 955 960Gly Ala Leu Phe Lys Gly Gln Trp Gly Leu Lys Gln Ala Arg Thr Gly965 970 975Glu Gly Pro Ser Tyr Glu Glu Leu Val Glu Ser Glu Gly Arg Pro Arg980 985 990Leu Arg Gly Leu Leu Asp Arg Leu Gln Thr Asp Asn Leu Leu Glu Ala99510001005Ala Val Val Tyr Gly Tyr Phe Pro Cys Val Ser Lys Asp Asp Asp Leu101010151020Ile Val Leu Asp Asp Asp Gly Asn Glu Arg Thr Arg Phe Thr Phe Pro1025 103010351040Arg Gln Arg Arg Gly Arg Arg Leu Cys Leu Ala Asp Phe Phe Arg Pro104510501055Glu Glu Ser Gly Glu Thr Asp Val Val Gly Phe Gln Val Val Thr Val106010651070Gly Ser Arg Ile Gly Glu Glu Thr Ala Arg Met Phe Glu Ala Asn Ala107510801085Tyr Arg Asp Tyr Leu Glu Leu His Gly Leu Ser Val Gln Leu Ala Glu109010951100Ala Leu Ala Glu Tyr Trp His Ala Arg Val Arg Ser Glu Leu Gly Phe1105 111011151120Ala Gly Glu Asp Pro Ala Glu Met Glu Asp Met Phe Ala Leu Lys Tyr112511301135Arg Gly Ala Arg Phe Ser Leu Gly Tyr Gly Ala Cys Pro Asp Leu Glu114011451150Asp Arg Ala Lys Ile Ala Ala Leu Leu Glu Pro Glu Arg Ile Gly Val115511601165His Leu Ser Glu Glu Phe Gln Leu His Pro Glu Gln Ser Thr Asp Ala117011751180Ile Val Ile His His Pro Glu Ala Lys Tyr Phe Asn Ala Arg
1185 1190 1195
<210> 3
<211> 3537
<212> DNA
<213> Anabaena sp.
<220>
221CDS222(1)..(3534)223RAN037904003atg act cat cct ttc ctg aaa cgc ctg cac agt ccg gaa ctt ccg gtt48Met Thr His Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val1 5 10 15atc gtc ttc gac ggt gca atg gga act aac cta caa acc caa aac ctc96Ile Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gln Thr Gln Asn Leu20 25 30acg gct gag gat ttc ggc ggt gtg cag tat gaa ggt tgt aac gaa tac144Thr Ala Glu Asp Phe Gly Gly Val Gln Tyr Glu Gly Cys Asn Glu Tyr35 40 45cta gtc cac acc aaa ccc gaa gct gtc gcc aag gt tcac cgc gac ttt192Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe50 55 60ctc gct gtg ggt gca gat gtc atc gaa acc gac act ttc ggt gcg aca240Leu Ala Val Gly Ala Asp Val Ile Glu Thr Asp Thr Phe Gly Ala Thr65 70 75 80tcc att gtt ttg gcg gaa tat gac tta gca gac caa aca tat tac ctg288Ser Ile Val Leu Ala Glu Tyr Asp Leu Ala Asp Gln Thr Tyr Tyr Leu85 90 95aac aag aaa gcc gcc gaa ctg gcg aaa agt gtc gct gct gaa ttt tcc336Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser100 105 110aca cca gat aaa ccc cgg ttt gtt gct ggt tcc atc ggc ccc aca acc384Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser Ile Gly Pro Thr Thr115 120 125aaa ctt ccc acc ttg gga cat atc gac ttt gac act ctc aaa act tgc432Lys Leu Pro Thr Leu Gly His Ile Asp Phe Asp Thr Leu Lys Thr Cys130 135 140ttt gct gaa caa gca gaa gcg ctg tta gat ggt ggc gtg gat tta ctt480Phe Ala Glu Gln Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu145 150 155 160ttg gtg gag act tgt caa gat gtg ctg caa atc aaa gcg gcg ctg aat528Leu Val Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ala Ala Leu Asn165 170 175ggg ata gaa gaa gtc ttt ggc aag aga ggg gaa cgc ata ccc ttg atg576Gly Ile Glu Glu Val Phe Gly Lys Arg Gly Glu Arg Ile Pro Leu Met180 185 190
gtg tcc gtg aca atg gaa agc atg ggg aca atg ttg gtc ggt tcc gaa624Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu195 200 205atc aac gcc gtc ctg aca att tta gaa cct ttc cca att gac att ctc672Ile Asn Ala Val Leu Thr Ile Leu Glu Pro Phe Pro Ile Asp Ile Leu210 215 220ggt ctg aac tgt gcc aca ggc cca gac ttg atg aaa cca cat att aaa720Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His Ile Lys225 230 235 240tat ttg gct gaa cat tcg ccg ttt gtg gtt tct tgt att cct aac gcg768Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys Ile Pro Asn Ala245 250 255ggt tta cca gaa aac gtt ggt ggt caa gca cat tat cgc tta aca cca816Gly Leu Pro Glu Asn Val Gly Gly Gln Ala His Tyr Arg Leu Thr Pro260 265 270atg gaa tta cgc atg gcg ttg atg cac ttt gtt gaa gat ttg ggt gtc864Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly Val275 280 285caa gtg atc ggg ggt tgc tgt ggg aca cgt cca gaa cac att caa caa912Gln Val Ile Gly Gly Cys Cys Gly Thr Arg Pro Glu His Ile Gln Gln290 295 300tta gca gaa att gcc aag gat tta aag cca aag gtg aga cag cca agt960Leu Ala Glu Ile Ala Lys Asp Leu Lys Pro Lys Val Arg Gln Pro Ser305 310 315 320tta gaa cct gcg gct gca tca ata tat agt act caa ccc tac gaa caa1008Leu Glu Pro Ala Ala Ala Ser Ile Tyr Ser Thr Gln Pro Tyr Glu Gln325 330 335gat aat tct ttc ttg att gtg ggt gaa cgc ctc aac gcc agt ggt tcc1056Asp Asn Ser Phe Leu Ile Val Gly Glu Arg Leu Asn Ala Ser Gly Ser340 345 350aag aaa tgc cgt gat ttg ctg aat gcg gaa gat tgg gac gga ttg gta1104Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val355 360 365tca atg gcg cga tcg caa gtc aag gaa ggc gca cat atc ctt gat gtc1152Ser Met Ala Arg Ser Gln Val Lys Glu Gly Ala His Ile Lau Asp Val370 375 380aac gtt gat tat gtg gga cgg gac ggt gtg cgg gat atg cac gaa cta1200Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu385 390 395 400gtt tcc cgc att gtg aat aat gtt aca ctc ccc tta atg ctc gac tcc1248Val Ser Arg Ile Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser405 410 415acc gaa tgg gaa aag 
atg gag gcg ggt tta aag gtg gct ggt ggt aag1296Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys420 425 430tgt ttg ctg aac tcc acc aac tac gaa gat ggg gaa cca cgt ttc tta1344Cys Leu Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Pro Arg Phe Leu
435 440 445aaa gtg ttg gag ttg gcg aag aaa tat ggc gcg ggt gtt gtt att ggc1392Lys Val Leu Glu Leu Ala Lys Lys Tyr Gly Ala Gly Val Val Ile Gly450 455 460aca att gac gaa gaa ggg atg gcg cgg aca gcc gag aaa aag ttt caa1440Thr Ile Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gln465 470 475 480att gcc cag cgt gcc tat cgt caa tcg gta gaa tat ggg att ccc ccc1488Ile Ala Gln Arg Ala Tyr Arg Gln Ser Val Glu Tyr Gly Ile Pro Pro485 490 495aca gaa ata ttc ttt gat acc tta gct tta cea att tct acc ggg att1536Thr Glu Ile Phe Phe Asp Thr Leu Ala Leu Pro Ile Ser Thr Gly Ile500 505 510gaa gaa gac cgg gaa aat ggc aag gcg aca att gaa tca att agc cgt1584Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Ile Glu Ser Ile Ser Arg515 520 525atc cgt aaa gaa ttg cca ggg tgt cat gtt att tta ggc gtg tca aat1632Ile Arg Lys Glu Leu Pro Gly Cys His Val Ile Leu Gly Val Ser Asn530 535 540ata tcc ttt ggc tta aat tca gcc tcg cgg atg gtc tta aac tcc gtg1680Ile Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val545 550 555 560ttt ctc cat gaa gca atg act gct ggc atg gat gcg gcg atc gtc agt1728Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala Ile Val Ser565 570 575gct agc aag att cta cca ctg tcg aag att gaa gag cgt cat caa gaa1776Ala Ser Lys Ile Leu Pro Leu Ser Lys Ile Glu Glu Arg His Gln Glu580 585 590gtc tgc cgc cag tta att tat gac cag cgt aaa ttt gag ggt gat atc1824Val Cys Arg Gln Leu Ile Tyr Asp Gln Arg Lys Phe Glu Gly Asp Ile595 600 605tgc atc tat gac ccc tta aca gaa cta act aaa ttg ttt gag gga gtc1872Cys Ile Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val610 615 620acc acc aaa cgt aac aaa ggc gtt gat gaa agc tta ccc atc gaa gaa1920Thr Thr Lys Arg Asn Lys Gly Val Asp Glu Ser Leu Pro Ile Glu Glu625 630 635 640cga ctc aag cgt cac att atc gac ggc gaa cgc att ggt tta gaa gcg1968Arg Leu Lys Arg His Ile Ile Asp Gly Glu Arg Ile Gly Leu Glu Ala645 650 655caa ctg aca aaa gcc tta gaa caa tat cca ccc cta gaa att atc aac2016Gln Leu Thr Lys Ala Leu Glu Gln Tyr Pro Pro Leu Glu Ile Ile Asn660 665 
670act ttc cta cta gat ggg atg aaa gta gtc ggg gaa ttg ttc ggt tca2064Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser675 680 685
gga caa atg cag cta cct ttc gtt tta cag tca gcc gaa acc atg aaa2112Gly Gln Met Gln Leu Pro Phe Val Leu Gln Ser Ala Glu Thr Met Lys690 695 700gcg gcg gta gcc tac cta gaa ccg ttc atg gaa aaa tcg gaa agt ggc2160Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly705 710 715 720aac aat gcc aaa ggt aaa gta att att gcc acc gtg aaa ggc gat gtt2208Asn Asn Ala Lys Gly Lys Val Ile Ile Ala Thr Val Lys Gly Asp Val725 730 735cac gac att ggt aaa aac cta gta gac att atc ttg tcc aac aac ggc2256His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly740 745 750tac aag gta att aac ctg gga att aaa cag ccg gtg gaa aat atc atc2304Tyr Lys Val Ile Asn Leu Gly Ile Lys Gln Pro Val Glu Asn Ile Ile755 760 765gag gct tac aac caa cac aaa gct gat tgt att gcc atg agt ggc ttg2352Glu Ala Tyr Asn Gln His Lys Ala Asp Cys Ile Ala Met Ser Gly Leu770 775 780ctg gta aaa tcc acc gca ttc atg aaa gaa aat ttg gag gtc ttc aac2400Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn785 790 795 800gaa aaa ggc att aat gtt cct gta att tta ggt ggt gcg gca tta acc2448Glu Lys Gly Ile Asn Val Pro Val Ile Leu Gly Gly Ala Ala Leu Thr805 810 815ccg aaa ttc gtg cat aaa gat tgc caa aat acc tac aaa ggt aaa gtc2496Pro Lys Phe Val His Lys Asp Cys Gln Asn Thr Tyr Lys Gly Lys Val820 825 830att tat ggc aaa gat gct ttc tca gac ctg cat ttc atg gat aaa tta2544Ile Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu835 840 845atg cca gcc aaa gcc act ggc aaa tgg gac aat tcc tta gga ttc ttg2592Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu850 855 860gat gaa gta gaa acc gag gaa aca gaa cct acc aat cac aaa tcc cca2640Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro865 870 875 880atc ccc agt ccc caa tcc cca gtc ccc agt ccc cag tcc cca gtc cct2688Ile Pro Ser Pro Gln Ser Pro Val Pro Ser Pro Gln Ser Pro Val Pro885 890 895ata gac acc cga cgt tcc gaa gct gta gcc ata gac att ccc cgt ccc2736Ile Asp Thr Arg Arg Ser Glu Ala Val Ala Ile Asp Ile Pro Arg Pro900 905 910aca cca cca 
ttc tgg gga acg caa tta tta cag cct agc gat att tcc2784Thr Pro Pro Phe Trp Gly Thr Gln Leu Leu Gln Pro Ser Asp Ile Ser915 920 925tta gag gaa ata ttc tgg cac atg gat ttg caa gcc ttg att gcg gga2832Leu Glu Glu Ile Phe Trp His Met Asp Leu Gln Ala Leu Ile Ala Gly930 935 940
caa tgg caa ttc cgc aaa ccc aaa gaa caa tca aag gaa gaa tat caa2880Gln Trp Gln Phe Arg Lys Pro Lys Glu Gln Ser Lys Glu Glu Tyr Gln945 950 955 960gct ttc ttg aat gag aaa gtg tat cca gtt cta gaa act tgg aaa cag2928Ala Phe Leu Asn Glu Lys Val Tyr Pro Val Leu Glu Thr Trp Lys Gln965 970 975cgc atc att gca gaa aac ttg tta cat ccc cag gta att tat ggg tat2976Arg Ile Ile Ala Glu Asn Leu Leu His Pro Gln Val Ile Tyr Gly Tyr980 985 990ttt cct tgt caa tct gag ggt aat act tta tat gtt tac gaa aca aac3024Phe Pro Cys Gln Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn99510001005agc cca aat gcc aca gaa atc act cag ttt gaa ttc ccc cga caa aag3072Ser Pro Asn Ala Thr Glu Ile Thr Gln Phe Glu Phe Pro Arg Gln Lys101010151020tca tca aaa cga tta tgt att gcc gat ttc ttt gca ccg aaa gat tca3120Ser Ser Lys Arg Leu Cys Ile Ala Asp Phe Phe Ala Pro Lys Asp Ser1025 103010351040gga atc att gat gtc ttc ccc atg cag gcg gtg act gta ggc gaa att3168Gly Ile Ile Asp Val Phe Pro Met Gln Ala Val Thr Val Gly Glu Ile104510501055gct aca gag ttc gcg caa aaa ttg ttt gca aac aat caa tac act gat3216Ala Thr Glu Phe Ala Gln Lys Leu Phe Ala Asn Asn Gln Tyr Thr Asp106010651070tat ctg tat ttt cac ggt ttg gcg gtg caa gta gca gaa gcc ttg gcc3264Tyr Leu Tyr Phe His Gly Leu Ala Val Gln Val Ala Glu Ala Leu Ala107510801085gag tgg aca cac gcc aga atc cgc cgt gag tta ggg ttc ggt gct gaa3312Glu Trp Thr His Ala Arg Ile Arg Arg Glu Leu Gly Phe Gly Ala Glu109010951100gaa ccg gat aat atc cgg gat att ttg gca caa cgc tat cag ggt tcc3360Glu Pro Asp Asn Ile Arg Asp Ile Leu Ala Gln Arg Tyr Gln Gly Ser1105 111011151120cgg tat agt ttt ggc tac cca gct tgt ccc aat att caa gac cag ttt3408Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Ile Gln Asp Gln Phe112511301135aag cag ctg gat ttg ttg gag act agc aga att aac tta tac atg gat3456Lys Gln Leu Asp Leu Leu Glu Thr Ser Arg Ile Asn Leu Tyr Met Asp114011451150gaa agt gag caa ctt tat cca gaa cag tct acg acg gcg att att act3504Glu Ser Glu Gln Leu Tyr Pro Glu Gln Ser Thr Thr Ala Ile Ile 
Thr115511601165tat cac cca gta gct aag tac ttc acc gcg taa3537Tyr His Pro Val Ala Lys Tyr Phe Thr Ala11701175
21042111178212PRT213魚腥藻屬種4004Met Thr His Pro Phe Leu Lys Arg Leu His Ser Pro Glu Leu Pro Val1 5 10 15Ile Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gln Thr Gln Asn Leu20 25 30Thr Ala Glu Asp Phe Gly Gly Val Gln Tyr Glu Gly Cys Asn Glu Tyr35 40 45Leu Val His Thr Lys Pro Glu Ala Val Ala Lys Val His Arg Asp Phe50 55 60Leu Ala Val Gly Ala Asp Val Ile Glu Thr Asp Thr Phe Gly Ala Thr65 70 75 80Ser Ile Val Leu Ala Glu Tyr Asp Leu Ala Asp Gln Thr Tyr Tyr Leu85 90 95Asn Lys Lys Ala Ala Glu Leu Ala Lys Ser Val Ala Ala Glu Phe Ser100 105 110Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Ser Ile Gly Pro Thr Thr115 120 125Lys Leu Pro Thr Leu Gly His Ile Asp Phe Asp Thr Leu Lys Thr Cys130 135 140Phe Ala Glu Gln Ala Glu Ala Leu Leu Asp Gly Gly Val Asp Leu Leu145 150 155 160Leu Val Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ala Ala Leu Asn165 170 175Gly Ile Glu Glu Val Phe Gly Lys Arg Gly Glu Arg Ile Pro Leu Met180 185 190Val Ser Val Thr Met Glu Ser Met Gly Thr Met Leu Val Gly Ser Glu195 200 205Ile Asn Ala Val Leu Thr Ile Leu Glu Pro Phe Pro Ile Asp Ile Leu210 215 220Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Pro His Ile Lys225 230 235 240Tyr Leu Ala Glu His Ser Pro Phe Val Val Ser Cys Ile Pro Asn Ala245 250 255Gly Leu Pro Glu Asn Val Gly Gly Gln Ala His Tyr Arg Leu Thr Pro260 265 270Met Glu Leu Arg Met Ala Leu Met His Phe Val Glu Asp Leu Gly Val275 280 285Gln Val Ile Gly Gly Cys Cys Gly Thr Arg Pro Glu His Ile Gln Gln290 295 300
Leu Ala Glu Ile Ala Lys Asp Leu Lys Pro Lys Val Arg Gln Pro Ser305 310 315 320Leu Glu Pro Ala Ala Ala Ser Ile Tyr Ser Thr Gln Pro Tyr Glu Gln325 330 335Asp Asn Ser Phe Leu Ile Val Gly Glu Arg Leu Asn Ala Ser Gly Ser340 345 350Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Gly Leu Val355 360 365Ser Met Ala Arg Ser Gln Val Lys Glu Gly Ala His Ile Leu Asp Val370 375 380Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met His Glu Leu385 390 395 400Val Ser Arg Ile Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser405 410 415Thr Glu Trp Glu Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys420 425 430Cys Leu Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Pro Arg Phe Leu435 440 445Lys Val Leu Glu Leu Ala Lys Lys Tyr Gly Ala Gly Val Val Ile Gly450 455 460Thr Ile Asp Glu Glu Gly Met Ala Arg Thr Ala Glu Lys Lys Phe Gln465 470 475 480Ile Ala Gln Arg Ala Tyr Arg Gln Ser Val Glu Tyr Gly Ile Pro Pro485 490 495Thr Glu Ile Phe Phe Asp Thr Leu Ala Leu Pro Ile Ser Thr Gly Ile500 505 510Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Ile Glu Ser Ile Ser Arg515 520 525Ile Arg Lys Glu Leu Pro Gly Cys His Val Ile Leu Gly Val Ser Asn530 535 540Ile Ser Phe Gly Leu Asn Ser Ala Ser Arg Met Val Leu Asn Ser Val545 550 555 560Phe Leu His Glu Ala Met Thr Ala Gly Met Asp Ala Ala Ile Val Ser565 570 575Ala Ser Lys Ile Leu Pro Leu Ser Lys Ile Glu Glu Arg His Gln Glu580 585 590Val Cys Arg Gln Leu Ile Tyr Asp Gln Arg Lys Phe Glu Gly Asp Ile595 600 605Cys Ile Tyr Asp Pro Leu Thr Glu Leu Thr Lys Leu Phe Glu Gly Val610 615 620Thr Thr Lys Arg Asn Lys Gly Val Asp Glu Ser Leu Pro Ile Glu Glu625 630 635 640
Arg Leu Lys Arg His Ile Ile Asp Gly Glu Arg Ile Gly Leu Glu Ala645 650 655Gln Leu Thr Lys Ala Leu Glu Gln Tyr Pro Pro Leu Glu Ile Ile Asn660 665 670Thr Phe Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser675 680 685Gly Gln Met Gln Leu Pro Phe Val Leu Gln Ser Ala Glu Thr Met Lys690 695 700Ala Ala Val Ala Tyr Leu Glu Pro Phe Met Glu Lys Ser Glu Ser Gly705 710 715 720Asn Asn Ala Lys Gly Lys Val Ile Ile Ala Thr Val Lys Gly Asp Val725 730 735His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly740 745 750Tyr Lys Val Ile Asn Leu Gly Ile Lys Gln Pro Val Glu Asn Ile Ile755 760 765Glu Ala Tyr Asn Gln His Lys Ala Asp Cys Ile Ala Met Ser Gly Leu770 775 780Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn785 790 795 800Glu Lys Gly Ile Asn Val Pro Val Ile Leu Gly Gly Ala Ala Leu Thr805 810 815Pro Lys Phe Val His Lys Asp Cys Gln Asn Thr Tyr Lys Gly Lys Val820 825 830Ile Tyr Gly Lys Asp Ala Phe Ser Asp Leu His Phe Met Asp Lys Leu835 840 845Met Pro Ala Lys Ala Thr Gly Lys Trp Asp Asn Ser Leu Gly Phe Leu850 855 860Asp Glu Val Glu Thr Glu Glu Thr Glu Pro Thr Asn His Lys Ser Pro865 870 875 880Ile Pro Ser Pro Gln Ser Pro Val Pro Ser Pro Gln Ser Pro Val Pro885 890 895Ile Asp Thr Arg Arg Ser Glu Ala Val Ala Ile Asp Ile Pro Arg Pro900 905 910Thr Pro Pro Phe Trp Gly Thr Gln Leu Leu Gln Pro Ser Asp Ile Ser915 920 925Leu Glu Glu Ile Phe Trp His Met Asp Leu Gln Ala Leu Ile Ala Gly930 935 940Gln Trp Gln Phe Arg Lys Pro Lys Glu Gln Ser Lys Glu Glu Tyr Gln945 950 955 960Ala Phe Leu Asn Glu Lys Val Tyr Pro Val Leu Glu Thr Trp Lys Gln
965 970 975Arg Ile Ile Ala Glu Asn Leu Leu His Pro Gln Val Ile Tyr Gly Tyr980 985 990Phe Pro Cys Gln Ser Glu Gly Asn Thr Leu Tyr Val Tyr Glu Thr Asn99510001005Ser Pro Asn Ala Thr Glu Ile Thr Gln Phe Glu Phe Pro Arg Gln Lys101010151020Ser Ser Lys Arg Leu Cys Ile Ala Asp Phe Phe Ala Pro Lys Asp Ser1025 103010351040Gly Ile Ile Asp Val Phe Pro Met Gln Ala Val Thr Val Gly Glu Ile104510501055Ala Thr Glu Phe Ala Gln Lys Leu Phe Ala Asn Asn Gln Tyr Thr Asp106010651070Tyr Leu Tyr Phe His Gly Leu Ala Val Gln Val Ala Glu Ala Leu Ala107510801085Glu Trp Thr His Ala Arg Ile Arg Arg Glu Leu Gly Phe Gly Ala Glu109010951100Glu Pro Asp Asn Ile Arg Asp Ile Leu Ala Gln Arg Tyr Gln Gly Ser1105 111011151120Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Ile Gln Asp Gln Phe112511301135Lys Gln Leu Asp Leu Leu Glu Thr Ser Arg Ile Asn Leu Tyr Met Asp114011451150Glu Ser Glu Gln Leu Tyr Pro Glu Gln Ser Thr Thr Ala Ile Ile Thr115511601165Tyr His Pro Val Ala Lys Tyr Phe Thr Ala1170117521052113588212DNA213集胞藻屬種(Synechocystis sp.)220
221CDS222(1)..(3585)223RCY359654005atg aaa agt gct ttt tta gac cgt atc cac agt ccc gat cgc ccg gta48Met Lys Ser Ala Phe Leu Asp Arg Ile His Ser Pro Asp Arg Pro Val1 5 10 15tta gtc ttt gac ggg gct atg ggt aca aac ctg cag gta cag aac cta96Leu Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gln Val Gln Asn Leu20 25 30acg gcg gcg gat ttt ggt ggg gcg gaa tac gaa ggt tgc aat gaa tat144
Thr Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr35 40 45tta gtc cat acc aag cca gag gcc gtg gct acg gtg cat cgt gct ttt192Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe50 55 60tac gaa gcg ggg gcc gat gtc gtg gaa acg gat act ttt ggg gga acg240Tyr Glu Ala Gly Ala Asp Val Val Glu Thr Asp Thr Phe Gly Gly Thr65 70 75 80ccc ctg gtg ctg gcg gag tac gat tta gca gac caa agt tat tac tta288Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gln Ser Tyr Tyr Leu85 90 95aat aaa gca gcg gcg gag ttg gcc aag gcg gta gca gcg gaa ttt tct336Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser100 105 110acc cca gaa aag cct cga ttc gtg gcc ggc tcc atg gga cca ggc acc384Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr115 120 125aag cta ccc acc cta ggt cat gtg gac tac gat agt ctc aag gat gcc432Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys Asp Ala130 135 140tat gtg gtt cag gtg cgg ggt tta tac gat ggc gga gtg gat tta ttg480Tyr Val Val Gln Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu145 150 155 160cta gtg gaa acc tgc cag gat gtg ctg caa att aaa gcg gcc ttg aac528Leu Val Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ala Ala Leu Asn165 170 175gcc att gaa cag gtc ttt gcc gaa aaa ggc gat cgc cta ccg ttg atg576Ala Ile Glu Gln Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met180 185 190gtg tca gta acc atg gaa acc atg ggg acc atg ctg gtg ggt acg gag624Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu195 200 205atg gcg gcg gcc ctg gcc att ttg gag ccc tat ccc atc gat att ttg672Met Ala Ala Ala Leu Ala Ile Leu Glu Pro Tyr Pro Ile Asp Ile Leu210 215 220ggg cta aac tgc gcc acc ggg cca gat ttg atg aag gaa cac gtt aaa720Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys225 230 235 240tat ctt tcc gaa cat tcc ccc ttt gtg gtg tcc tgt att ccc aat gct768Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys Ile Pro Asn Ala245 250 255ggt ttg cca gaa aac gtt ggc ggt caa gct ttt tat cgc ctc acc ccg816Gly Leu Pro Glu Asn Val Gly Gly Gln Ala 
Phe Tyr Arg Leu Thr Pro260 265 270atg gaa ctg caa atg tcc ctg atg cac ttc atc gaa gac ctg gga gta864Met Glu Leu Gln Met Ser Leu Met His Phe Ile Glu Asp Leu Gly Val275 280 285
cag gta att ggt ggt tgt tgt ggc act aga ccc gat cac atc aag gcc912Gln Val Ile Gly Gly Cys Cys Gly Thr Arg Pro Asp His Ile Lys Ala290 295 300ctg gcg gat att gcc aag gat ctc cag ccc aaa caa cgc caa cct cac960Leu Ala Asp Ile Ala Lys Asp Leu Gln Pro Lys Gln Arg Gln Pro His305 310 315 320tac gaa ccc agc gcc gct tcc att tat tcc acc caa acc tac gcc caa1008Tyr Glu Pro Ser Ala Ala Ser Ile Tyr Ser Thr Gln Thr Tyr Ala Gln325 330 335gaa aat tct ttt tta atc att ggc gaa cgg ctc aat gcc agt ggc tcg1056Glu Asn Ser Phe Leu Ile Ile Gly Glu Arg Leu Asn Ala Ser Gly Ser340 345 350aaa aaa tgt cga gat ctg ctc aat gct gaa gat tgg gac agc cta gtt1104Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val355 360 365tcc ctg gct aaa tcc caa gtc aag gaa gga gcc caa atc ctt gac gtc1152Ser Leu Ala Lys Ser Gln Val Lys Glu Gly Ala Gln Ile Leu Asp Val370 375 380aac gtg gat tac gtt ggt cga gat ggg gta agg gac atg aaa gaa tta1200Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys Glu Leu385 390 395 400gct tcc cga cta gtc aat aat gtc acc ctg ccg ttg atg ttg gac tcc1248Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser405 410 415acc gaa tgg caa aaa atg gag gcg ggt tta aaa gtt gca ggg gga aaa1296Thr Glu Trp Gln Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys420 425 430tgt att ctc aat tcc acc aac tac gaa gac ggg gaa gaa cgg ttt tat1344Cys Ile Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr435 440 445aaa gtg tta gaa att gcc aaa gaa tat gga gct ggt att gtc att ggc1392Lys Val Leu Glu Ile Ala Lys Glu Tyr Gly Ala Gly Ile Val Ile Gly450 455460acc atc gat gaa gat ggc atg gga cgc act gca gat aaa aaa ttt gag1440Thr Ile Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu465 470 475 480att gcc aaa cgg gcc tac gaa gcg gcg atc gcc ttt ggc att ccg gcc1488Ile Ala Lys Arg Ala Tyr Glu Ala Ala Ile Ala Phe Gly Ile Pro Ala485 490 495aca gaa att ttc ttt gat cct tta gct ctg cct att tcc acc ggc att1536Thr Glu Ile Phe Phe Asp Pro Leu Ala Leu Pro Ile Ser Thr Gly Ile500 505 510gaa gaa gac agg 
gag aac ggt aaa gcc acc gtg gat gct atc cgc aga1584Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala Ile Arg Arg515 520 525att cgc cag gaa ttg ccc gat tgt cat att ttg ttg ggg gtt tct aac1632
Ile Arg Gln Glu Leu Pro Asp Cys His Ile Leu Leu Gly Val Ser Asn530 535 540gtt tcc ttt ggc ttg aat ccc gcc gct cgc cag gta ctc aat tcc atc1680Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gln Val Leu Asn Ser Ile545 550 555 560ttt ctc cac gaa tgt atg cag gtg ggc atg gat gcg gcc att gtc agt1728Phe Leu His Glu Cys Met Gln Val Gly Met Asp Ala Ala Ile Val Ser565 570 575gcc aat aag att tta ccc ctg gca aaa att gac cca gaa caa caa caa1776Ala Asn Lys Ile Leu Pro Leu Ala Lys Ile Asp Pro Glu Gln Gln Gln580 585 590gtc tgt cta gat tta atc tat gac cgc cgg gaa ttt gaa gga gag cgc1824Val Cys Leu Asp Leu Ile Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg595 600 605tgt aca tat gac ccg tta acc aaa ctc acc act tta ttt gaa ggt aaa1872Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly Lys610 615 620acc acc aaa cgg gat aaa tcc ggt gat gcc aat tta ccg gtg gaa gaa1920Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu625 630 635 640aga tta aaa cgc cac atc att gat ggg gaa aga ttg ggc tta gaa gag1968Arg Leu Lys Arg His Ile Ile Asp Gly Glu Arg Leu Gly Leu Glu Glu645 650 655gcc ctc aat gaa gct tta aaa ctt tac gct ccc tta gat atc att aac2016Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp Ile Ile Asn660 665 670atc tat ttg ttg gat ggc atg aaa gtg gtg ggg gaa cta ttt ggt tcc2064Ile Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser675 680 685ggg caa atg cag ttg ccc ttt gtg ttg cag tcg gcc caa acc atg aaa2112Gly Gln Met Gln Leu Pro Phe Val Leu Gln Ser Ala Gln Thr Met Lys690 695 700gcg gcg gtg gct ttt tta gaa ccc cat atg gat aag gat gat tcc gcc2160Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala705 710 715 720gac aat gct aag ggt act ttt tta att gcc act gtt aag ggg gat gtc2208Asp Asn Ala Lys Gly Thr Phe Leu Ile Ala Thr Val Lys Gly Asp Val725 730 735cat gat att ggc aaa aac tta gtg gat att atc ctt tcc aac aat ggc2256His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly740 745 750tat cga gtg gtc aac cta ggc att aaa cag cca gtg gaa aat att atc2304Tyr Arg Val 
Val Asn Leu Gly Ile Lys Gln Pro Val Glu Asn Ile Ile755 760 765gaa gcc tac aaa aaa cac agg ccc gat tgc att gcc atg agt ggt ttg2352Glu Ala Tyr Lys Lys His Arg Pro Asp Cys Ile Ala Met Ser Gly Leu770 775 780
ttg gtc aaa tca act gct ttt atg aag gaa aat tta gaa gtt ttc aac2400Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn785 790 795 800caa gag ggc att act gtt ccc gtc att ctt ggt ggt gct gct tta acg2448Gln Glu Gly Ile Thr Val Pro Val Ile Leu Gly Gly Ala Ala Leu Thr805 810 815cct aaa ttt gtt cac cag gac tgc caa aat acc tac aaa ggc caa gta2496Pro Lys Phe Val His Gln Asp Cys Gln Asn Thr Tyr Lys Gly Gln Val820 825 830att tac ggc aaa gat gcg ttc gcc gat tta cat ttc atg gat aag cta2544Ile Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu835 840 845atg ccc gct aaa aat agc cac aat tgg gat gat ttc cag ggc ttt tta2592Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gln Gly Phe Leu850 855 860ggg gaa tat gca acg gaa aat ggc cat aat gtg acc act gat gat ggt2640Gly Glu Tyr Ala Thr Glu Asn Gly His Asn Val Thr Thr Asp Asp Gly865 870 875 880gct aaa act aat ttt ggc att gaa gaa gaa aaa tta att gac gct agt2688Ala Lys Thr Asn Phe Gly Ile Glu Glu Glu Lys Leu Ile Asp Ala Ser885 890 895gag cag tct agg gag ccg gag gta att gat act gtt cgt tct gaa gcg2736Glu Gln Ser Arg Glu Pro Glu Val Ile Asp Thr Val Arg Ser Glu Ala900 905 910gtg gac cct gat cta gaa aga cct gtg cca cct ttt tgg ggc act aaa2784Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys915 920 925att ttg caa tcc agt gat att tcc ctc gat gaa gtc ttc cct tta ctg2832Ile Leu Gln Ser Ser Asp Ile Ser Leu Asp Glu Val Phe Pro Leu Leu930 935 940gat tta caa gca tta ttt gtt ggt cag tgg cag ttt cgc aaa cct agg2880Asp Leu Gln Ala Leu Phe Val Gly Gln Trp Gln Phe Arg Lys Pro Arg945 950 955 960gag caa tcc agg gaa gaa tac gag caa ttc cta gcg gaa aaa gtt cat2928Glu Gln Ser Arg Glu Glu Tyr Glu Gln Phe Leu Ala Glu Lys Val His965 970 975ccc att ttg gct gag tgg aaa ggt aag gtc atg gca gaa aat tta ctc2976Pro Ile Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu980 985 990cat cct acg gtg gtt tat ggt tat ttt ccc tgt caa tcc cag ggc aat3024His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gln Ser Gln Gly Asn99510001005acc ttg tta 
att tat gac cca gaa ttg gtc agc caa aat aat ggc caa3072Thr Leu Leu Ile Tyr Asp Pro Glu Leu Val Ser Gln Asn Asn Gly Gln101010151020att ccc cca gac gca acg gcg atc gcc aaa ttt gag ttt ccc cgg caa3120
Ile Pro Pro Asp Ala Thr Ala Ile Ala Lys Phe Glu Phe Pro Arg Gln1025 103010351040aaa tca ggg cgg cgg ctc tgt att gcg gac ttt ttt gct tca aaa gaa3168Lys Ser Gly Arg Arg Leu Cys Ile Ala Asp Phe Phe Ala Ser Lys Glu104510501055tcg ggg att act gat gtt ttt cct ttg caa gcg gtt aca gtg ggg gaa3216Ser Gly Ile Thr Asp Val Phe Pro Leu Gln Ala Val Thr Val Gly Glu106010651070atc gcg acg gaa tat gca agg aaa ctt ttt gct ggc gat aat tac acc3264Ile Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr107510801085gat tac ctc tac ttc cac ggc atg gcg gtg cag atg gcg gaa gct tta3312Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gln Met Ala Glu Ala Leu109010951100gcg gag tgg act cac caa cgg ata cgt cag gaa ttg ggc ttt ggc cat3360Ala Glu Trp Thr His Gln Arg Ile Arg Gln Glu Leu Gly Phe Gly His1105 111011151120tta gat cca gat aac atc cgt gat ctt ctc cag caa cgt tac caa ggt3408Leu Asp Pro Asp Asn Ile Arg Asp Leu Leu Gln Gln Arg Tyr Gln Gly112511301135tcc cgc tac agt ttt ggt tat ccc gct tgt ccc aac atg cag gat caa3456Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gln Asp Gln114011451150tac aca caa tta gaa ttg tta caa acc gaa cga att ggc ttg tat atg3504Tyr Thr Gln Leu Glu Leu Leu Gln Thr Glu Arg Ile Gly Leu Tyr Met115511601165gat gaa agt gaa cag gtt tat cca gaa caa tcc acc acg gcg att att3552Asp Glu Ser Glu Gln Val Tyr Pro Glu Gln Ser Thr Thr Ala Ile Ile117011751180tcc tat cat cct gcg gct aaa tat ttc agc gct taa3588Ser Tyr His Pro Ala Ala Lys Tyr Phe Ser Ala1185 1190119521062111195212PRT213集胞藻屬種4006Met Lys Ser Ala Phe Leu Asp Arg Ile His Ser Pro Asp Arg Pro Val1 5 10 15Leu Val Phe Asp Gly Ala Met Gly Thr Asn Leu Gln Val Gln Asn Leu20 25 30Thr Ala Ala Asp Phe Gly Gly Ala Glu Tyr Glu Gly Cys Asn Glu Tyr35 40 45Leu Val His Thr Lys Pro Glu Ala Val Ala Thr Val His Arg Ala Phe50 55 60Tyr Glu Ala Gly Ala Asp Val Val Glu Thr Asp Thr Phe Gly Gly Thr
65 70 75 80Pro Leu Val Leu Ala Glu Tyr Asp Leu Ala Asp Gln Ser Tyr Tyr Leu85 90 95Asn Lys Ala Ala Ala Glu Leu Ala Lys Ala Val Ala Ala Glu Phe Ser100 105 110Thr Pro Glu Lys Pro Arg Phe Val Ala Gly Ser Met Gly Pro Gly Thr115 120 125Lys Leu Pro Thr Leu Gly His Val Asp Tyr Asp Ser Leu Lys Asp Ala130 135 140Tyr Val Val Gln Val Arg Gly Leu Tyr Asp Gly Gly Val Asp Leu Leu145 150 155 160Leu Val Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ala Ala Leu Asn165 170 175Ala Ile Glu Gln Val Phe Ala Glu Lys Gly Asp Arg Leu Pro Leu Met180 185 190Val Ser Val Thr Met Glu Thr Met Gly Thr Met Leu Val Gly Thr Glu195 200 205Met Ala Ala Ala Leu Ala Ile Leu Glu Pro Tyr Pro Ile Asp Ile Leu210 215 220Gly Leu Asn Cys Ala Thr Gly Pro Asp Leu Met Lys Glu His Val Lys225 230 235 240Tyr Leu Ser Glu His Ser Pro Phe Val Val Ser Cys Ile Pro Asn Ala245 250 255Gly Leu Pro Glu Asn Val Gly Gly Gln Ala Phe Tyr Arg Leu Thr Pro260 265 270Met Glu Leu Gln Met Ser Leu Met His Phe Ile Glu Asp Leu Gly Val275 280 285Gln Val Ile Gly Gly Cys Cys Gly Thr Arg Pro Asp His Ile Lys Ala290 295 300Leu Ala Asp Ile Ala Lys Asp Leu Gln Pro Lys Gln Arg Gln Pro His305 310 315 320Tyr Glu Pro Ser Ala Ala Ser Ile Tyr Ser Thr Gln Thr Tyr Ala Gln325 330 335Glu Asn Ser Phe Leu Ile Ile Gly Glu Arg Leu Asn Ala Ser Gly Ser340 345 350Lys Lys Cys Arg Asp Leu Leu Asn Ala Glu Asp Trp Asp Ser Leu Val355 360 365Ser Leu Ala Lys Ser Gln Val Lys Glu Gly Ala Gln Ile Leu Asp Val370 375 380Asn Val Asp Tyr Val Gly Arg Asp Gly Val Arg Asp Met Lys Glu Leu385 390 395 400
Ala Ser Arg Leu Val Asn Asn Val Thr Leu Pro Leu Met Leu Asp Ser405 410 415Thr Glu Trp Gln Lys Met Glu Ala Gly Leu Lys Val Ala Gly Gly Lys420 425 430Cys Ile Leu Asn Ser Thr Asn Tyr Glu Asp Gly Glu Glu Arg Phe Tyr435 440 445Lys Val Leu Glu Ile Ala Lys Glu Tyr Gly Ala Gly Ile Val Ile Gly450 455 460Thr Ile Asp Glu Asp Gly Met Gly Arg Thr Ala Asp Lys Lys Phe Glu465 470 475 480Ile Ala Lys Arg Ala Tyr Glu Ala Ala Ile Ala Phe Gly Ile Pro Ala485 490 495Thr Glu Ile Phe Phe Asp Pro Leu Ala Leu Pro Ile Ser Thr Gly Ile500 505 510Glu Glu Asp Arg Glu Asn Gly Lys Ala Thr Val Asp Ala Ile Arg Arg515 520 525Ile Arg Gln Glu Leu Pro Asp Cys His Ile Leu Leu Gly Val Ser Asn530 535 540Val Ser Phe Gly Leu Asn Pro Ala Ala Arg Gln Val Leu Asn Ser Ile545 550 555 560Phe Leu His Glu Cys Met Gln Val Gly Met Asp Ala Ala Ile Val Ser565 570 575Ala Asn Lys Ile Leu Pro Leu Ala Lys Ile Asp Pro Glu Gln Gln Gln580 585 590Val Cys Leu Asp Leu Ile Tyr Asp Arg Arg Glu Phe Glu Gly Glu Arg595 600 605Cys Thr Tyr Asp Pro Leu Thr Lys Leu Thr Thr Leu Phe Glu Gly Lys610 615 620Thr Thr Lys Arg Asp Lys Ser Gly Asp Ala Asn Leu Pro Val Glu Glu625 630 635 640Arg Leu Lys Arg His Ile Ile Asp Gly Glu Arg Leu Gly Leu Glu Glu645 650 655Ala Leu Asn Glu Ala Leu Lys Leu Tyr Ala Pro Leu Asp Ile Ile Asn660 665 670Ile Tyr Leu Leu Asp Gly Met Lys Val Val Gly Glu Leu Phe Gly Ser675 680 685Gly Gln Met Gln Leu Pro Phe Val Leu Gln Ser Ala Gln Thr Met Lys690 695 700Ala Ala Val Ala Phe Leu Glu Pro His Met Asp Lys Asp Asp Ser Ala705 710 715 720Asp Asn Ala Lys Gly Thr Phe Leu Ile Ala Thr Val Lys Gly Asp Val725 730 735
His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly740 745 750Tyr Arg Val Val Asn Leu Gly Ile Lys Gln Pro Val Glu Asn Ile Ile755 760 765Glu Ala Tyr Lys Lys His Arg Pro Asp Cys Ile Ala Met Ser Gly Leu770 775 780Leu Val Lys Ser Thr Ala Phe Met Lys Glu Asn Leu Glu Val Phe Asn785 790 795 800Gln Glu Gly Ile Thr Val Pro Val Ile Leu Gly Gly Ala Ala Leu Thr805 810 815Pro Lys Phe Val His Gln Asp Cys Gln Asn Thr Tyr Lys Gly Gln Val820 825 830Ile Tyr Gly Lys Asp Ala Phe Ala Asp Leu His Phe Met Asp Lys Leu835 840 845Met Pro Ala Lys Asn Ser His Asn Trp Asp Asp Phe Gln Gly Phe Leu850 855 860Gly Glu Tyr Ala Thr Glu Asn Gly His Asn Val Thr Thr Asp Asp Gly865 870 875 880Ala Lys Thr Asn Phe Gly Ile Glu Glu Glu Lys Leu Ile Asp Ala Ser885 890 895Glu Gln Ser Arg Glu Pro Glu Val Ile Asp Thr Val Arg Ser Glu Ala900 905 910Val Asp Pro Asp Leu Glu Arg Pro Val Pro Pro Phe Trp Gly Thr Lys915 920 925Ile Leu Gln Ser Ser Asp Ile Ser Leu Asp Glu Val Phe Pro Leu Leu930 935 940Asp Leu Gln Ala Leu Phe Val Gly Gln Trp Gln Phe Arg Lys Pro Arg945 950 955 960Glu Gln Ser Arg Glu Glu Tyr Glu Gln Phe Leu Ala Glu Lys Val His965 970 975Pro Ile Leu Ala Glu Trp Lys Gly Lys Val Met Ala Glu Asn Leu Leu980 985 990His Pro Thr Val Val Tyr Gly Tyr Phe Pro Cys Gln Ser Gln Gly Asn99510001005Thr Leu Leu Ile Tyr Asp Pro Glu Leu Val Ser Gln Asn Asn Gly Gln101010151020Ile Pro Pro Asp Ala Thr Ala Ile Ala Lys Phe Glu Phe Pro Arg Gln1025 103010351040Lys Ser Gly Arg Arg Leu Cys Ile Ala Asp Phe Phe Ala Ser Lys Glu104510501055Ser Gly Ile Thr Asp Val Phe Pro Leu Gln Ala Val Thr Val Gly Glu
106010651070Ile Ala Thr Glu Tyr Ala Arg Lys Leu Phe Ala Gly Asp Asn Tyr Thr107510801085Asp Tyr Leu Tyr Phe His Gly Met Ala Val Gln Met Ala Glu Ala Leu109010951100Ala Glu Trp Thr His Gln Arg Ile Arg Gln Glu Leu Gly Phe Gly His1105 111011151120Leu Asp Pro Asp Asn Ile Arg Asp Leu Leu Gln Gln Arg Tyr Gln Gly112511301135Ser Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn Met Gln Asp Gln114011451150Tyr Thr Gln Leu Glu Leu Leu Gln Thr Glu Arg Ile Gly Leu Tyr Met115511601165Asp Glu Ser Glu Gln Val Tyr Pro Glu Gln Ser Thr Thr Ala Ile Ile117011751180Ser Tyr His Pro Ala Ala Lys Tyr Phe Ser Ala1185 1190119521072113561212DNA213海洋原綠球藻(Prochlorococcus marinus)220
221CDS222(1)..(3558)223RCK008304007atg gtt tca ttt aga aat tat tta aat aga gat gat aaa cca att att48Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro Ile Ile1 5 10 15att ttc gat ggt ggg aca ggt act tct ttt caa aat tta aat tta tca96Ile Phe Asp Gly Gly Thr Gly Thr Ser Phe Gln Asn Leu Asn Leu Ser20 25 30tca cat gat ttt ggt gga gat gat tta gag ggt tgc aat gaa aac tta144Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu35 40 45gtt cta tcc tct cct aat act gtt gaa caa gta cat aat tca ttt ctt192Val Leu Ser Ser Pro Asn Thr Val Glu Gln Val His Asn Ser Phe Leu50 55 60gaa gca ggt tgt cat gta att gaa acc aat aca ttt ggt gct tca tct240Glu Ala Gly Cys His Val Ile Glu Thr Asn Thr Phe Gly Ala Ser Ser65 70 75 80att gtt tta gac gaa tat agt att tct aat aaa gct tat gaa atc aat288Ile Val Leu Asp Glu Tyr Ser Ile Ser Asn Lys Ala Tyr Glu Ile Asn85 90 95
aaa aaa gca gct cag ata gct aaa aaa tgt gca aat tta ttt tca tct336Lys Lys Ala Ala Gln Ile Ala Lys Lys Cys Ala Asn Leu Phe Ser Ser100 105 110att aat act cct aga ttt gtc gct gga tca att ggg cca act aca aaa384Ile Asn Thr Pro Arg Phe Val Ala Gly Ser Ile Gly Pro Thr Thr Lys115 120 125tta cca aca tta ggt cat att agt ttt gat aag ctt aaa gat tca tat432Leu Pro Thr Leu Gly His Ile Ser Phe Asp Lys Leu Lys Asp Ser Tyr130 135 140gaa gaa caa ata aat ggt cta att gac gga ggt att gac ctt cta ttg480Glu Glu Gln Ile Asn Gly Leu Ile Asp Gly Gly Ile Asp Leu Leu Leu145 150 155 160att gaa aca tgc caa gat gtt tta caa ata aaa tca gca tta tct gct528Ile Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ser Ala Leu Ser Ala165 170 175tct caa gaa gtt att aaa aac agg aat att gaa tta cca ata atg ata576Ser Gln Glu Val Ile Lys Asn Arg Asn Ile Glu Leu Pro Ile Met Ile180 185 190tcc ata act atg gaa acc aca gga acg atg ctt gtc ggg tca gat ata624Ser Ile Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp Ile195 200 205gct tct gca tta aca ata tta gag cca tac aat att gat att ctg gga672Ala Ser Ala Leu Thr Ile Leu Glu Pro Tyr Asn Ile Asp Ile Leu Gly210 215 220ctg aat tgt gca act ggt cca gtt caa atg aaa gaa cat att aag tat720Leu Asn Cys Ala Thr Gly Pro Val Gln Met Lys Glu His Ile Lys Tyr225 230 235 240tta gct gaa aat tca cct ttt gca att agt tgt ata cct aat gca gga768Leu Ala Glu Asn Ser Pro Phe Ala Ile Ser Cys Ile Pro Asn Ala Gly245 250 255tta cct gaa aat ata gga ggt gtt gct cac tat aaa tta act cca ttg816Leu Pro Glu Asn Ile Gly Gly Val Ala His Tyr Lys Leu Thr Pro Leu260 265 270gag ttg aaa atg cag tta atg aac ttt att tat gat ttt aac gta caa864Glu Leu Lys Met Gln Leu Met Asn Phe Ile Tyr Asp Phe Asn Val Gln275 280 285ctt att ggc gga tgt tgt ggt act act cct gaa cat atc aag cat tta912Leu Ile Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Lys His Leu290 295 300tca tca atc att gag gaa ata gtt gat aaa aaa ata aat aaa aga ctt960Ser Ser Ile Ile Glu Glu Ile Val Asp Lys Lys Ile Asn Lys Arg Leu305 310 315 320cct act gta aaa aca aat ttt 
gtt cct tca gca gct tct ata tat aac1008Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser Ile Tyr Asn325 330 335gca gtt cca tat aaa caa gat aac tca ata tta ata gtt gga gaa cgt1056Ala Val Pro Tyr Lys Gln Asp Asn Ser Ile Leu Ile Val Gly Glu Arg
340 345 350tta aat gct agt gga tca aaa aaa gta agg gaa tta cta aat gaa gat1104Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu Asn Glu Asp355 360 365gat tgg gac ggc ctg cta tca att gct aaa caa cag caa aaa gaa aat1152Asp Trp Asp Gly Leu Leu Ser Ile Ala Lys Gln Gln Gln Lys Glu Asn370 375 380gct cac ata cta gat gtc aat gtt gat tat gta gga aga gat gga gtt1200Ala His Ile Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val385 390 395 400aaa gat atg aaa gaa att acc tca aga tta gtt aca aat ata aat ctt1248Lys Asp Met Lys Glu Ile Thr Ser Arg Leu Val Thr Asn Ile Asn Leu405 410 415cca tta atg ata gat tca aca gaa gca gat aaa atg gaa agt gga tta1296Pro Leu Met Ile Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu420 425 430aag act gta gga gga aaa tgc att ata aat tca aca aac tac gaa gat1344Lys Thr Val Gly Gly Lys Cys Ile Ile Asn Ser Thr Asn Tyr Glu Asp435 440 445gga gat gac aga ttt aat cag gtc tta aga ctt gca tta gat tat ggt1392Gly Asp Asp Arg Phe Asn Gln Val Leu Arg Leu Ala Leu Asp Tyr Gly450 455 460gct gga ata gta att gga act att gat gaa gat gga atg gca aga aca1440Ala Gly Ile Val Ile Gly Thr Ile Asp Glu Asp Gly Met Ala Arg Thr465 470 475 480tca cag aaa aaa tat gac att gca aaa aga gca tta att aaa act aga1488Ser Gln Lys Lys Tyr Asp Ile Ala Lys Arg Ala Leu Ile Lys Thr Arg485 490 495tca agt ggc ctc gct gat tat gag ata ttt ttt gat cct cta gca ttg1536Ser Ser Gly Leu Ala Asp Tyr Glu Ile Phe Phe Asp Pro Leu Ala Leu500 505 510cca ata tct act gga att gaa gaa gat aga tta aat gct aaa gca act1584Pro Ile Ser Thr Gly Ile Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr515 520 525att gaa gct ata tca aaa ata aga aaa agc ttt cca gat att cat att1632Ile Glu Ala Ile Ser Lys Ile Arg Lys Ser Phe Pro Asp Ile His Ile530 535 540att tta ggg ata tct aat att agt ttc ggg ctt tca cca tta tca aga1680Ile Leu Gly Ile Ser Asn Ile Ser Phe Gly Leu Ser Pro Leu Ser Arg545 550 555 560att aat cta aat tca ata ttt ctc gat gaa tgt ata aag gca gga tta1728Ile Asn Leu Asn Ser Ile Phe Leu Asp Glu Cys Ile Lys Ala Gly Leu565 570 
575gat tca gcg att att gca cca aat aaa ata ttg cct ctt tca aaa ata1776Asp Ser Ala Ile Ile Ala Pro Asn Lys Ile Leu Pro Leu Ser Lys Ile580 585 590
tct gcg gaa aca aaa aaa tta tgt tta gat tta att tat gac aga aga1824Ser Ala Glu Thr Lys Lys Leu Cys Leu Asp Leu Ile Tyr Asp Arg Arg595 600 605aat ttc gaa aat gaa ata tgt ata tat gat cca tta gtt gaa cta aca1872Asn Phe Glu Asn Glu Ile Cys Ile Tyr Asp Pro Leu Val Glu Leu Thr610 615 620aaa gca ttc caa gat ata aca atc agt gac ttt aaa aaa gga tct act1920Lys Ala Phe Gln Asp Ile Thr Ile Ser Asp Phe Lys Lys Gly Ser Thr625 630 635 640tca aac aaa aac ctc acc tta gaa gaa aaa ctt aaa aac cat att gta1968Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His Ile Val645 650 655gat ggg gaa aaa ata ggt tta gaa gaa caa tta aat aat gcg ctt aaa2016Asp Gly Glu Lys Ile Gly Leu Glu Glu Gln Leu Asn Asn Ala Leu Lys660 665 670aag tac aaa cca ctt gaa ata att aat act tat tta tta gat gga atg2064Lys Tyr Lys Pro Leu Glu Ile Ile Asn Thr Tyr Leu Leu Asp Gly Met675 680 685aaa gta gtc ggt gaa cta ttt gga tcc ggc caa atg caa tta cct ttt2112Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gln Met Gln Leu Pro Phe690 695 700gta ttg caa tca gcg gaa aca atg aaa ttt gct gtt tca gtg ctt gaa2160Val Leu Gln Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu705 710 715 720cct cat atg gaa aca gta gat gaa aaa ata tct aac gga aaa tta cta2208Pro His Met Glu Thr Val Asp Glu Lys Ile Ser Asn Gly Lys Leu Leu725 730 735ata gca act gtt aaa gga gat gtt cat gat ata ggt aaa aat tta gtt2256Ile Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Leu Val740 745 750gat ata att ctc tca aat aat ggt ttt gat gta atc aac ctt gga att2304Asp Ile Ile Leu Ser Asn Asn Gly Phe Asp Val Ile Asn Leu Gly Ile755 760 765aag caa gat gtt tca gcg att att gat gca caa aaa aaa cat aaa gca2352Lys Gln Asp Val Ser Ala Ile Ile Asp Ala Gln Lys Lys His Lys Ala770 775 780gac tgt att gct atg agt ggt tta ctt gtt aaa tct aca gca ttt atg2400Asp Cys Ile Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met785 790 795 800aag gat aat tta gaa gca ttt aac aat gct gaa att aat gtt cca gtt2448Lys Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu Ile Asn Val Pro Val805 810 815att ctt gga 
ggt gca gca tta act cca aaa ttt gtg aat gaa gat tgt2496Ile Leu Gly Gly Ala Ala Leu Thr Pro Lys Phe Val Asn Glu Asp Cys820 825 830agt cag ata tat aaa ggt aaa att ttg tat ggg aaa gat gct ttt aca2544Ser Gln Ile Tyr Lys Gly Lys Ile Leu Tyr Gly Lys Asp Ala Phe Thr
835 840 845gat tta caa ttt atg aat gac tat atg gat agt aaa aag aag ggc aat2592Asp Leu Gln Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn850 855 860tgg tct aat gaa aat ggt ttt act aat act gat gat att caa att aaa2640Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp Ile Gln Ile Lys865 870 875 880tta gct tcc cca agg tct tcc gct aaa gat aaa aat tta aat aaa aat2688Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn885 890 895ttt gaa aaa acc aaa agt att caa tta att gag aat ttt aat aga tct2736Phe Glu Lys Thr Lys Ser Ile Gln Leu Ile Glu Asn Phe Asn Arg Ser900 905 910aat ttt gta gag gaa gag gaa cct ata aag gct cca ttt ttg gga act2784Asn Phe Val Glu Glu Glu Glu Pro Ile Lys Ala Pro Phe Leu Gly Thr915 920 925aga gtt ctt caa gat att gaa ata gac ttt gac aaa cta att ttt tat2832Arg Val Leu Gln Asp Ile Glu Ile Asp Phe Asp Lys Leu Ile Phe Tyr930 935 940cta gat aaa aaa gca tta ttt agt ggt caa tgg caa att aaa aaa aat2880Leu Asp Lys Lys Ala Leu Phe Ser Gly Gln Trp Gln Ile Lys Lys Asn945 950 955 960aaa ggt caa tca gta gaa gaa tac aat aat tat tta gat tca tat gca2928Lys Gly Gln Ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala965 970 975aat cca tta ctt gaa aaa tgg att aat att att tta gat aaa ggc tta2976Asn Pro Leu Leu Glu Lys Trp Ile Asn Ile Ile Leu Asp Lys Gly Leu980 985 990att tca cca aaa gta gtc tat ggc tac ttc cgt tgc ggg agg aat gat3024Ile Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp99510001005aat agt att tat ctc ttt gat aat gta tca aat aaa aga att tct gaa3072Asn Ser Ile Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg Ile Ser Glu101010151020ttt aac ttt cct aga caa aaa tcg gga aat aat ctt tgt att gca gat3120Phe Asn Phe Pro Arg Gln Lys Ser Gly Asn Asn Leu Cys Ile Ala Asp1025 103010351040ttt tac tgt gat ctt aaa aat aat gat cca gta gat ata ttt cca atg3168Phe Tyr Cys Asp Leu Lys Asn Asn Asp Pro Val Asp Ile Phe Pro Met104510501055caa gca gta aca atg ggg gaa ata gct agc gaa tat tcc caa gaa tta3216Gln Ala Val Thr Met Gly Glu Ile Ala Ser Glu Tyr Ser Gln Glu 
Leu106010651070ttt aaa gct gat aaa tat agt gat tat tta ata ttt cat ggt tta acc3264Phe Lys Ala Asp Lys Tyr Ser Asp Tyr Leu Ile Phe His Gly Leu Thr107510801085
gtt caa tta gca gaa gct ctt gca gaa tat gtt cat tca ata gta aga3312Val Gln Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser Ile Val Arg109010951100att gaa tgc gga ttt aaa tca tat gag cca aac aat aac cgt gat ata3360Ile Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Asn Asn Arg Asp Ile1105 111011151120tta gct caa aaa tat aga gga gct aga tac tca ttt ggt tat cca gct3408Leu Ala Gln Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala112511301135tgt cct aaa gtt tct gat tca aat ata cag tta tca tta ttg gat aca3456Cys Pro Lys Val Ser Asp Ser Asn Ile Gln Leu Ser Leu Leu Asp Thr114011451150aaa agg att aat tta aca atg gat gaa tca gag caa tta cat cct gaa3504Lys Arg Ile Asn Leu Thr Met Asp Glu Ser Glu Gln Leu His Pro Glu115511601165caa agt act act gct ata att tca ctt cat tca aaa gca aaa tat ttt3552Gln Ser Thr Thr Ala Ile Ile Ser Leu His Ser Lys Ala Lys Tyr Phe117011751180agt gcc taa3561Ser Ala 1185 <210> 8 <211> 1186 <212> PRT <213> Prochlorococcus marinus <400> 8 Met Val Ser Phe Arg Asn Tyr Leu Asn Arg Asp Asp Lys Pro Ile Ile1 5 10 15Ile Phe Asp Gly Gly Thr Gly Thr Ser Phe Gln Asn Leu Asn Leu Ser20 25 30Ser His Asp Phe Gly Gly Asp Asp Leu Glu Gly Cys Asn Glu Asn Leu35 40 45Val Leu Ser Ser Pro Asn Thr Val Glu Gln Val His Asn Ser Phe Leu50 55 60Glu Ala Gly Cys His Val Ile Glu Thr Asn Thr Phe Gly Ala Ser Ser65 70 75 80Ile Val Leu Asp Glu Tyr Ser Ile Ser Asn Lys Ala Tyr Glu Ile Asn85 90 95Lys Lys Ala Ala Gln Ile Ala Lys Lys Cys Ala Asn Leu Phe Ser Ser100 105 110Ile Asn Thr Pro Arg Phe Val Ala Gly Ser Ile Gly Pro Thr Thr Lys115 120 125Leu Pro Thr Leu Gly His Ile Ser Phe Asp Lys Leu Lys Asp Ser Tyr130 135 140
Glu Glu Gln Ile Asn Gly Leu Ile Asp Gly Gly Ile Asp Leu Leu Leu145 150 155 160Ile Glu Thr Cys Gln Asp Val Leu Gln Ile Lys Ser Ala Leu Ser Ala165 170 175Ser Gln Glu Val Ile Lys Asn Arg Asn Ile Glu Leu Pro Ile Met Ile180 185 190Ser Ile Thr Met Glu Thr Thr Gly Thr Met Leu Val Gly Ser Asp Ile195 200 205Ala Ser Ala Leu Thr Ile Leu Glu Pro Tyr Asn Ile Asp Ile Leu Gly210 215 220Leu Asn Cys Ala Thr Gly Pro Val Gln Met Lys Glu His Ile Lys Tyr225 230 235 240Leu Ala Glu Asn Ser Pro Phe Ala Ile Ser Cys Ile Pro Asn Ala Gly245 250 255Leu Pro Glu Asn Ile Gly Gly Val Ala His Tyr Lys Leu Thr Pro Leu260 265 270Glu Leu Lys Met Gln Leu Met Asn Phe Ile Tyr Asp Phe Asn Val Gln275 280 285Leu Ile Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Lys His Leu290 295 300Ser Ser Ile Ile Glu Glu Ile Val Asp Lys Lys Ile Asn Lys Arg Leu305 310 315 320Pro Thr Val Lys Thr Asn Phe Val Pro Ser Ala Ala Ser Ile Tyr Asn325 330 335Ala Val Pro Tyr Lys Gln Asp Asn Ser Ile Leu Ile Val Gly Glu Arg340 345 350Leu Asn Ala Ser Gly Ser Lys Lys Val Arg Glu Leu Leu Asn Glu Asp355 360 365Asp Trp Asp Gly Leu Leu Ser Ile Ala Lys Gln Gln Gln Lys Glu Asn370 375 380Ala His Ile Leu Asp Val Asn Val Asp Tyr Val Gly Arg Asp Gly Val385 390 395 400Lys Asp Met Lys Glu Ile Thr Ser Arg Leu Val Thr Asn Ile Asn Leu405 410 415Pro Leu Met Ile Asp Ser Thr Glu Ala Asp Lys Met Glu Ser Gly Leu420 425 430Lys Thr Val Gly Gly Lys Cys Ile Ile Asn Ser Thr Asn Tyr Glu Asp435 440 445Gly Asp Asp Arg Phe Asn Gln Val Leu Arg Leu Ala Leu Asp Tyr Gly450 455 460Ala Gly Ile Val Ile Gly Thr Ile Asp Glu Asp Gly Met Ala Arg Thr465 470 475 480
Ser Gln Lys Lys Tyr Asp Ile Ala Lys Arg Ala Leu Ile Lys Thr Arg485 490 495Ser Ser Gly Leu Ala Asp Tyr Glu Ile Phe Phe Asp Pro Leu Ala Leu500 505 510Pro Ile Ser Thr Gly Ile Glu Glu Asp Arg Leu Asn Ala Lys Ala Thr515 520 525Ile Glu Ala Ile Ser Lys Ile Arg Lys Ser Phe Pro Asp Ile His Ile530 535 540Ile Leu Gly Ile Ser Asn Ile Ser Phe Gly Leu Ser Pro Leu Ser Arg545 550 555 560Ile Asn Leu Asn Ser Ile Phe Leu Asp Glu Cys Ile Lys Ala Gly Leu565 570 575Asp Ser Ala Ile Ile Ala Pro Asn Lys Ile Leu Pro Leu Ser Lys Ile580 585 590Ser Ala Glu Thr Lys Lys Leu Cys Leu Asp Leu Ile Tyr Asp Arg Arg595 600 605Asn Phe Glu Asn Glu Ile Cys Ile Tyr Asp Pro Leu Val Glu Leu Thr610 615 620Lys Ala Phe Gln Asp Ile Thr Ile Ser Asp Phe Lys Lys Gly Ser Thr625 630 635 640Ser Asn Lys Asn Leu Thr Leu Glu Glu Lys Leu Lys Asn His Ile Val645 650 655Asp Gly Glu Lys Ile Gly Leu Glu Glu Gln Leu Asn Asn Ala Leu Lys660 665 670Lys Tyr Lys Pro Leu Glu Ile Ile Asn Thr Tyr Leu Leu Asp Gly Met675 680 685Lys Val Val Gly Glu Leu Phe Gly Ser Gly Gln Met Gln Leu Pro Phe690 695 700Val Leu Gln Ser Ala Glu Thr Met Lys Phe Ala Val Ser Val Leu Glu705 710 715 720Pro His Met Glu Thr Val Asp Glu Lys Ile Ser Asn Gly Lys Leu Leu725 730 735Ile Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Leu Val740 745 750Asp Ile Ile Leu Ser Asn Asn Gly Phe Asp Val Ile Asn Leu Gly Ile755 760 765Lys Gln Asp Val Ser Ala Ile Ile Asp Ala Gln Lys Lys His Lys Ala770 775 780Asp Cys Ile Ala Met Ser Gly Leu Leu Val Lys Ser Thr Ala Phe Met785 790 795 800Lys Asp Asn Leu Glu Ala Phe Asn Asn Ala Glu Ile Asn Val Pro Val
805 810 815Ile Leu Gly Gly Ala Ala Leu Thr Pro Lys Phe Val Asn Glu Asp Cys820 825 830Ser Gln Ile Tyr Lys Gly Lys Ile Leu Tyr Gly Lys Asp Ala Phe Thr835 840 845Asp Leu Gln Phe Met Asn Asp Tyr Met Asp Ser Lys Lys Lys Gly Asn850 855 860Trp Ser Asn Glu Asn Gly Phe Thr Asn Thr Asp Asp Ile Gln Ile Lys865 870 875 880Leu Ala Ser Pro Arg Ser Ser Ala Lys Asp Lys Asn Leu Asn Lys Asn885 890 895Phe Glu Lys Thr Lys Ser Ile Gln Leu Ile Glu Asn Phe Asn Arg Ser900 905 910Asn Phe Val Glu Glu Glu Glu Pro Ile Lys Ala Pro Phe Leu Gly Thr915 920 925Arg Val Leu Gln Asp Ile Glu Ile Asp Phe Asp Lys Leu Ile Phe Tyr930 935 940Leu Asp Lys Lys Ala Leu Phe Ser Gly Gln Trp Gln Ile Lys Lys Asn945 950 955 960Lys Gly Gln Ser Val Glu Glu Tyr Asn Asn Tyr Leu Asp Ser Tyr Ala965 970 975Asn Pro Leu Leu Glu Lys Trp Ile Asn Ile Ile Leu Asp Lys Gly Leu980 985 990Ile Ser Pro Lys Val Val Tyr Gly Tyr Phe Arg Cys Gly Arg Asn Asp99510001005Asn Ser Ile Tyr Leu Phe Asp Asn Val Ser Asn Lys Arg Ile Ser Glu101010151020Phe Asn Phe Pro Arg Gln Lys Ser Gly Asn Asn Leu Cys Ile Ala Asp1025 103010351040Phe Tyr Cys Asp Leu Lys Asn Asn Asp Pro Val Asp Ile Phe Pro Met104510501055Gln Ala Val Thr Met Gly Glu Ile Ala Ser Glu Tyr Ser Gln Glu Leu106010651070Phe Lys Ala Asp Lys Tyr Ser Asp Tyr Leu Ile Phe His Gly Leu Thr107510801085Val Gln Leu Ala Glu Ala Leu Ala Glu Tyr Val His Ser Ile Val Arg109010951100Ile Glu Cys Gly Phe Lys Ser Tyr Glu Pro Asn Asn Asn Arg Asp Ile1105 111011151120Leu Ala Gln Lys Tyr Arg Gly Ala Arg Tyr Ser Phe Gly Tyr Pro Ala112511301135
Cys Pro Lys Val Ser Asp Ser Asn Ile Gln Leu Ser Leu Leu Asp Thr1140 1145 1150Lys Arg Ile Asn Leu Thr Met Asp Glu Ser Glu Gln Leu His Pro Glu1155 1160 1165Gln Ser Thr Thr Ala Ile Ile Ser Leu His Ser Lys Ala Lys Tyr Phe1170 1175 1180Ser Ala1185 <210> 9 <211> 3048 <212> DNA <213> Thermus thermophilus <220>
<221> CDS <222> (1)..(3045) <223> RTT00266 <400> 9 atg cgg gcc tac aag gag gcg gca cgg ggg ctt ctt aag ggc ggg gtg48Met Arg Ala Tyr Lys Glu Ala Ala Arg Gly Leu Leu Lys Gly Gly Val1 5 10 15gac ctc atc ctc ttg gag acc gcc cag gac atc ctc cag gtg cgc tgc96Asp Leu Ile Leu Leu Glu Thr Ala Gln Asp Ile Leu Gln Val Arg Cys20 25 30gcc gtc ttg gcg gtg cgg gag gcc atg gcc gag gtg ggc cgg gag gtg144Ala Val Leu Ala Val Arg Glu Ala Met Ala Glu Val Gly Arg Glu Val35 40 45ccc ctc cag gtc cag gtg acc ttt gag gcc acg ggg acg atg ctc gtg192Pro Leu Gln Val Gln Val Thr Phe Glu Ala Thr Gly Thr Met Leu Val50 55 60ggc acg gac gag cag gcg gcc ctg gcc gct ctg gag agc ctc ccc gtg240Gly Thr Asp Glu Gln Ala Ala Leu Ala Ala Leu Glu Ser Leu Pro Val65 70 75 80gac gtg gtg ggg atg aac tgc gcc acg ggc ccc gac ctc atg gac agc288Asp Val Val Gly Met Asn Cys Ala Thr Gly Pro Asp Leu Met Asp Ser85 90 95aag gtg cgc tac ttc gcc gag cac agc acc cgc ttc gtc tcc tgc ctc336Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu100 105 110ccg aac gcg ggc ctg ccc cgg aac gag ggg ggg agg gtg gtc tac gac384Pro Asn Ala Gly Leu Pro Arg Asn Glu Gly Gly Arg Val Val Tyr Asp115 120 125ctc acc ccc gag gag ctc gcc aag tgg cac ctc aag ttc gtg gcc gag432Leu Thr Pro Glu Glu Leu Ala Lys Trp His Leu Lys Phe Val Ala Glu130 135 140tac ggg gtg aac gcc gtg ggg gga tgc tgc ggc acg ggg ccc gag cac480Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His
145 150 155 160ata agg aag gtg gcc gag gcg gtg aag ggg ctc gcc ccg aag cca agg528Ile Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg165 170 175ccc gaa agc ttc cct ccc cag gtg gcc tcc ttg tac cag gcg gtg tcc576Pro Glu Ser Phe Pro Pro Gln Val Ala Ser Leu Tyr Gln Ala Val Ser180 185 190ctc aag cag gag gcg agc ctt ttc ctc gtg ggg gag agg ctc aac gcc624Leu Lys Gln Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala195 200 205acg ggg agc aag cgc ttc cgg gag atg ctc ttc gcg aga gac ctc gag672Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu210 215 220ggc atc ctc gcc ctc gcc cgg gag cag gtg gag gag ggg gcc cac gcc720Gly Ile Leu Ala Leu Ala Arg Glu Gln Val Glu Glu Gly Ala His Ala225 230 235 240ctg gac ctc tcc gtg gcc tgg acg ggg cgg gac gag ctt gag gac ctc768Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Asp Leu245 250 255cgg tgg ctc ctt ccc cat ctc gcc acc gcc ctt acc gtc ccc gtc atg816Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met260 265 270gtg gac tcc acc tcc cct gag gcc atg gag ctc gcc ctc aaa tac ctc864Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu275 280 285ccg ggc cgg gtc ctc ctg aac tcc gcc aac ctc gag gat ggc tta gag912Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu290 295 300cgc ttt gac cgg gtg gcc tcc ctg gcc aag gcc cac ggg gcg gcc ctc960Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu305 310 315 320gtg gtc ctc gcc att gac gag aag ggg atg gcc aag acc cgg gag gag1008Val Val Leu Ala Ile Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu325 330 335aag gtg cgg gtg gcc ctg agg atg tac gag cgc ctc acg gag cac cac1056Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His340 345 350ggc ctc cgc ccc gag gac ctc ctc ttt gac ctc ctt acc ttc ccc atc1104Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro Ile355 360 365acc caa ggg gac gag gag agc cgc cct ctg gcc aag gag acc ctc ctc1152Thr Gln Gly Asp Glu Glu Ser Arg Pro Leu Ala Lys Glu Thr Leu Leu370 375 380gcc ata gag 
gag cta cgg gag agg ctt ccc ggg gtg ggc ttc gtc ctt1200Ala Ile Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu385 390 395 400
cgg gtc tcc aac gtc tcc ttc ggg ctc aag ccc cgg gcg agg cgc gtc1248Arg Val Ser Asn Val Ser Phe Gly Leu Lys Pro Arg Ala Arg Arg Val405 410 415ctg aac tcc gtc ttc ctg gac gag gcg agg aaa cgg ggc ctc acc gcg1296Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lys Arg Gly Leu Thr Ala420 425 430gcc atc gtg gac gcg ggg aag atc ctc ccc ata agc cag atc ccc gag1344Ala Ile Val Asp Ala Gly Lys Ile Leu Pro Ile Ser Gln Ile Pro Glu435 440 445gag gcc tac gcc ctc gcc tta gac ctc atc tac gac cgc cgc aag gag1392Glu Ala Tyr Ala Leu Ala Leu Asp Leu Ile Tyr Asp Arg Arg Lys Glu450 455 460ggc ttt gac ccc ctc ctc gcc ttc atg gcc tac ttt gag gcc cac aag1440Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys465 470 475 480gag gac ccg ggg aag agg gag gac gcc ttc ctg gcc ctt ccc ctt ctg1488Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu485 490 495gag agg ctc aag cgc cgc gtg gtg gag ggg agg aag cag ggc ctc gag1536Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gln Gly Leu Glu500 505 510gcc gac ctg gag gag gcc ctg aag gcg ggg cac aag ccc ttg gac ctc1584Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu515 520 525atc aac ggc ccc ctc ctc gcg ggg atg aag gag gtg ggg gac ctc ttc1632Ile Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe530 535 540ggg gcg ggg aag atg cag ctc ccc ttc gtc ctc cag gcc gcc gag gtg1680Gly Ala Gly Lys Met Gln Leu Pro Phe Val Leu Gln Ala Ala Glu Val545 550 555 560atg aag cgg gcg gtg gcc tac ctc gag ccc cac atg gag aag aag ggg1728Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly565 570 575gag ggc aag ggt acc ctg gtc ctc gcc acc gtc aag ggg gac gtg cac1776Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His580 585 590gac atc ggc aag aac ctg gtg gac atc atc ctc agc aac aac ggc tac1824Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly Tyr595 600 605cgg gtg gtg aac ctg ggg atc aag gtg ccc att gag gag atc ctg aag1872Arg Val Val Asn Leu Gly Ile Lys Val Pro Ile Glu Glu Ile Leu Lys610 615 620gcc gtg gag gcg 
cac aag ccc cac gcc gtg ggc atg tcg ggc ctc ctg1920Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu625 630 635 640gtg aag agc acc ctg gtg atg aag gag aac ctg gag tac atg cgg gat1968Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp
645 650 655agg ggc tac acc ctc ccc gtg atc ctg ggc ggg gcc gcc ctc acc cgg2016Arg Gly Tyr Thr Leu Pro Val Ile Leu Gly Gly Ala Ala Leu Thr Arg660 665 670agc tac gtg gag gag ctt aag gcc atc tac ccc aac gtc tac tac gcc2064Ser Tyr Val Glu Glu Leu Lys Ala Ile Tyr Pro Asn Val Tyr Tyr Ala675 680 685gag gac gcc ttt gag ggc tta agg ctc atg gag gag ctc acg ggc cac2112Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His690 695 700gcc cct ccc gag ctc acc cgg aag gcc cca gct agg ccc aag cgg gag2160Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu705 710 715 720gcc ccc aag gtg gcg ccc cgc gct cgg ccc gtg ggg gag gcc ccc gcc2208Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala725 730 735gtc ccc cgg ccc ccc ttc ttc ggc gtg cgg gtg gag gaa ggc ttg gac2256Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp740 745 750ctc gcc acc atc gcc cac tac gtc aac aag ctc gcc ctc tac cgg ggc2304Leu Ala Thr Ile Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly755 760 765cag tgg ggc tac agc cgc aag ggc ttt ccc ggg agg cgt ggc agg ccc2352Gln Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly Arg Pro770 775 780tgg tgg agc ggg agg cgg agc ctg tct tcc aga ggc tcc tca agg agg2400Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg785 790 795 800cga tgg cgg aag ggt ggc ttg aac cca agg tcc tct acg gct tct tcc2448Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser805 810 815ccg tgg ccc ggg agg gga gga gct tct cgt ctt ctc ccc aga gac ggg2496Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly820 825 830gga ggt gct gga gcg ctt ccg ctt ccc ccg gca aag ggg cgg ggg cct2544Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro835 840 845gag cct cgt gga cta ctt ccg ccc ccg gtt tgc cgc gcc ttt ggg gga2592Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly850 855 860cga ggc gga ctg gat gcc caa gga ggc ctt ccg ggc ggg ggc cgg gac2640Arg Gly Gly Leu Asp Ala Gln Gly Gly Leu Pro Gly Gly Gly Arg Asp865 870 875 
880gtc ctc ggg gtc cag ctc gtc acc atg ggg gag gcc cct tcc cga aag2688Val Leu Gly Val Gln Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lys885 890 895
gcc cag gcc ctc ttt gcg tcc ggg gcc tac cag gac tac ctc ttc gtc2736Ala Gln Ala Leu Phe Ala Ser Gly Ala Tyr Gln Asp Tyr Leu Phe Val900 905 910cac ggc ttc agc gtg gag atg acc gag gcc ttg gcg gag tac tgg cac2784His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His915 920 925aag agg atg cgg cag atg tgg ggc atc gcc cac aag gac gcc acc gag2832Lys Arg Met Arg Gln Met Trp Gly Ile Ala His Lys Asp Ala Thr Glu930 935 940atc cag aag ctc ttc cag cag ggc tac cag ggg gcc cgc tac tcc ttc2880Ile Gln Lys Leu Phe Gln Gln Gly Tyr Gln Gly Ala Arg Tyr Ser Phe945 950 955 960ggc tac ccc gcc tgc ccg gac ctc gcc gac cag gcc aag ctg gac cgg2928Gly Tyr Pro Ala Cys Pro Asp Leu Ala Asp Gln Ala Lys Leu Asp Arg965 970 975ctc atg ggc ttc cac cgg gtg ggg gtg cac ctc acg gag aac ttc cag2976Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gln980 985 990ctg gag ccg gag cac gcc acc agc gcc ctc gtg gtc cac cac ccc gag3024Leu Glu Pro Glu His Ala Thr Ser Ala Leu Val Val His His Pro Glu995 1000 1005gcc cgc tac ttc agc gtg gac tag3048Ala Arg Tyr Phe Ser Val Asp1010 1015 <210> 10 <211> 1015 <212> PRT <213> Thermus thermophilus <400> 10 Met Arg Ala Tyr Lys Glu Ala Ala Arg Gly Leu Leu Lys Gly Gly Val1 5 10 15Asp Leu Ile Leu Leu Glu Thr Ala Gln Asp Ile Leu Gln Val Arg Cys20 25 30Ala Val Leu Ala Val Arg Glu Ala Met Ala Glu Val Gly Arg Glu Val35 40 45Pro Leu Gln Val Gln Val Thr Phe Glu Ala Thr Gly Thr Met Leu Val50 55 60Gly Thr Asp Glu Gln Ala Ala Leu Ala Ala Leu Glu Ser Leu Pro Val65 70 75 80Asp Val Val Gly Met Asn Cys Ala Thr Gly Pro Asp Leu Met Asp Ser85 90 95Lys Val Arg Tyr Phe Ala Glu His Ser Thr Arg Phe Val Ser Cys Leu100 105 110Pro Asn Ala Gly Leu Pro Arg Asn Glu Gly Gly Arg Val Val Tyr Asp115 120 125
Leu Thr Pro Glu Glu Leu Ala Lys Trp His Leu Lys Phe Val Ala Glu130 135 140Tyr Gly Val Asn Ala Val Gly Gly Cys Cys Gly Thr Gly Pro Glu His145 150 155 160Ile Arg Lys Val Ala Glu Ala Val Lys Gly Leu Ala Pro Lys Pro Arg165 170 175Pro Glu Ser Phe Pro Pro Gln Val Ala Ser Leu Tyr Gln Ala Val Ser180 185 190Leu Lys Gln Glu Ala Ser Leu Phe Leu Val Gly Glu Arg Leu Asn Ala195 200 205Thr Gly Ser Lys Arg Phe Arg Glu Met Leu Phe Ala Arg Asp Leu Glu210 215 220Gly Ile Leu Ala Leu Ala Arg Glu Gln Val Glu Glu Gly Ala His Ala225 230 235 240Leu Asp Leu Ser Val Ala Trp Thr Gly Arg Asp Glu Leu Glu Asp Leu245 250 255Arg Trp Leu Leu Pro His Leu Ala Thr Ala Leu Thr Val Pro Val Met260 265 270Val Asp Ser Thr Ser Pro Glu Ala Met Glu Leu Ala Leu Lys Tyr Leu275 280 285Pro Gly Arg Val Leu Leu Asn Ser Ala Asn Leu Glu Asp Gly Leu Glu290 295 300Arg Phe Asp Arg Val Ala Ser Leu Ala Lys Ala His Gly Ala Ala Leu305 310 315 320Val Val Leu Ala Ile Asp Glu Lys Gly Met Ala Lys Thr Arg Glu Glu325 330 335Lys Val Arg Val Ala Leu Arg Met Tyr Glu Arg Leu Thr Glu His His340 345 350Gly Leu Arg Pro Glu Asp Leu Leu Phe Asp Leu Leu Thr Phe Pro Ile355 360 365Thr Gln Gly Asp Glu Glu Ser Arg Pro Leu Ala Lys Glu Thr Leu Leu370 375 380Ala Ile Glu Glu Leu Arg Glu Arg Leu Pro Gly Val Gly Phe Val Leu385 390 395 400Arg Val Ser Asn Val Ser Phe Gly Leu Lys Pro Arg Ala Arg Arg Val405 410 415Leu Asn Ser Val Phe Leu Asp Glu Ala Arg Lys Arg Gly Leu Thr Ala420 425 430Ala Ile Val Asp Ala Gly Lys Ile Leu Pro Ile Ser Gln Ile Pro Glu435 440 445Glu Ala Tyr Ala Leu Ala Leu Asp Leu Ile Tyr Asp Arg Arg Lys Glu
450 455 460Gly Phe Asp Pro Leu Leu Ala Phe Met Ala Tyr Phe Glu Ala His Lys465 470 475 480Glu Asp Pro Gly Lys Arg Glu Asp Ala Phe Leu Ala Leu Pro Leu Leu485 490 495Glu Arg Leu Lys Arg Arg Val Val Glu Gly Arg Lys Gln Gly Leu Glu500 505 510Ala Asp Leu Glu Glu Ala Leu Lys Ala Gly His Lys Pro Leu Asp Leu515 520 525Ile Asn Gly Pro Leu Leu Ala Gly Met Lys Glu Val Gly Asp Leu Phe530 535 540Gly Ala Gly Lys Met Gln Leu Pro Phe Val Leu Gln Ala Ala Glu Val545 550 555 560Met Lys Arg Ala Val Ala Tyr Leu Glu Pro His Met Glu Lys Lys Gly565 570 575Glu Gly Lys Gly Thr Leu Val Leu Ala Thr Val Lys Gly Asp Val His580 585 590Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser Asn Asn Gly Tyr595 600 605Arg Val Val Asn Leu Gly Ile Lys Val Pro Ile Glu Glu Ile Leu Lys610 615 620Ala Val Glu Ala His Lys Pro His Ala Val Gly Met Ser Gly Leu Leu625 630 635 640Val Lys Ser Thr Leu Val Met Lys Glu Asn Leu Glu Tyr Met Arg Asp645 650 655Arg Gly Tyr Thr Leu Pro Val Ile Leu Gly Gly Ala Ala Leu Thr Arg660 665 670Ser Tyr Val Glu Glu Leu Lys Ala Ile Tyr Pro Asn Val Tyr Tyr Ala675 680 685Glu Asp Ala Phe Glu Gly Leu Arg Leu Met Glu Glu Leu Thr Gly His690 695 700Ala Pro Pro Glu Leu Thr Arg Lys Ala Pro Ala Arg Pro Lys Arg Glu705 710 715 720Ala Pro Lys Val Ala Pro Arg Ala Arg Pro Val Gly Glu Ala Pro Ala725 730 735Val Pro Arg Pro Pro Phe Phe Gly Val Arg Val Glu Glu Gly Leu Asp740 745 750Leu Ala Thr Ile Ala His Tyr Val Asn Lys Leu Ala Leu Tyr Arg Gly755 760 765Gln Trp Gly Tyr Ser Arg Lys Gly Phe Pro Gly Arg Arg Gly Arg Pro770 775 780
Trp Trp Ser Gly Arg Arg Ser Leu Ser Ser Arg Gly Ser Ser Arg Arg785 790 795 800Arg Trp Arg Lys Gly Gly Leu Asn Pro Arg Ser Ser Thr Ala Ser Ser805 810 815Pro Trp Pro Gly Arg Gly Gly Ala Ser Arg Leu Leu Pro Arg Asp Gly820 825 830Gly Gly Ala Gly Ala Leu Pro Leu Pro Pro Ala Lys Gly Arg Gly Pro835 840 845Glu Pro Arg Gly Leu Leu Pro Pro Pro Val Cys Arg Ala Phe Gly Gly850 855 860Arg Gly Gly Leu Asp Ala Gln Gly Gly Leu Pro Gly Gly Gly Arg Asp865 870 875 880Val Leu Gly Val Gln Leu Val Thr Met Gly Glu Ala Pro Ser Arg Lys885 890 895Ala Gln Ala Leu Phe Ala Ser Gly Ala Tyr Gln Asp Tyr Leu Phe Val900 905 910His Gly Phe Ser Val Glu Met Thr Glu Ala Leu Ala Glu Tyr Trp His915 920 925Lys Arg Met Arg Gln Met Trp Gly Ile Ala His Lys Asp Ala Thr Glu930 935 940Ile Gln Lys Leu Phe Gln Gln Gly Tyr Gln Gly Ala Arg Tyr Ser Phe945 950 955 960Gly Tyr Pro Ala Cys Pro Asp Leu Ala Asp Gln Ala Lys Leu Asp Arg965 970 975Leu Met Gly Phe His Arg Val Gly Val His Leu Thr Glu Asn Phe Gln980 985 990Leu Glu Pro Glu His Ala Thr Ser Ala Leu Val Val His His Pro Glu995 1000 1005Ala Arg Tyr Phe Ser Val Asp1010 1015 <210> 11 <211> 3441 <212> DNA <213> Bacillus halodurans <220>
<221> CDS <222> (1)..(3438) <223> RHD05550 <400> 11 atg act aaa tcg ttg ttt gaa caa cag tta gag cga aaa atc gtc atc48Met Thr Lys Ser Leu Phe Glu Gln Gln Leu Glu Arg Lys Ile Val Ile1 5 10 15ctt gat ggg gcg atg ggg acc atg tta caa gcc gcg aat cta acc gct96
Leu Asp Gly Ala Met Gly Thr Met Leu Gln Ala Ala Asn Leu Thr Ala20 25 30gat gac ttt ggc gga gaa gag tat gaa ggg tgt aat gaa tat tta aat144Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn35 40 45gag acg gcc ccc cat gtc gtt gag gac att cat cgc gca tac tta gag192Glu Thr Ala Pro His Val Val Glu Asp Ile His Arg Ala Tyr Leu Glu50 55 60gca gga gca gac gtc att gcg acg aac acg ttc ggg gca aca gat atc240Ala Gly Ala Asp Val Ile Ala Thr Asn Thr Phe Gly Ala Thr Asp Ile65 70 75 80gtt ctt gac gat tat gat ctc gga tac aaa gca gag gag tta aac ata288Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn Ile85 90 95tgc gcg gtg aaa atc gct aaa cgt gta gct gaa gag ttt tcc act cca336Cys Ala Val Lys Ile Ala Lys Arg Val Ala Glu Glu Phe Ser Thr Pro100 105 110gat tgg cct cga ttc gtt gca ggg gcg atg ggg ccg acg acg aaa tct384Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser115 120 125ctt tcc gtc aca ggg ggc gcg aca ttc gaa caa ctt atc gag tct tat432Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gln Leu Ile Glu Ser Tyr130 135 140cgc cag caa gct aca ggt cta att aaa ggc ggg gcg gat att tta tta480Arg Gln Gln Ala Thr Gly Leu Ile Lys Gly Gly Ala Asp Ile Leu Leu145 150 155 160ctc gaa acg agc cag gat atg cga aac gtg aag gcg gct tat tta gga528Leu Glu Thr Ser Gln Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly165 170 175ctg agc caa gcg caa aaa gag cta gag gtg aaa ctg cct ctc att att576Leu Ser Gln Ala Gln Lys Glu Leu Glu Val Lys Leu Pro Leu Ile Ile180 185 190tct gga acg att gaa ccg atg gga aca acg ctc gcc ggc caa aac atc624Ser Gly Thr Ile Glu Pro Met Gly Thr Thr Leu Ala Gly Gln Asn Ile195 200 205gag gcg ttc tat ttg tca tta gag cat atg aat ccc gtc gtt gtc ggt672Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly210 215 220ctc aac tgc gct aca gga cca gaa ttt atg cgc gat cac ctc cgt tct720Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser225 230 235 240ctt tca gac ctt gcg acc tgc tct gta agc tgt tat ccg aat gct ggg768Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cys 
Tyr Pro Asn Ala Gly245 250 255tta cct gat gaa gag ggg aac tat cac gaa tcc cca gaa tca tta gca816Leu Pro Asp Glu Glu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala260 265 270
gcc aag ctc gca ggt ttt gcg gaa aag ggc tgg ttg aat atg gtt ggt864Ala Lys Leu Ala Gly Phe Ala Glu Lys Gly Trp Leu Asn Met Val Gly275 280 285ggc tgt tgc ggg acg act cca gac cac att cgt gct ctt ttg gac gtt912Gly Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Leu Leu Asp Val290 295 300atg aag caa ttt gag ccg aga caa cca aaa ggg gat cac ccc cac tcg960Met Lys Gln Phe Glu Pro Arg Gln Pro Lys Gly Asp His Pro His Ser305 310 315 320gtc tca gga att gag cca ctg tta tac gat gac agc atg cgt cca cta1008Val Ser Gly Ile Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu325 330 335ttt gtc ggt gaa cgg aca aac gtc atc ggg tct cgt aaa ttt aaa cgg1056Phe Val Gly Glu Arg Thr Asn Val Ile Gly Ser Arg Lys Phe Lys Arg340 345 350ttg atc gaa gaa gaa aaa tat gaa gaa gcc tca gaa att gca aga tcc1104Leu Ile Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu Ile Ala Arg Ser355 360 365caa gtg aag aaa ggg gcc cac gtt atc gat gtt tgt ctt gct gat ccg1152Gln Val Lys Lys Gly Ala His Val Ile Asp Val Cys Leu Ala Asp Pro370 375 380gat cgc gat gaa atg gag gac atg gag gaa ttt tta aaa ttc gtg atc1200Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val Ile385 390 395 400aac aaa gtg aag gta ccg ctc atg att gac tcc acc gac gaa aag gta1248Asn Lys Val Lys Val Pro Leu Met Ile Asp Ser Thr Asp Glu Lys Val405 410 415att gaa caa gcg ctt acg tat tca caa ggg aaa gcg atc att aat tcg1296Ile Glu Gln Ala Leu Thr Tyr Ser Gln Gly Lys Ala Ile Ile Asn Ser420 425 430atc aac tta gag gac ggc gaa gaa cgt ttt gaa aaa gtg gtc ccg ctc1344Ile Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu435 440 445gtc cat aag tat gga gcc gcg gtt gtc gtt ggt acg atc gac gaa gaa1392Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr Ile Asp Glu Glu450 455 460gga atg gcg att acg gca gaa aaa aaa tta gcg gtt gcg aaa cga tca1440Gly Met Ala Ile Thr Ala Glu Lys Lys Leu Ala Val Ala Lys Arg Ser465 470 475 480tac gac ctg ctc gta aac aaa tac aac att cgt ccg agc gat att att1488Tyr Asp Leu Leu Val Asn Lys Tyr Asn Ile Arg Pro Ser Asp Ile Ile485 490 495ttt gat ccg ctc 
gtg ttc cca gta gga aca ggc gat gag caa tac att1536Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gln Tyr Ile500 505 510ggc tcg gcg aat gag acg gtg gaa gga att agg agg atc aaa gaa gag1584
Gly Ser Ala Asn Glu Thr Val Glu Gly Ile Arg Arg Ile Lys Glu Glu515 520 525ctc cct gaa tgt tta acg att ctt gga gtt agt aac gtg tcg ttc ggt1632Leu Pro Glu Cys Leu Thr Ile Leu Gly Val Ser Asn Val Ser Phe Gly530 535 540ctt ccg cct gtc gga aga gag gtg ctg aac gcg gcg tac tta tac cat1680Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His545 550 555 560tgt aca caa gct ggc ctt gat tac gct atc gtg aac aca gaa aag ctt1728Cys Thr Gln Ala Gly Leu Asp Tyr Ala Ile Val Asn Thr Glu Lys Leu565 570 575gag cgt tat gcc tcg att tct gat gaa gaa aaa gaa ttg tca agg aag1776Glu Arg Tyr Ala Ser Ile Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys580 585 590ctc tta ttt gaa acg aca gat gaa acg ctc gct gag ttc acc gcc ttt1824Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe595 600 605tat cga ggg aaa aaa gca gag aaa aaa gtg gag act tct aat tta act1872Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr610 615 620ttg gaa gag cgg ttg gca aac tac att gtt gaa ggg tca aag gac gga1920Leu Glu Glu Arg Leu Ala Asn Tyr Ile Val Glu Gly Ser Lys Asp Gly625 630 635 640ctg aca gaa gat tta gat aaa gcg ctc gcg aaa tat gat gat ccg ctt1968Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu645 650 655gat atc att aac ggc ccg ctc atg aat gga atg gac gaa gtc ggt cgt2016Asp Ile Ile Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg660 665 670ttg ttt aac aat aac gag ctt att gtc gct gaa gta ttg caa agc gct2064Leu Phe Asn Asn Asn Glu Leu Ile Val Ala Glu Val Leu Gln Ser Ala675 680 685gag gtt atg aag gct tcc gtc gcc cac ctt gag cca cat atg gaa aag2112Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys690 695 700aaa gca gac gat cat gga aaa gga aaa atc att ctt gcc acg gtc aag2160Lys Ala Asp Asp His Gly Lys Gly Lys Ile Ile Leu Ala Thr Val Lys705 710 715 720ggc gat gtt cac gat atc ggg aaa aat cta gtg gaa att att ttg agc2208Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Glu Ile Ile Leu Ser725 730 735aat aat ggt ttc cgc atc gtg aac cta gga att aaa gtt acc tct aat2256Asn Asn Gly 
Phe Arg Ile Val Asn Leu Gly Ile Lys Val Thr Ser Asn740 745 750gag ctg att gaa gcg gtg gcg aga gaa aat cca gat gcg att ggc ttg2304Glu Leu Ile Glu Ala Val Ala Arg Glu Asn Pro Asp Ala Ile Gly Leu755 760 765
tca ggg ttg ctc gtc aaa tca gca caa caa atg gta ctt acc gcc caa2352Ser Gly Leu Leu Val Lys Ser Ala Gln Gln Met Val Leu Thr Ala Gln770 775 780gat ttg aag caa caa caa att tcc att ccg att tta gtc gga ggc gca2400Asp Leu Lys Gln Gln Gln Ile Ser Ile Pro Ile Leu Val Gly Gly Ala785 790 795 800gcc ctt acg cgg aaa ttt acg aat aca aaa atc gct cca gag tat gat2448Ala Leu Thr Arg Lys Phe Thr Asn Thr Lys Ile Ala Pro Glu Tyr Asp805 810 815ggt ctc gtc gtc tac gcg aag gat gcg atg aac ggg tta gag ctt gcc2496Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala820 825 830aat aaa tta atg aaa cct gat gaa cga gaa aag cta gcg gtc tcc ctc2544Asn Lys Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu835 840 845cat gaa gcg aag gag cag gcg aac tcg agg aca caa atg gga gga ggc2592His Glu Ala Lys Glu Gln Ala Asn Ser Arg Thr Gln Met Gly Gly Gly850 855 860gga act gca gtt gcg gta aag ccg act cga tcc cat gtt tcg aca acg2640Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr865 870 875 880gtg cct gta gcg gtc cca cct gat gtg aag ccg cac att ttg cgc cac2688Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His Ile Leu Arg His885 890 895cat agc att gcc cat tta gag ccg tat att aac atg cag atg ttg tta2736His Ser Ile Ala His Leu Glu Pro Tyr Ile Asn Met Gln Met Leu Leu900 905 910gga cgt cac tta ggc tta caa ggg aaa gtg agc cgc ctg ctt gca gaa2784Gly Arg His Leu Gly Leu Gln Gly Lys Val Ser Arg Leu Leu Ala Glu915 920 925aaa gac gag aag gct ctt gaa tta aaa gaa aaa gtt gat gcg cta ctc2832Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu930 935 940acc agg gtg aaa gag gag cag ctc atg gaa gcc cat ggc atg tat cag2880Thr Arg Val Lys Glu Glu Gln Leu Met Glu Ala His Gly Met Tyr Gln945 950 955 960ttt ttt cct gcc cag tcg gat ggg gac gat att gtc att tat gat caa2928Phe Phe Pro Ala Gln Ser Asp Gly Asp Asp Ile Val Ile Tyr Asp Gln965 970 975acg gga aca aat gaa atc gag cga ttc cat ttt ccg cgt cag aat aag2976Thr Gly Thr Asn Glu Ile Glu Arg Phe His Phe Pro Arg Gln Asn Lys980 985 990gag cct tat 
ctg tgt ctt gcc gat ttc ctt cgc cca gtt tcc agt ggg3024Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly995 10001005gaa atg gac tat gtt ggc ttc ctt gct gta acc gca gga aaa ggc att3072
Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly Ile1010 1015 1020cgt gaa tta ggg gag cag gcg aaa gag gct gga gac tat tta ttc agt3120Arg Glu Leu Gly Glu Gln Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser1025 1030 1035 1040cac tta atc caa gca aca gcc tta gag atg gcg gaa ggg ttt gcc gag3168His Leu Ile Gln Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu1045 1050 1055cgt gtc cat cag ctc atg cgt gat aag tgg ggg ttt cct gat tcg gct3216Arg Val His Gln Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala1060 1065 1070gac ttt aca atg gaa gag cgt ttc gct gca aaa tac cgt ggc atc cgt3264Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Gly Ile Arg1075 1080 1085gta tcg ttt ggc tac cct gca tgc cct gac ttg gat gac caa gca aag3312Val Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gln Ala Lys1090 1095 1100ttg ttt aag ctg ttg aag cct gga aag atc gga att gag ttg acg gaa3360Leu Phe Lys Leu Leu Lys Pro Gly Lys Ile Gly Ile Glu Leu Thr Glu1105 1110 1115 1120ggg ttt atg atg gag cca gaa gcc tcc gtc acc gcg atg gtg ttt gcc3408Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala1125 1130 1135cat cct gag gct cgc tat ttt aat gtt tta tag3441His Pro Glu Ala Arg Tyr Phe Asn Val Leu1140 1145 <210> 12 <211> 1146 <212> PRT <213> Bacillus halodurans <400> 12 Met Thr Lys Ser Leu Phe Glu Gln Gln Leu Glu Arg Lys Ile Val Ile1 5 10 15Leu Asp Gly Ala Met Gly Thr Met Leu Gln Ala Ala Asn Leu Thr Ala20 25 30Asp Asp Phe Gly Gly Glu Glu Tyr Glu Gly Cys Asn Glu Tyr Leu Asn35 40 45Glu Thr Ala Pro His Val Val Glu Asp Ile His Arg Ala Tyr Leu Glu50 55 60Ala Gly Ala Asp Val Ile Ala Thr Asn Thr Phe Gly Ala Thr Asp Ile65 70 75 80Val Leu Asp Asp Tyr Asp Leu Gly Tyr Lys Ala Glu Glu Leu Asn Ile85 90 95Cys Ala Val Lys Ile Ala Lys Arg Val Ala Glu Glu Phe Ser Thr Pro100 105 110
Asp Trp Pro Arg Phe Val Ala Gly Ala Met Gly Pro Thr Thr Lys Ser115 120 125Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Gln Leu Ile Glu Ser Tyr130 135 140Arg Gln Gln Ala Thr Gly Leu Ile Lys Gly Gly Ala Asp Ile Leu Leu145 150 155 160Leu Glu Thr Ser Gln Asp Met Arg Asn Val Lys Ala Ala Tyr Leu Gly165 170 175Leu Ser Gln Ala Gln Lys Glu Leu Glu Val Lys Leu Pro Leu Ile Ile180 185 190Ser Gly Thr Ile Glu Pro Met Gly Thr Thr Leu Ala Gly Gln Asn Ile195 200 205Glu Ala Phe Tyr Leu Ser Leu Glu His Met Asn Pro Val Val Val Gly210 215 220Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Arg Asp His Leu Arg Ser225 230 235 240Leu Ser Asp Leu Ala Thr Cys Ser Val Ser Cys Tyr Pro Asn Ala Gly245 250 255Leu Pro Asp Glu Glu Gly Asn Tyr His Glu Ser Pro Glu Ser Leu Ala260 265 270Ala Lys Leu Ala Gly Phe Ala Glu Lys Gly Trp Leu Asn Met Val Gly275 280 285Gly Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Leu Leu Asp Val290 295 300Met Lys Gln Phe Glu Pro Arg Gln Pro Lys Gly Asp His Pro His Ser305 310 315 320Val Ser Gly Ile Glu Pro Leu Leu Tyr Asp Asp Ser Met Arg Pro Leu325 330 335Phe Val Gly Glu Arg Thr Asn Val Ile Gly Ser Arg Lys Phe Lys Arg340 345 350Leu Ile Glu Glu Glu Lys Tyr Glu Glu Ala Ser Glu Ile Ala Arg Ser355 360 365Gln Val Lys Lys Gly Ala His Val Ile Asp Val Cys Leu Ala Asp Pro370 375 380Asp Arg Asp Glu Met Glu Asp Met Glu Glu Phe Leu Lys Phe Val Ile385 390 395 400Asn Lys Val Lys Val Pro Leu Met Ile Asp Ser Thr Asp Glu Lys Val405 410 415Ile Glu Gln Ala Leu Thr Tyr Ser Gln Gly Lys Ala Ile Ile Asn Ser420 425 430Ile Asn Leu Glu Asp Gly Glu Glu Arg Phe Glu Lys Val Val Pro Leu
435 440 445Val His Lys Tyr Gly Ala Ala Val Val Val Gly Thr Ile Asp Glu Glu450 455 460Gly Met Ala Ile Thr Ala Glu Lys Lys Leu Ala Val Ala Lys Arg Ser465 470 475 480Tyr Asp Leu Leu Val Asn Lys Tyr Asn Ile Arg Pro Ser Asp Ile Ile485 490 495Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gln Tyr Ile500 505 510Gly Ser Ala Asn Glu Thr Val Glu Gly Ile Arg Arg Ile Lys Glu Glu515 520 525Leu Pro Glu Cys Leu Thr Ile Leu Gly Val Ser Asn Val Ser Phe Gly530 535 540Leu Pro Pro Val Gly Arg Glu Val Leu Asn Ala Ala Tyr Leu Tyr His545 550 555 560Cys Thr Gln Ala Gly Leu Asp Tyr Ala Ile Val Asn Thr Glu Lys Leu565 570 575Glu Arg Tyr Ala Ser Ile Ser Asp Glu Glu Lys Glu Leu Ser Arg Lys580 585 590Leu Leu Phe Glu Thr Thr Asp Glu Thr Leu Ala Glu Phe Thr Ala Phe595 600 605Tyr Arg Gly Lys Lys Ala Glu Lys Lys Val Glu Thr Ser Asn Leu Thr610 615 620Leu Glu Glu Arg Leu Ala Asn Tyr Ile Val Glu Gly Ser Lys Asp Gly625 630 635 640Leu Thr Glu Asp Leu Asp Lys Ala Leu Ala Lys Tyr Asp Asp Pro Leu645 650 655Asp Ile Ile Asn Gly Pro Leu Met Asn Gly Met Asp Glu Val Gly Arg660 665 670Leu Phe Asn Asn Asn Glu Leu Ile Val Ala Glu Val Leu Gln Ser Ala675 680 685Glu Val Met Lys Ala Ser Val Ala His Leu Glu Pro His Met Glu Lys690 695 700Lys Ala Asp Asp His Gly Lys Gly Lys Ile Ile Leu Ala Thr Val Lys705 710 715 720Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Glu Ile Ile Leu Ser725 730 735Asn Asn Gly Phe Arg Ile Val Asn Leu Gly Ile Lys Val Thr Ser Asn740 745 750Glu Leu Ile Glu Ala Val Ala Arg Glu Asn Pro Asp Ala Ile Gly Leu755 760 765
Ser Gly Leu Leu Val Lys Ser Ala Gln Gln Met Val Leu Thr Ala Gln770 775 780Asp Leu Lys Gln Gln Gln Ile Ser Ile Pro Ile Leu Val Gly Gly Ala785 790 795 800Ala Leu Thr Arg Lys Phe Thr Asn Thr Lys Ile Ala Pro Glu Tyr Asp805 810 815Gly Leu Val Val Tyr Ala Lys Asp Ala Met Asn Gly Leu Glu Leu Ala820 825 830Asn Lys Leu Met Lys Pro Asp Glu Arg Glu Lys Leu Ala Val Ser Leu835 840 845His Glu Ala Lys Glu Gln Ala Asn Ser Arg Thr Gln Met Gly Gly Gly850 855 860Gly Thr Ala Val Ala Val Lys Pro Thr Arg Ser His Val Ser Thr Thr865 870 875 880Val Pro Val Ala Val Pro Pro Asp Val Lys Pro His Ile Leu Arg His885 890 895His Ser Ile Ala His Leu Glu Pro Tyr Ile Asn Met Gln Met Leu Leu900 905 910Gly Arg His Leu Gly Leu Gln Gly Lys Val Ser Arg Leu Leu Ala Glu915 920 925Lys Asp Glu Lys Ala Leu Glu Leu Lys Glu Lys Val Asp Ala Leu Leu930 935 940Thr Arg Val Lys Glu Glu Gln Leu Met Glu Ala His Gly Met Tyr Gln945 950 955 960Phe Phe Pro Ala Gln Ser Asp Gly Asp Asp Ile Val Ile Tyr Asp Gln965 970 975Thr Gly Thr Asn Glu Ile Glu Arg Phe His Phe Pro Arg Gln Asn Lys980 985 990Glu Pro Tyr Leu Cys Leu Ala Asp Phe Leu Arg Pro Val Ser Ser Gly99510001005Glu Met Asp Tyr Val Gly Phe Leu Ala Val Thr Ala Gly Lys Gly Ile101010151020Arg Glu Leu Gly Glu Gln Ala Lys Glu Ala Gly Asp Tyr Leu Phe Ser1025 103010351040His Leu Ile Gln Ala Thr Ala Leu Glu Met Ala Glu Gly Phe Ala Glu104510501055Arg Val His Gln Leu Met Arg Asp Lys Trp Gly Phe Pro Asp Ser Ala106010651070Asp Phe Thr Met Glu Glu Arg Phe Ala Ala Lys Tyr Arg Gly Ile Arg107510801085Val Ser Phe Gly Tyr Pro Ala Cys Pro Asp Leu Asp Asp Gln Ala Lys109010951100
Leu Phe Lys Leu Leu Lys Pro Gly Lys Ile Gly Ile Glu Leu Thr Glu1105 1110 1115 1120Gly Phe Met Met Glu Pro Glu Ala Ser Val Thr Ala Met Val Phe Ala1125 1130 1135His Pro Glu Ala Arg Tyr Phe Asn Val Leu1140 1145 <210> 13 <211> 3411 <212> DNA <213> Bacillus stearothermophilus <220>
<221> CDS <222> (1)..(3408) <223> RBE02044 <400> 13 atg gct aac gtc acc tta gaa cag caa ctg caa aga aaa att ctt gtc48Met Ala Asn Val Thr Leu Glu Gln Gln Leu Gln Arg Lys Ile Leu Val1 5 10 15atc gat ggc gcc atg ggc acg atg atc caa agc gcc aac cta tcg gcc96Ile Asp Gly Ala Met Gly Thr Met Ile Gln Ser Ala Asn Leu Ser Ala20 25 30gcc gac ttt ggc ggc gag gcg tat gaa ggg tgc aac gaa tat ttg acc144Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly Cys Asn Glu Tyr Leu Thr35 40 45ctc acc gcc ccg cat gtc atc cgc cgc att cat gaa gcg tac cta gaa192Leu Thr Ala Pro His Val Ile Arg Arg Ile His Glu Ala Tyr Leu Glu50 55 60gcc ggt gct gat atc att gaa acg aac acg ttc gga gcg aca cgc atc240Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr Phe Gly Ala Thr Arg Ile65 70 75 80gtg ctt gac gaa tat ggc ctc ggt cat ttg gcg ctt gag ctg aac atc288Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn Ile85 90 95gaa gcg gcc aaa ctc gcc aaa caa acg gct gag tcg ttc tcc acc ccg336Glu Ala Ala Lys Leu Ala Lys Gln Thr Ala Glu Ser Phe Ser Thr Pro100 105 110gac tgg ccg cgc ttt gtc gcc ggt tcg atg ggg ccg acg acg aaa acg384Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr115 120 125ttg tcg gtg aca ggc ggc gca acg ttt gaa gaa ctc gtc gcc gcc tac432Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr130 135 140gaa gaa caa gcg cgc gga ctg ctc tta gga ggc gtc gac ctt ctc cta480Glu Glu Gln Ala Arg Gly Leu Leu Leu Gly Gly Val Asp Leu Leu Leu145 150 155 160
ctc gag acg tgc caa gat acg ctg aat gtc aaa gcc ggt ttt ctc ggc  528
Leu Glu Thr Cys Gln Asp Thr Leu Asn Val Lys Ala Gly Phe Leu Gly
165 170 175
att tcg aag gcg ttt gaa gcg gtc ggc cgc cgc gtg ccg ctc atg att  576
Ile Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met Ile
180 185 190
tcc ggc acg atc gaa ccg atg ggc acg acg ctc gcc ggg cag gcg atc  624
Ser Gly Thr Ile Glu Pro Met Gly Thr Thr Leu Ala Gly Gln Ala Ile
195 200 205
gat gcg ttt ttc atc tcg gtg cgc cat atg aag ccg atc gcc gtc ggc  672
Asp Ala Phe Phe Ile Ser Val Arg His Met Lys Pro Ile Ala Val Gly
210 215 220
tta aac tgc gca acc ggt ccg gag ttt atg acc gac cat ttg cgc acg  720
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr
225 230 235 240
ctc gcc tcg ctc gct gac acg gcg gtc agc tgc tac ccg aac gcc ggt  768
Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cys Tyr Pro Asn Ala Gly
245 250 255
ctg ccg gat gag gaa ggc cac tat cat gaa acg ccg aat atg ctg gca  816
Leu Pro Asp Glu Glu Gly His Tyr His Glu Thr Pro Asn Met Leu Ala
260 265 270
gag aaa atc cgc cgc ttt gcc gaa aag gga tgg atc aac atc gtc ggc  864
Glu Lys Ile Arg Arg Phe Ala Glu Lys Gly Trp Ile Asn Ile Val Gly
275 280 285
ggg tgt tgc ggc acg acg ccg gat cat atc cgc gcc att gct gaa gcg  912
Gly Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Ile Ala Glu Ala
290 295 300
gtg cgt gat ctc ccg ccg cgg gcg att ccg tct tcg ttt gat gtc cac  960
Val Arg Asp Leu Pro Pro Arg Ala Ile Pro Ser Ser Phe Asp Val His
305 310 315 320
gcc gtt tcc ggc atc gag gcg ctc atc tat gat gaa acg atg cgc ccg  1008
Ala Val Ser Gly Ile Glu Ala Leu Ile Tyr Asp Glu Thr Met Arg Pro
325 330 335
ctc ttt gtc ggc gag cgg aca aac gtg atc ggc tcg cgc aaa ttc aag  1056
Leu Phe Val Gly Glu Arg Thr Asn Val Ile Gly Ser Arg Lys Phe Lys
340 345 350
cgc ctc atc gcc gaa ggg aaa tac gaa gaa gcg gcg gaa atc gcc cgc  1104
Arg Leu Ile Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu Ile Ala Arg
355 360 365
gcc caa gtg aaa aac ggc gcc cat gtc atc gac att tgc ctc gcc gac  1152
Ala Gln Val Lys Asn Gly Ala His Val Ile Asp Ile Cys Leu Ala Asp
370 375 380
cca gac cgc gac gaa ctc cat gac atg gag cag ttc gtc cgc gaa gtc  1200
Pro Asp Arg Asp Glu Leu His Asp Met Glu Gln Phe Val Arg Glu Val
385 390 395 400
gtg aaa aaa gtg aaa gtg ccg ctt gtc atc gat tcg acc gac gag cgc  1248
Val Lys Lys Val Lys Val Pro Leu Val Ile Asp Ser Thr Asp Glu Arg
405 410 415
gtc atc gaa cgc gcc ctt acg tat tcg caa ggg aag gcg atc atc aac  1296
Val Ile Glu Arg Ala Leu Thr Tyr Ser Gln Gly Lys Ala Ile Ile Asn
420 425 430
tcg atc aac ctc gaa gat ggc gaa gag cgg ttt gcg aag gtc gtt cct  1344
Ser Ile Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro
435 440 445
ctc ctg cat caa tac ggc gcc gcc gtt gtc gtc ggc acg atc gat gag  1392
Leu Leu His Gln Tyr Gly Ala Ala Val Val Val Gly Thr Ile Asp Glu
450 455 460
caa gga atg gcg gtt aca gcc gaa cgg aaa ttg gaa atc gcc ttg cgt  1440
Gln Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu Ile Ala Leu Arg
465 470 475 480
tcg tat gac ttg ctg gtg aac cgc tac ggc gtc ccc gag cgc gac atc  1488
Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp Ile
485 490 495
att ttc gac ccg ctc gtc ttc ccg gtc ggc acc ggc gat gag caa tac  1536
Ile Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gln Tyr
500 505 510
atc ggc gcg gcg aaa gaa acc att gag ggc atc cgc ctc att aaa gag  1584
Ile Gly Ala Ala Lys Glu Thr Ile Glu Gly Ile Arg Leu Ile Lys Glu
515 520 525
cgg ctg cct cat tgc ttg acg atg ctt ggc atc agc aac gtc tcg ttc  1632
Arg Leu Pro His Cys Leu Thr Met Leu Gly Ile Ser Asn Val Ser Phe
530 535 540
ggc ttg ccg ccg gcc gga cgc gag gtg ctc aac tcc gtc ttt ttg tac  1680
Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr
545 550 555 560
cat tgc acg caa gcc ggg ctc gat tac gcc atc gtc aac acc gag aaa  1728
His Cys Thr Gln Ala Gly Leu Asp Tyr Ala Ile Val Asn Thr Glu Lys
565 570 575
ttg gag cgg ttc gcc tcg att ccg gaa gag gaa gtg cga atg gct gag  1776
Leu Glu Arg Phe Ala Ser Ile Pro Glu Glu Glu Val Arg Met Ala Glu
580 585 590
gca ctt ctt ttt gac aca aac gac gaa aca tta aac gcc ttt atc gaa  1824
Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe Ile Glu
595 600 605
ttt tac cga agc aaa atc acc gcc gcc aaa ccg gcg cag acg aac ttg  1872
Phe Tyr Arg Ser Lys Ile Thr Ala Ala Lys Pro Ala Gln Thr Asn Leu
610 615 620
agc ttg gaa gag cgg ctc gcc cgc tac gtt att gaa ggg tcg aaa gac  1920
Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val Ile Glu Gly Ser Lys Asp
625 630 635 640
ggg ctc att ctc gat ttg gaa aag gcg ctt gag acc tac tcc gat ccg  1968
Gly Leu Ile Leu Asp Leu Glu Lys Ala Leu Glu Thr Tyr Ser Asp Pro
645 650 655
ctg tcc atc atc aac ggt ccg ctc atg gcc ggc atg gat gaa gtc ggg2016Leu Ser Ile Ile Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly660 665 670cgg ctg ttc aac aac aac cag ctc atc gtc gct gaa gta ttg caa agc2064Arg Leu Phe Asn Asn Asn Gln Leu Ile Val Ala Glu Val Leu Gln Ser675 680 685gcg gaa gtg atg aaa gca gcg gtc gcc ttt tta gag ctg tat atg gaa2112Ala Glu Val Met Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu690 695 700aag aaa gaa gga agc aca aaa gga aaa gtc att ctc gcc acc gtc aaa2160Lys Lys Glu Gly Ser Thr Lys Gly Lys Val Ile Leu Ala Thr Val Lys705 710 715 720ggc gat gtg cat gac atc ggc aaa aac ttg gtc gac atc att tta agc2208Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser725 730 735aac aac ggc tac gag gtg atc gac ctc ggc att aaa gtc gct ccg cag2256Asn Asn Gly Tyr Glu Val Ile Asp Leu Gly Ile Lys Val Ala Pro Gln740 745 750caa ctc att gaa gcg gtg cgc gaa cat cag ccg gac atc atc ggg ttg2304Gln Leu Ile Glu Ala Val Arg Glu His Gln Pro Asp Ile Ile Gly Leu755 760 765tcg ggc ttg ctt gtg aaa tcg gct caa cag atg gtc gtc acc gcc caa2352Ser Gly Leu Leu Val Lys Ser Ala Gln Gln Met Val Val Thr Ala Gln770 775 780gac ttg cgc caa gcg ggc atc tcg acc ccg att tta gtc ggc ggc gcc2400Asp Leu Arg Gln Ala Gly Ile Ser Thr Pro Ile Leu Val Gly Gly Ala785 790 795 800gcc ttg acg cgc aaa ttt acg gaa aac aaa atc gcg ccc gag tac gac2448Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys Ile Ala Pro Glu Tyr Asp805 810 815ggc gtt gtc ttg tac gcg aaa gac gcc atg gac ggg ctc gcc ctt gcc2496Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala820 825 830aac caa atc cag cag ggc gag att gac tac aag aaa aaa gaa acg gcc2544Asn Gln Ile Gln Gln Gly Glu Ile Asp Tyr Lys Lys Lys Glu Thr Ala835 840 845gaa agc gag cca acg cgg caa acg acg gtg gtc aca gcg gtc aaa tcg2592Glu Ser Glu Pro Thr Arg Gln Thr Thr Val Val Thr Ala Val Lys Ser850 855 860acc gtc tcg acc gac gtt ccc gtc tac atc ccg gcc gat ctc gag cgc2640Thr Val Ser Thr Asp Val Pro Val Tyr Ile Pro Ala Asp Leu Glu Arg865 870 875 880cac gcg ctg 
cga aat gtg ccg ctt gac cac att ttg ccg tac gtc aac  2688
His Ala Leu Arg Asn Val Pro Leu Asp His Ile Leu Pro Tyr Val Asn
885 890 895
tgg caa atg gtg ctc ggc cac cac ctc ggc ttg aaa gga aaa gtg aaa  2736
Trp Gln Met Val Leu Gly His His Leu Gly Leu Lys Gly Lys Val Lys
900 905 910cgg ctg ctt gaa gag aaa gac gaa aaa gcg ttg gcg tta aaa gcg gtc2784Arg Leu Leu Glu Glu Lys Asp Glu Lys Ala Leu Ala Leu Lys Ala Val915 920 925gtc gac gaa ctg ctc gcc gaa gcg aaa gag cgc cgc tgg att cag ccc2832Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp Ile Gln Pro930 935 940gcc ggc gtc tac cgc ttc ttc ccg gcg caa agc gac ggc aac cgg gtt2880Ala Gly Val Tyr Arg Phe Phe Pro Ala Gln Ser Asp Gly Asn Arg Val945 950 955 960tac att tac gat ccg act gac ggc aaa aca gtg ctc gag atg ttc gac2928Tyr Ile Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp965 970 975ttt ccg cgc caa ccg cgg gcg ccg tat ctt tgc ctc gcc gat tat ttg2976Phe Pro Arg Gln Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu980 985 990aaa tcg aaa gaa agc ggc gaa atg gat tac gtc ggt ttg ttc gcc gtc3024Lys Ser Lys Glu Ser Gly Glu Met Asp Tyr Val Gly Leu Phe Ala Val99510001005acc gct ggg cat ggc gtc cgc gaa ctc gcc cag cgc tgg aag gaa gaa3072Thr Ala Gly His Gly Val Arg Glu Leu Ala Gln Arg Trp Lys Glu Glu101010151020ggc gaa ttt ttg aaa agc cat gcc atc caa gcg ttg gcg ctc gag att3120Gly Glu Phe Leu Lys Ser His Ala Ile Gln Ala Leu Ala Leu Glu Ile1025 103010351040gcc gaa ggg ttc gcc gaa cga atc cat caa att atg cgc gac cgc tgg3168Ala Glu Gly Phe Ala Glu Arg Ile His Gln Ile Met Arg Asp Arg Trp104510501055ggc ttc ccg gac gac ccg gat ttc acg atg gaa gag cgc ttc gcc gcc3216Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg Phe Ala Ala106010651070aaa tac cag ggc cag cgc tac tcg ttc ggc tac ccg gcc tgt ccg aac3264Lys Tyr Gln Gly Gln Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn1075 1080 1085ttg gaa gac cag gag aaa ctg ttc cgt ctg ctt cat cca gaa gac atc3312Leu Glu Asp Gln Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp Ile109010951100ggc atc cgt ctc acc gac ggc tat atg atg gaa ccc gaa gca tcg gtt3360Gly Ile Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val1105 111011151120tcg gcg atc gtc ttc gcc cat ccg gaa gcg cgg tat ttc aat gtg tta3408Ser Ala Ile Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val 
Leu
1125 1130 1135
taa  3411

<210> 14
<211> 1136
<212> PRT
<213> Bacillus stearothermophilus
<400> 14
Met Ala Asn Val Thr Leu Glu Gln Gln Leu Gln Arg Lys Ile Leu Val
1 5 10 15
Ile Asp Gly Ala Met Gly Thr Met Ile Gln Ser Ala Asn Leu Ser Ala
20 25 30
Ala Asp Phe Gly Gly Glu Ala Tyr Glu Gly Cys Asn Glu Tyr Leu Thr
35 40 45
Leu Thr Ala Pro His Val Ile Arg Arg Ile His Glu Ala Tyr Leu Glu
50 55 60
Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr Phe Gly Ala Thr Arg Ile
65 70 75 80
Val Leu Asp Glu Tyr Gly Leu Gly His Leu Ala Leu Glu Leu Asn Ile
85 90 95
Glu Ala Ala Lys Leu Ala Lys Gln Thr Ala Glu Ser Phe Ser Thr Pro
100 105 110
Asp Trp Pro Arg Phe Val Ala Gly Ser Met Gly Pro Thr Thr Lys Thr
115 120 125
Leu Ser Val Thr Gly Gly Ala Thr Phe Glu Glu Leu Val Ala Ala Tyr
130 135 140
Glu Glu Gln Ala Arg Gly Leu Leu Leu Gly Gly Val Asp Leu Leu Leu
145 150 155 160
Leu Glu Thr Cys Gln Asp Thr Leu Asn Val Lys Ala Gly Phe Leu Gly
165 170 175
Ile Ser Lys Ala Phe Glu Ala Val Gly Arg Arg Val Pro Leu Met Ile
180 185 190
Ser Gly Thr Ile Glu Pro Met Gly Thr Thr Leu Ala Gly Gln Ala Ile
195 200 205
Asp Ala Phe Phe Ile Ser Val Arg His Met Lys Pro Ile Ala Val Gly
210 215 220
Leu Asn Cys Ala Thr Gly Pro Glu Phe Met Thr Asp His Leu Arg Thr
225 230 235 240
Leu Ala Ser Leu Ala Asp Thr Ala Val Ser Cys Tyr Pro Asn Ala Gly
245 250 255
Leu Pro Asp Glu Glu Gly His Tyr His Glu Thr Pro Asn Met Leu Ala
260 265 270
Glu Lys Ile Arg Arg Phe Ala Glu Lys Gly Trp Ile Asn Ile Val Gly
275 280 285
Gly Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Ile Ala Glu Ala
290 295 300
Val Arg Asp Leu Pro Pro Arg Ala Ile Pro Ser Ser Phe Asp Val His305 310 315 320Ala Val Ser Gly Ile Glu Ala Leu Ile Tyr Asp Glu Thr Met Arg Pro325 330 335Leu Phe Val Gly Glu Arg Thr Asn Val Ile Gly Ser Arg Lys Phe Lys340 345 350Arg Leu Ile Ala Glu Gly Lys Tyr Glu Glu Ala Ala Glu Ile Ala Arg355 360 365Ala Gln Val Lys Asn Gly Ala His Val Ile Asp Ile Cys Leu Ala Asp370 375 380Pro Asp Arg Asp Glu Leu His Asp Met Glu Gln Phe Val Arg Glu Val385 390 395 400Val Lys Lys Val Lys Val Pro Leu Val Ile Asp Ser Thr Asp Glu Arg405 410 415Val Ile Glu Arg Ala Leu Thr Tyr Ser Gln Gly Lys Ala Ile Ile Asn420 425 430Ser Ile Asn Leu Glu Asp Gly Glu Glu Arg Phe Ala Lys Val Val Pro435 440 445Leu Leu His Gln Tyr Gly Ala Ala Val Val Val Gly Thr Ile Asp Glu450 455 460Gln Gly Met Ala Val Thr Ala Glu Arg Lys Leu Glu Ile Ala Leu Arg465 470 475 480Ser Tyr Asp Leu Leu Val Asn Arg Tyr Gly Val Pro Glu Arg Asp Ile485 490 495Ile Phe Asp Pro Leu Val Phe Pro Val Gly Thr Gly Asp Glu Gln Tyr500 505 510Ile Gly Ala Ala Lys Glu Thr Ile Glu Gly Ile Arg Leu Ile Lys Glu515 520 525Arg Leu Pro His Cys Leu Thr Met Leu Gly Ile Ser Asn Val Ser Phe530 535 540Gly Leu Pro Pro Ala Gly Arg Glu Val Leu Asn Ser Val Phe Leu Tyr545 550 555 560His Cys Thr Gln Ala Gly Leu Asp Tyr Ala Ile Val Asn Thr Glu Lys565 570 575Leu Glu Arg Phe Ala Ser Ile Pro Glu Glu Glu Val Arg Met Ala Glu580 585 590Ala Leu Leu Phe Asp Thr Asn Asp Glu Thr Leu Asn Ala Phe Ile Glu595 600 605Phe Tyr Arg Ser Lys Ile Thr Ala Ala Lys Pro Ala Gln Thr Asn Leu610 615 620Ser Leu Glu Glu Arg Leu Ala Arg Tyr Val Ile Glu Gly Ser Lys Asp625 630 635 640
Gly Leu Ile Leu Asp Leu Glu Lys Ala Leu Glu Thr Tyr Ser Asp Pro645 650 655Leu Ser Ile Ile Asn Gly Pro Leu Met Ala Gly Met Asp Glu Val Gly660 665 670Arg Leu Phe Asn Asn Asn Gln Leu Ile Val Ala Glu Val Leu Gln Ser675 680 685Ala Glu Val Met Lys Ala Ala Val Ala Phe Leu Glu Leu Tyr Met Glu690 695 700Lys Lys Glu Gly Ser Thr Lys Gly Lys Val Ile Leu Ala Thr Val Lys705 710 715 720Gly Asp Val His Asp Ile Gly Lys Asn Leu Val Asp Ile Ile Leu Ser725 730 735Asn Asn Gly Tyr Glu Val Ile Asp Leu Gly Ile Lys Val Ala Pro Gln740 745 750Gln Leu Ile Glu Ala Val Arg Glu His Gln Pro Asp Ile Ile Gly Leu755 760 765Ser Gly Leu Leu Val Lys Ser Ala Gln Gln Met Val Val Thr Ala Gln770 775 780Asp Leu Arg Gln Ala Gly Ile Ser Thr Pro Ile Leu Val Gly Gly Ala785 790 795 800Ala Leu Thr Arg Lys Phe Thr Glu Asn Lys Ile Ala Pro Glu Tyr Asp805 810 815Gly Val Val Leu Tyr Ala Lys Asp Ala Met Asp Gly Leu Ala Leu Ala820 825 830Asn Gln Ile Gln Gln Gly Glu Ile Asp Tyr Lys Lys Lys Glu Thr Ala835 840 845Glu Ser Glu Pro Thr Arg Gln Thr Thr Val Val Thr Ala Val Lys Ser850 855 860Thr Val Ser Thr Asp Val Pro Val Tyr Ile Pro Ala Asp Leu Glu Arg865 870 875 880His Ala Leu Arg Asn Val Pro Leu Asp His Ile Leu Pro Tyr Val Asn885 890 895Trp Gln Met Val Leu Gly His His Leu Gly Leu Lys Gly Lys Val Lys900 905 910Arg Leu Leu Glu Glu Lys Asp Glu Lys Ala Leu Ala Leu Lys Ala Val915 920 925Val Asp Glu Leu Leu Ala Glu Ala Lys Glu Arg Arg Trp Ile Gln Pro930 935 940Ala Gly Val Tyr Arg Phe Phe Pro Ala Gln Ser Asp Gly Asn Arg Val945 950 955 960Tyr Ile Tyr Asp Pro Thr Asp Gly Lys Thr Val Leu Glu Met Phe Asp
965 970 975
Phe Pro Arg Gln Pro Arg Ala Pro Tyr Leu Cys Leu Ala Asp Tyr Leu
980 985 990
Lys Ser Lys Glu Ser Gly Glu Met Asp Tyr Val Gly Leu Phe Ala Val
995 1000 1005
Thr Ala Gly His Gly Val Arg Glu Leu Ala Gln Arg Trp Lys Glu Glu
1010 1015 1020
Gly Glu Phe Leu Lys Ser His Ala Ile Gln Ala Leu Ala Leu Glu Ile
1025 1030 1035 1040
Ala Glu Gly Phe Ala Glu Arg Ile His Gln Ile Met Arg Asp Arg Trp
1045 1050 1055
Gly Phe Pro Asp Asp Pro Asp Phe Thr Met Glu Glu Arg Phe Ala Ala
1060 1065 1070
Lys Tyr Gln Gly Gln Arg Tyr Ser Phe Gly Tyr Pro Ala Cys Pro Asn
1075 1080 1085
Leu Glu Asp Gln Glu Lys Leu Phe Arg Leu Leu His Pro Glu Asp Ile
1090 1095 1100
Gly Ile Arg Leu Thr Asp Gly Tyr Met Met Glu Pro Glu Ala Ser Val
1105 1110 1115 1120
Ser Ala Ile Val Phe Ala His Pro Glu Ala Arg Tyr Phe Asn Val Leu
1125 1130 1135

<210> 15
<211> 3681
<212> DNA
<213> Vibrio cholerae
<220>
<221> CDS
<222> (1)..(3678)
<223> RVC04265
<400> 15
gtg gga aaa gaa gta aga caa caa ctc gaa cag caa ttg aaa caa cgt  48
Val Gly Lys Glu Val Arg Gln Gln Leu Glu Gln Gln Leu Lys Gln Arg
1 5 10 15
atc cta ctg att gat ggt ggt atg ggt acc atg att cag agt tat aag  96
Ile Leu Leu Ile Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Lys
20 25 30
tta caa gag gaa gac tat cgc ggt gca cga ttt gtc gat tgg cac tgt  144
Leu Gln Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys
35 40 45
gat ttg aaa gga aat aac gac ctc tta gtg ctt act cag ccg caa att  192
Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Gln Pro Gln Ile
50 55 60
att aaa gag att cac tcc gct tac ctt gaa gcg ggg gcg gat att ctt240Ile Lys Glu Ile His Ser Ala Tyr Leu Glu Ala Gly Ala Asp Ile Leu65 70 75 80gag acc aac acc ttt aac tca acc acg att gcc atg gca gac tat gac288Glu Thr Asn Thr Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Asp85 90 95atg caa tcg ctc agt gct gaa att aac ttt gcc gcg gct aag ctt gca336Met Gln Ser Leu Ser Ala Glu Ile Asn Phe Ala Ala Ala Lys Leu Ala100 105 110cgt gaa gtc gcg gat gag tgg acg gct aaa gat cca agt cgg cca cgc384Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg115 120 125tat gtg gct ggt gtg ctt ggg cca acc aac cgt act tgc tct att tcg432Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser Ile Ser130 135 140cca gat gtg aac gat cca gga ttt cgt aac gtc act ttt gat ggg ctt480Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu145 150 155 160gtt gaa gcc tat tcc gaa tcg acg cgc gct ttg atc aaa ggt ggc agc528Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu Ile Lys Gly Gly Ser165 170 175gat ctg atc ctc att gaa acc atc ttc gat aca ctt aac gcc aaa gcc576Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala180 185 190tgt gcg ttt gcg gtc gat agc gta ttt gaa gag ctg ggc atc agc tta624Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly Ile Ser Leu195 200 205cct gtg atg att tcc ggc acg att acc gat gcc tct ggg cga act ctg672Pro Val Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu210 215 220tca gga cag aca acg gaa gct ttc tac aac gcc ttg cgt cat gta cgg720Ser Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg225 230 235 240ccg att tcg ttt ggc ttg aac tgt gcg tta ggt cct gat gag ctg cgc768Pro Ile Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg245 250 255cag tac gtg gaa gag ctt tca cgc att tca gaa tgc tat gtt tcc gcg816Gln Tyr Val Glu Glu Leu Ser Arg Ile Ser Glu Cys Tyr Val Ser Ala260 265 270cac cca aat gcc gga ctg ccc aat gcg ttt ggt gaa tac gat ctc tct864His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser275 280 285gcc gag gaa atg gca gaa cat att 
gcg gaa tgg gca caa gct ggc ttt  912
Ala Glu Glu Met Ala Glu His Ile Ala Glu Trp Ala Gln Ala Gly Phe
290 295 300
ttg aat ttg gtc ggt ggt tgc tgt gga act aca cct gag cat atc gcc  960
Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala
305 310 315 320
gcc att gcc aaa gcc gtc gag ggt gta aaa cca agg gct ctg cca gat  1008
Ala Ile Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala Leu Pro Asp
325 330 335
ctg aaa gta gaa tgt cgt ctc tcg ggt tta gag ccg ctc aat att ggt  1056
Leu Lys Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly
340 345 350
cct gaa acc ttg ttt gtt aac gtg ggc gaa cgt act aac gtc acc ggt  1104
Pro Glu Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly
355 360 365
tct gcg cgt ttt aag cgt tta att aaa gaa gag caa tac gac gaa gcg  1152
Ser Ala Arg Phe Lys Arg Leu Ile Lys Glu Glu Gln Tyr Asp Glu Ala
370 375 380
ctc gat gtg gcg cgt gag caa gtc gaa aac ggc gcg cag atc att gat  1200
Leu Asp Val Ala Arg Glu Gln Val Glu Asn Gly Ala Gln Ile Ile Asp
385 390 395 400
atc aac atg gat gaa ggc atg ttg gac gcc gag gcg tgt atg gtg cgc  1248
Ile Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg
405 410 415
ttt ttg aat cta tgc gcc tct gaa cca gaa ata tcc aaa gtt ccg gtg  1296
Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu Ile Ser Lys Val Pro Val
420 425 430
atg gtc gac tcc tct aaa tgg gaa gtc att gaa gcg ggt ctg aaa tgc  1344
Met Val Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys Cys
435 440 445
att cag ggt aaa ggc atc gtc aac tct atc tct cta aaa gaa ggg aaa  1392
Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Lys
450 455 460
gag aag ttt att gcc caa gcc aaa ttg gtg cgc cgc tac ggt gcc gcg  1440
Glu Lys Phe Ile Ala Gln Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala
465 470 475 480
gtg att gtg atg gca ttt gac gaa gtg ggc caa gcc gat acc cgt gag  1488
Val Ile Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Arg Glu
485 490 495
cgc aaa tta gag atc tgt cgt cgg gct tac cat att ttg gtc gat gag  1536
Arg Lys Leu Glu Ile Cys Arg Arg Ala Tyr His Ile Leu Val Asp Glu
500 505 510
gtg ggc ttc cca ccg gaa gat att att ttt gac ccg aac atc ttt gct  1584
Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala
515 520 525
gtt gcg acc gga att gat gag cac aat aac tac gca ctg gat ttc att  1632
Val Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile
530 535 540
aat gca gtg gcg gac att aag cgt gag ctg ccg cat gcg atg att tct  1680
Asn Ala Val Ala Asp Ile Lys Arg Glu Leu Pro His Ala Met Ile Ser
545 550 555 560
ggc ggt gtt tct aac gtt tcc ttc tct ttc cgc ggc aac aac tat gtg1728Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val565 570 575cgt gaa gcg atc cat gct gtt ttc ctt tat cac tgc ttc aaa cac ggc1776Arg Glu Ala Ile His Ala Val Phe Leu Tyr His Cys Phe Lys His Gly580 585 590atg gac atg ggg att gtc aac gca ggg cag ctt gaa atc tac gat aac1824Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr Asp Asn595 600 605gtt ccg ctg aaa ctg cgt gag gca gtg gaa gat gtg atc ctc aat cga1872Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val Ile Leu Asn Arg610 615 620cgt agc gat ggc acg gaa aga ctg ctt gag atc gcc gaa gcg tat cgc1920Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu Ile Ala Glu Ala Tyr Arg625 630 635 640gaa aac agt gtt ggt aaa gaa gag gat gct tct gca tta gag tgg cgc1968Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg645 650 655gca tgg cct gtg gct aag cgc cta gag cac gct tta gtc aaa ggc atc2016Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly Ile660 665 670acc gaa ttt atc gtc caa gac act gaa gaa gca cgt cag caa gcc agt2064Thr Glu Phe Ile Val Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ser675 680 685aaa cca ctg gaa gtg att gaa ggg ccg ctg atg gat ggt atg aac gtg2112Lys Pro Leu Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val690 695 700gtc ggt gac ttg ttc ggg gaa ggg aaa atg ttc cta ccg caa gtc gta2160Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val705 710 715 720aaa tca gcg cgt gtc atg aaa caa gcc gtt gcg tat ctt gag cct ttc2208Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe725 730 735att aat gcg caa aaa agt ggt agc act tca aat ggt aag att ttg ctg2256Ile Asn Ala Gln Lys Ser Gly Ser Thr Ser Asn Gly Lys Ile Leu Leu740 745 750gcg acc gta aaa ggc gat gtg cat gac att ggt aag aac att gtt ggc2304Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly755 760 765gtc gtg ctg cag tgt aat aac ttc gag atc atc gat ctt ggt gtg atg2352Val Val Leu Gln Cys Asn Asn Phe Glu Ile Ile Asp Leu Gly Val Met770 775 780gtg cct tgc gag 
cag atc ctc aaa gtc gca cgc gag caa aat gtc gat  2400
Val Pro Cys Glu Gln Ile Leu Lys Val Ala Arg Glu Gln Asn Val Asp
785 790 795 800
atc atc ggt ctc tct ggg ctt atc acg ccg tct ttg gat gag atg gta  2448
Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val
805 810 815
cac gtg gcg aaa gag atg gag cga caa ggg ttt gaa ctg cca ctt ttg  2496
His Val Ala Lys Glu Met Glu Arg Gln Gly Phe Glu Leu Pro Leu Leu
820 825 830
att ggt ggg gca aca acg tct aaa gcg cat act gcg gtg aag att gaa  2544
Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu
835 840 845
cag aat tat cat gcg cct gta gtg tac gtg aat aac gcg tcg cgc gcg  2592
Gln Asn Tyr His Ala Pro Val Val Tyr Val Asn Asn Ala Ser Arg Ala
850 855 860
gta ggg gtg tgc aca tca tta ttg tct gat gaa cag cgc ccc gga ttt  2640
Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gln Arg Pro Gly Phe
865 870 875 880
atc gaa cgt ttg gat ctc gat tat gag cgc acg cgt gat cag cat gct  2688
Ile Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gln His Ala
885 890 895
cgt aaa acg ccc aaa tcg cgc cca gtc acg tta gag cag gca cgt gct  2736
Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gln Ala Arg Ala
900 905 910
aat aaa gcg gcg ctg gat tgg gca aat tac acg ccg ccc gct cct gcg  2784
Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala
915 920 925
aaa ccg ggt gtg cat gtg ttt gaa aac att gcg tta gcc aca cta cgt  2832
Lys Pro Gly Val His Val Phe Glu Asn Ile Ala Leu Ala Thr Leu Arg
930 935 940
cct tat atc gat tgg acg cct ttt ttt atg act tgg tcg ctt atg ggc  2880
Pro Tyr Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly
945 950 955 960
aaa tac cct gcc att ttg gag cat gaa gag gtc ggt gaa gag gcc aaa  2928
Lys Tyr Pro Ala Ile Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys
965 970 975
cgt ctg ttt cat gat gcc aat gcc tta ctt gat aaa gta gag cga gaa  2976
Arg Leu Phe His Asp Ala Asn Ala Leu Leu Asp Lys Val Glu Arg Glu
980 985 990
gga cta ctg aaa gcc agt ggt atg tgt gca ctg ttt cca gca gcc agc  3024
Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser
995 1000 1005
gtg ggc gat gac att gag gtg tac agt gat gaa tcg cgt acg caa gtc  3072
Val Gly Asp Asp Ile Glu Val Tyr Ser Asp Glu Ser Arg Thr Gln Val
1010 1015 1020
gcg cat gtg ctg tac aac ttg cgt cag cag act gag aaa ccg aaa ggg  3120
Ala His Val Leu Tyr Asn Leu Arg Gln Gln Thr Glu Lys Pro Lys Gly
1025 1030 1035 1040
gcc aac tac tgt ttg tcg gac tat gtt gct ccg aaa gag agc ggt aaa  3168
Ala Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys
1045 1050 1055
cgt gat tgg att ggc gcg ttt gca gta act ggt ggc att ggt gag cga  3216
Arg Asp Trp Ile Gly Ala Phe Ala Val Thr Gly Gly Ile Gly Glu Arg
1060 1065 1070
gcc ttg gcc gat gct tat aaa gct cag ggt gat gat tac aat gcg atc  3264
Ala Leu Ala Asp Ala Tyr Lys Ala Gln Gly Asp Asp Tyr Asn Ala Ile
1075 1080 1085
atg atc caa gcg gta gcc gat cgt ttg gcg gaa gcc ttt gcg gaa tat  3312
Met Ile Gln Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr
1090 1095 1100
ctg cat gaa aaa gtg cgt aaa gag att tgg ggt tat gcg agc gat gaa  3360
Leu His Glu Lys Val Arg Lys Glu Ile Trp Gly Tyr Ala Ser Asp Glu
1105 1110 1115 1120
aat ctc tcc aat gat gac ctg atc cgt gag cgt tat cag ggc att cga  3408
Asn Leu Ser Asn Asp Asp Leu Ile Arg Glu Arg Tyr Gln Gly Ile Arg
1125 1130 1135
ccc gcg ccg ggg tat ccc gcg tgt cct gag cat acc gag aaa gcg act  3456
Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr
1140 1145 1150
ttg tgg cag atg cta aat gtc gaa gag acc ata ggt atg tca ctg acc  3504
Leu Trp Gln Met Leu Asn Val Glu Glu Thr Ile Gly Met Ser Leu Thr
1155 1160 1165
aca agc tat gcg atg tgg ccg ggc gct tcg gta tcc ggt tgg tat ttc  3552
Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe
1170 1175 1180
tcg cat ccc gat tct cgc tat ttt gcg gta gcg cag atc caa cca gat  3600
Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gln Ile Gln Pro Asp
1185 1190 1195 1200
caa ctg cac agc tac gct gag cgt aaa ggt tgg cgt ttg gaa gaa gct  3648
Gln Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala
1205 1210 1215
gaa aag tgg cta gcg cct aac ctt gat gct taa  3681
Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala
1220 1225

<210> 16
<211> 1226
<212> PRT
<213> Vibrio cholerae
<400> 16
Val Gly Lys Glu Val Arg Gln Gln Leu Glu Gln Gln Leu Lys Gln Arg
1 5 10 15
Ile Leu Leu Ile Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Lys
20 25 30
Leu Gln Glu Glu Asp Tyr Arg Gly Ala Arg Phe Val Asp Trp His Cys
35 40 45
Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Gln Pro Gln Ile
50 55 60Ile Lys Glu Ile His Ser Ala Tyr Leu Glu Ala Gly Ala Asp Ile Leu65 70 75 80Glu Thr Asn Thr Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Asp85 90 95Met Gln Ser Leu Ser Ala Glu Ile Asn Phe Ala Ala Ala Lys Leu Ala100 105 110Arg Glu Val Ala Asp Glu Trp Thr Ala Lys Asp Pro Ser Arg Pro Arg115 120 125Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser Ile Ser130 135 140Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Thr Phe Asp Gly Leu145 150 155 160Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu Ile Lys Gly Gly Ser165 170 175Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala180 185 190Cys Ala Phe Ala Val Asp Ser Val Phe Glu Glu Leu Gly Ile Ser Leu195 200 205Pro Val Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu210 215 220Ser Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val Arg225 230 235 240Pro Ile Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg245 250 255Gln Tyr Val Glu Glu Leu Ser Arg Ile Ser Glu Cys Tyr Val Ser Ala260 265 270His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Ser275 280 285Ala Glu Glu Met Ala Glu His Ile Ala Glu Trp Ala Gln Ala Gly Phe290 295 300Leu Asn Leu Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala305 310 315 320Ala Ile Ala Lys Ala Val Glu Gly Val Lys Pro Arg Ala Leu Pro Asp325 330 335Leu Lys Val Glu Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly340 345 350Pro Glu Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly355 360 365Ser Ala Arg Phe Lys Arg Leu Ile Lys Glu Glu Gln Tyr Asp Glu Ala370 375 380
Leu Asp Val Ala Arg Glu Gln Val Glu Asn Gly Ala Gln Ile Ile Asp
385 390 395 400
Ile Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val Arg
405 410 415
Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu Ile Ser Lys Val Pro Val
420 425 430
Met Val Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys Cys
435 440 445
Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Lys
450 455 460
Glu Lys Phe Ile Ala Gln Ala Lys Leu Val Arg Arg Tyr Gly Ala Ala
465 470 475 480
Val Ile Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Arg Glu
485 490 495
Arg Lys Leu Glu Ile Cys Arg Arg Ala Tyr His Ile Leu Val Asp Glu
500 505 510
Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala
515 520 525
Val Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile
530 535 540
Asn Ala Val Ala Asp Ile Lys Arg Glu Leu Pro His Ala Met Ile Ser
545 550 555 560
Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr Val
565 570 575
Arg Glu Ala Ile His Ala Val Phe Leu Tyr His Cys Phe Lys His Gly
580 585 590
Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr Asp Asn
595 600 605
Val Pro Leu Lys Leu Arg Glu Ala Val Glu Asp Val Ile Leu Asn Arg
610 615 620
Arg Ser Asp Gly Thr Glu Arg Leu Leu Glu Ile Ala Glu Ala Tyr Arg
625 630 635 640
Glu Asn Ser Val Gly Lys Glu Glu Asp Ala Ser Ala Leu Glu Trp Arg
645 650 655
Ala Trp Pro Val Ala Lys Arg Leu Glu His Ala Leu Val Lys Gly Ile
660 665 670
Thr Glu Phe Ile Val Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ser
675 680 685
Lys Pro Leu Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val
690 695 700
Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val
705 710 715 720
Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe725 730 735Ile Asn Ala Gln Lys Ser Gly Ser Thr Ser Asn Gly Lys Ile Leu Leu740 745 750Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly755 760 765Val Val Leu Gln Cys Asn Asn Phe Glu Ile Ile Asp Leu Gly Val Met770 775 780Val Pro Cys Glu Gln Ile Leu Lys Val Ala Arg Glu Gln Asn Val Asp785 790 795 800Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val805 810 815His Val Ala Lys Glu Met Glu Arg Gln Gly Phe Glu Leu Pro Leu Leu820 825 830Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu835 840 845Gln Asn Tyr His Ala Pro Val Val Tyr Val Asn Asn Ala Ser Arg Ala850 855860Val Gly Val Cys Thr Ser Leu Leu Ser Asp Glu Gln Arg Pro Gly Phe865 870 875880Ile Glu Arg Leu Asp Leu Asp Tyr Glu Arg Thr Arg Asp Gln His Ala885 890 895Arg Lys Thr Pro Lys Ser Arg Pro Val Thr Leu Glu Gln Ala Arg Ala900 905 910Asn Lys Ala Ala Leu Asp Trp Ala Asn Tyr Thr Pro Pro Ala Pro Ala915 920 925Lys Pro Gly Val His Val Phe Glu Asn Ile Ala Leu Ala Thr Leu Arg930 935 940Pro Tyr Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Met Gly945 950 955 960Lys Tyr Pro Ala Ile Leu Glu His Glu Glu Val Gly Glu Glu Ala Lys965 970 975Arg Leu Phe His Asp Ala Asn Ala Leu Leu Asp Lys Val Glu Arg Glu980 985 990Gly Leu Leu Lys Ala Ser Gly Met Cys Ala Leu Phe Pro Ala Ala Ser99510001005Val Gly Asp Asp Ile Glu Val Tyr Ser Asp Glu Ser Arg Thr Gln Val101010151020Ala His Val Leu Tyr Asn Leu Arg Gln Gln Thr Glu Lys Pro Lys Gly1025 103010351040Ala Asn Tyr Cys Leu Ser Asp Tyr Val Ala Pro Lys Glu Ser Gly Lys
1045 1050 1055
Arg Asp Trp Ile Gly Ala Phe Ala Val Thr Gly Gly Ile Gly Glu Arg
1060 1065 1070
Ala Leu Ala Asp Ala Tyr Lys Ala Gln Gly Asp Asp Tyr Asn Ala Ile
1075 1080 1085
Met Ile Gln Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr
1090 1095 1100
Leu His Glu Lys Val Arg Lys Glu Ile Trp Gly Tyr Ala Ser Asp Glu
1105 1110 1115 1120
Asn Leu Ser Asn Asp Asp Leu Ile Arg Glu Arg Tyr Gln Gly Ile Arg
1125 1130 1135
Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr
1140 1145 1150
Leu Trp Gln Met Leu Asn Val Glu Glu Thr Ile Gly Met Ser Leu Thr
1155 1160 1165
Thr Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe
1170 1175 1180
Ser His Pro Asp Ser Arg Tyr Phe Ala Val Ala Gln Ile Gln Pro Asp
1185 1190 1195 1200
Gln Leu His Ser Tyr Ala Glu Arg Lys Gly Trp Arg Leu Glu Glu Ala
1205 1210 1215
Glu Lys Trp Leu Ala Pro Asn Leu Asp Ala
1220 1225

<210> 17
<211> 3822
<212> DNA
<213> Sinorhizobium meliloti
<220>
<221> CDS
<222> (1)..(3819)
<223> RSM07338
<400> 17
gtg agt aaa tcg ata att ctt tgt cgt ttt cag aac ggg aga tct ccc  48
Val Ser Lys Ser Ile Ile Leu Cys Arg Phe Gln Asn Gly Arg Ser Pro
1 5 10 15
atg tcc gcc gcc gac gcc ctc ttt gga aac gtc tcg ccc aag ccg gat  96
Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp
20 25 30
ggt tcg gaa gtc ttt cgg cag ctc gcc cag gcg gcg gct gaa cgc atc  144
Gly Ser Glu Val Phe Arg Gln Leu Ala Gln Ala Ala Ala Glu Arg Ile
35 40 45
ctc atc atg gat ggc gcc atg gga acg gag atc cag cag ctc ggt ttc  192
Leu Ile Met Asp Gly Ala Met Gly Thr Glu Ile Gln Gln Leu Gly Phe
50 55 60
gtg gag gat cac ttc cgc ggc gag cgc ttc ggt ggc tgc gcc tgc cat240Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His65 70 75 80cag cag ggc aac aac gac ctc ctg acg ctc act cag ccg aag gcg atc288Gln Gln Gly Asn Asn Asp Leu Leu Thr Leu Thr Gln Pro Lys Ala Ile85 90 95gag gat att cat tac cac tac gcc atc gcc ggc gcc gat atc ctc gaa336Glu Asp Ile His Tyr His Tyr Ala Ile Ala Gly Ala Asp Ile Leu Glu100 105 110acc aac acc ttc tcc tcg acg cgg atc gcc cag gcc gat tac ggc atg384Thr Asn Thr Phe Ser Ser Thr Arg Ile Ala Gln Ala Asp Tyr Gly Met115 120 125gag gac atg gtc tac gat ctc aat cgc gac ggc gcg cgg ctg gcg cgg432Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg130 135 140cga gcc gcg aag cgg gcc gag gcg gag gat ggc cgg cgg cgc ttc gtg480Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val145 150 155 160gca ggc gcg ctc ggc ccc acc aac cgc acc gct tcg att tcg ccg gac528Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp165 170 175gtc aac aac ccc ggc tat cga gcc gtc agc ttc gac gat ctg agg ctc576Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Leu180 185 190gcc tat gcc gag cag gtg cgg ggc ctc atc gac ggc ggt gcc gac atc624Ala Tyr Ala Glu Gln Val Arg Gly Leu Ile Asp Gly Gly Ala Asp Ile195 200 205atc ctg atc gag acg atc ttc gac acg ctg aat gcc aag gcg gcg atc672Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile210 215220ttc gcg acg cag gaa gtc ttt gcc gaa aag ggc gtc cgc ctt ccg gtg720Phe Ala Thr Gln Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val225 230 235 240atg atc tcc gga acg atc acc gat ctc tcc ggc cgt acc ctc tcc ggc768Met Ile Ser Gly Thr Ile Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly245 250 255cag acg cct acg gcc ttc tgg tat tcg gtg cgc cat gcg gat ccg ttt816Gln Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe260 265 270acg atc ggg ctc aac tgc gcg ctc ggc gca aat gcg atg cgc gcc cat864Thr Ile Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His275 280 285ata gac gag ctt tcg gcg gtc gcc gac 
acg ctc gtc tgc gcc tat ccg 912
Ile Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro
290 295 300
aat gcc ggc ctg ccg aac gag ttc ggc cgc tat gac gaa agc ccc gag 960
Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu305 310 315 320cag atg gcg gcg cag gtc gag ggc ttc gcc cgg gac ggt ctc gtc aac1008Gln Met Ala Ala Gln Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn325 330 335atc gtc ggc ggc tgc tgc ggt tcc acg ccg gcc cat atc cgc gcc att1056Ile Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His Ile Arg Ala Ile340 345 350gcc gaa gcg gtt gcc aaa tat ccg ccg cgc cgg gtg ccc gag atc gat1104Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu Ile Asp355 360 365cgc cgc atg cgg ctt tcc ggc ctc gaa ccc ttc acg ctt acc gac gag1152Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu370 375 380att ccc ttc gtc aac gtc ggc gaa cgc acc aac gtc acc ggc tcg gcg1200Ile Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala385 390 395 400aag ttc cgc aag ctg atc acc gcc ggg gac tac gcc gcc gca ctc gat1248Lys Phe Arg Lys Leu Ile Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp405 410 415gtg gcg cgt gat cag gtg gcg aat ggc gcc cag atc atc gac gtc aac1296Val Ala Arg Asp Gln Val Ala Asn Gly Ala Gln Ile Ile Asp Val Asn420 425 430atg gac gaa ggc ctg atc gat tcg aag cag gtg atg gtc gag ttc ctg1344Met Asp Glu Gly Leu Ile Asp Ser Lys Gln Val Met Val Glu Phe Leu435 440 445aac ctc gtc gcc tcc gag ccg gat atc gcc cgt gta ccg gtg atg atc1392Asn Leu Val Ala Ser Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile450 455 460gat tcg tcg aaa tgg gag gtg atc gaa gcc ggg ctc aaa tgc gtc cag1440Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys Cys Val Gln465 470 475 480ggc aag gcg ctg gtg aac tcc atc tcg ctc aag gaa ggc gag gcg gct1488Gly Lys Ala Leu Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Ala Ala485 490 495ttc ctg cac cat gcg cgc ctc gtg cgc gcc tat ggc gcc gcg gtc gtg1536Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val500 505 510gtg atg gcg ttc gac gag aag ggc cag gcc gac acg aaa acc cgc aag1584Val Met Ala Phe Asp Glu Lys Gly Gln Ala Asp Thr Lys Thr Arg Lys515 520 525gtg gaa atc tgc cgg cgg gcc tat cgg ctg ctg acg gaa gag gtt ggc1632Val Glu Ile 
Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly
530 535 540
ttc ccc ccg gag gac atc atc ttc gac ccg aat atc ttc gcg gtc gcg 1680
Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala
545 550 555 560
acc ggc atc gag gag cac aac aat tac ggc gtc gac ttc atc gag gcg1728Thr Gly Ile Glu Glu His Asn Asn Tyr Gly Val Asp Phe Ile Glu Ala565 570 575acg cac gag atc atc gcg gca ctg ccg cat gtc cac gtc tcc ggc ggc1776Thr His Glu Ile Ile Ala Ala Leu Pro His Val His Val Ser Gly Gly580 585 590gtg tcg aac ctc tcc ttt tcc ttc cgc ggc aac gag ccg gtg cgc gag1824Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu595 600 605gcg atg cac gcc atc ttc ctt tat cac gcg atc cag gcc ggc atg gac1872Ala Met His Ala Ile Phe Leu Tyr His Ala Ile Gln Ala Gly Met Asp610 615 620atg ggc atc gtc aat gcc gga cag ctc gcc gtc tat gat gcg atc gac1920Met Gly Ile Val Asn Ala Gly Gln Leu Ala Val Tyr Asp Ala Ile Asp625 630 635 640ccg gaa ctg cgc gaa acc tgc gag gac gtg gtg ctc aac cgc cgg gcc1968Pro Glu Leu Arg Glu Thr Cys Glu Asp Val Val Leu Asn Arg Arg Ala645 650 655gat tcg acc gag cgc ctc ctg gag atc gcc gag cgc tat cgc ggg aag2016Asp Ser Thr Glu Arg Leu Leu Glu Ile Ala Glu Arg Tyr Arg Gly Lys660 665 670ggc ggg agc cag ggc aag gag aag gac ctt gcc tgg cgc gaa tgg ccg2064Gly Gly Ser Gln Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro675 680 685gtg gag aag cgg ctc gaa cac gcg ctc gtc aat gga att acc gaa ttt2112Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Glu Phe690 695 700atc gaa gcc gat acg gaa gag gcc cgg ctt gcc gcc gag cgg ccg ctg2160Ile Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu705 710 715 720cat gtc atc gaa ggc ccg ctg atg gcc ggg atg aac gtc gtg ggc gat2208His Val Ile Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp725 730 735ctc ttc ggt tcc ggc aag atg ttc ctg ccg cag gtg gtc aag tcc gcc2256Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala740 745 750cgg gtg atg aag cag gcc gtt gcg gtg ctg ctc ccc cat atg gag gag2304Arg Val Met Lys Gln Ala Val Ala Val Leu Leu Pro His Met Glu Glu755 760 765gag aag cgc gcc aat ggc ggc ggc gag gcg cgc gag agt gcc ggc aag2352Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys770 775 780atc ctg atg gcg 
acc gtc aag ggc gac gtg cac gac atc ggc aag aac 2400
Ile Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn
785 790 795 800
atc gtc ggc gtc gtg ctc gcc tgc aac aat tac gag atc atc gac ctc 2448
Ile Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu Ile Ile Asp Leu805 810 815ggc gtc atg gtg ccc tcg gct aag atc ctc gaa gtg gcg cgc gaa cag2496Gly Val Met Val Pro Ser Ala Lys Ile Leu Glu Val Ala Arg Glu Gln820 825 830aag gtc gac atc gtc ggt ctt tcc ggc ctc atc acg ccg tcg ctg gac2544Lys Val Asp Ile Val Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp835 840 845gag atg gcg cat gtc gct tcc gag ctc gaa cgg gag ggc ttc gat gtc2592Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val850 855 860ccg ctg ctg atc ggc ggg gcg acg acc agc cgc gtg cac acg gcc gtg2640Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val865 870 875 880aag atc aat ccg cgt tac agc ctc ggc cag acg gtc tat gtc acc gac2688Lys Ile Asn Pro Arg Tyr Ser Leu Gly Gln Thr Val Tyr Val Thr Asp885 890 895gcc agc cgc gcg gtc ggc gtc gta tcg agc ctg ctc tcg ccg gaa gtc2736Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val900 905 910cgc gac tcc tac aag aaa acg gtc cgc gcg gag tat ctg aag gtt gcc2784Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala915 920 925gac gca cat gcc cgc aac gaa gcc gag aag cgc cgt ctg ccg ctt tcc2832Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser930 935 940cag gcg cgg gcg aat gcc ttt cgg ata gat tgg gac gcc cac cag ccg2880Gln Ala Arg Ala Asn Ala Phe Arg Ile Asp Trp Asp Ala His Gln Pro945 950 955 960aag gtt ccg tcc ttc ctc ggc acg cgt gtt ttc gag gga tgg gac ctc2928Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu965 970 975gcc gaa ctc gcc cgc tat atc gac tgg acg ccg ttc ttc cag acc tgg2976Ala Glu Leu Ala Arg Tyr Ile Asp Trp Thr Pro Phe Phe Gln Thr Trp980 985 990gag ctg aag ggg gta ttc ccg aaa atc ctc gat gac gaa cgc cag ggg3024Glu Leu Lys Gly Val Phe Pro Lys Ile Leu Asp Asp Glu Arg Gln Gly99510001005gct gcc gct cgc cag ctc ttc gag gat gcg cag gcg atg gtc gaa aag3072Ala Ala Ala Arg Gln Leu Phe Glu Asp Ala Gln Ala Met Val Glu Lys101010151020atc gtg gcc gag gca tgg ttc gcc ccg aag gcc gtg atc ggc ttc tgg3120Ile Val Ala Glu 
Ala Trp Phe Ala Pro Lys Ala Val Ile Gly Phe Trp
1025 1030 1035 1040
ccg gcc gcc agc atg ggc gac gac gtc cgc ctg ttt gcc gac gag gtg 3168
Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Asp Glu Val
1045 1050 1055
cgc gaa gcc gag ctt gcc acc ttc ttc acg ctc cgc cag cag atg gtg3216Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Met Val106010651070aag cgc gac ggc cgg ccg aac gtc gcc ctt gcc gac ttc gtc gcc ccg3264Lys Arg Asp Gly Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro107510801085gcg gcg agc ggc aag cgg gac tat gtc ggc ggt ttc gtg gtg acg gcc3312Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala109010951100ggc atc gag gaa gtg gcg atc gcc gaa cgc ttc gaa cgg gcg aac gac3360Gly Ile Glu Glu Val Ala Ile Ala Glu Arg Phe Glu Arg Ala Asn Asp1105 111011151120gat tat tcc tcg atc atg gtc aag gcg ctt gcg gac cgc ttc gca gag3408Asp Tyr Ser Ser Ile Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu112511301135gcc ttt gcc gag cgc atg cat gaa tat gtc cgc aag gag ctc tgg ggc3456Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly114011451150tat gct ccg gac gaa gcc ttc acg ccg cag gaa ttg atc gcc gag ccc3504Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gln Glu Leu Ile Ala Glu Pro115511601165tat gcc ggc atc cgc cct gcg ccc ggc tac ccg gcg cag ccc gac cac3552Tyr Ala Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His117011751180acg gaa aag gag acg ctt ttc cgg ctc ctg gat gcg gaa gcc gct atc3600Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala Ile1185 119011951200ggc gtc cgg ctc acc gag agc tat gcg atg tgg ccg ggc tct tcg gta3648Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val120512101215tcg ggc ctc tat gtc ggc cac ccc gat tcc tat tac ttc ggc gtc gca3696Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala122012251230aag atc gag cgc gat cag gtg gag gac tat gcc gat cgc aag cgc atg3744Lys Ile Glu Arg Asp Gln Val Glu Asp Tyr Ala Asp Arg Lys Arg Met123512401245agc gtc cgc gag gtc gag cgc tgg ctt tcg ccg atc ctc aat tac gtg3792Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro Ile Leu Asn Tyr Val125012551260ccg atg ccg gag acg gaa gcg gcg gag tag3822Pro Met Pro Glu Thr Glu Ala Ala Glu1265 1270210182111273212PRT213苜蓿中華根瘤菌
40018Val Ser Lys Ser Ile Ile Leu Cys Arg Phe Gln Asn Gly Arg Ser Pro1 5 10 15Met Ser Ala Ala Asp Ala Leu Phe Gly Asn Val Ser Pro Lys Pro Asp20 25 30Gly Ser Glu Val Phe Arg Gln Leu Ala Gln Ala Ala Ala Glu Arg Ile35 40 45Leu Ile Met Asp Gly Ala Met Gly Thr Glu Ile Gln Gln Leu Gly Phe50 55 60Val Glu Asp His Phe Arg Gly Glu Arg Phe Gly Gly Cys Ala Cys His65 70 75 80Gln Gln Gly Asn Asn Asp Leu Leu Thr Leu Thr Gln Pro Lys Ala Ile85 90 95Glu Asp Ile His Tyr His Tyr Ala Ile Ala Gly Ala Asp Ile Leu Glu100 105 110Thr Asn Thr Phe Ser Ser Thr Arg Ile Ala Gln Ala Asp Tyr Gly Met115 120 125Glu Asp Met Val Tyr Asp Leu Asn Arg Asp Gly Ala Arg Leu Ala Arg130 135 140Arg Ala Ala Lys Arg Ala Glu Ala Glu Asp Gly Arg Arg Arg Phe Val145 150 155 160Ala Gly Ala Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp165 170 175Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Leu180 185 190Ala Tyr Ala Glu Gln Val Arg Gly Leu Ile Asp Gly Gly Ala Asp Ile195 200 205Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile210 215 220Phe Ala Thr Gln Glu Val Phe Ala Glu Lys Gly Val Arg Leu Pro Val225 230 235 240Met Ile Ser Gly Thr Ile Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly245 250 255Gln Thr Pro Thr Ala Phe Trp Tyr Ser Val Arg His Ala Asp Pro Phe260 265 270Thr Ile Gly Leu Asn Cys Ala Leu Gly Ala Asn Ala Met Arg Ala His275 280 285Ile Asp Glu Leu Ser Ala Val Ala Asp Thr Leu Val Cys Ala Tyr Pro290 295 300Asn Ala Gly Leu Pro Asn Glu Phe Gly Arg Tyr Asp Glu Ser Pro Glu305 310 315 320
Gln Met Ala Ala Gln Val Glu Gly Phe Ala Arg Asp Gly Leu Val Asn325 330 335Ile Val Gly Gly Cys Cys Gly Ser Thr Pro Ala His Ile Arg Ala Ile340 345 350Ala Glu Ala Val Ala Lys Tyr Pro Pro Arg Arg Val Pro Glu Ile Asp355 360 365Arg Arg Met Arg Leu Ser Gly Leu Glu Pro Phe Thr Leu Thr Asp Glu370 375 380Ile Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala385 390 395 400Lys Phe Arg Lys Leu Ile Thr Ala Gly Asp Tyr Ala Ala Ala Leu Asp405 410 415Val Ala Arg Asp Gln Val Ala Asn Gly Ala Gln Ile Ile Asp Val Asn420 425 430Met Asp Glu Gly Leu Ile Asp Ser Lys Gln Val Met Val Glu Phe Leu435 440 445Asn Leu Val Ala Ser Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile450 455 460Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys Cys Val Gln465 470 475 480Gly Lys Ala Leu Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Ala Ala485 490 495Phe Leu His His Ala Arg Leu Val Arg Ala Tyr Gly Ala Ala Val Val500 505 510Val Met Ala Phe Asp Glu Lys Gly Gln Ala Asp Thr Lys Thr Arg Lys515 520 525Val Glu Ile Cys Arg Arg Ala Tyr Arg Leu Leu Thr Glu Glu Val Gly530 535 540Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala545 550 555 560Thr Gly Ile Glu Glu His Asn Asn Tyr Gly Val Asp Phe Ile Glu Ala565 570 575Thr His Glu Ile Ile Ala Ala Leu Pro His Val His Val Ser Gly Gly580 585 590Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu595 600 605Ala Met His Ala Ile Phe Leu Tyr His Ala Ile Gln Ala Gly Met Asp610 615 620Met Gly Ile Val Asn Ala Gly Gln Leu Ala Val Tyr Asp Ala Ile Asp625 630 635 640Pro Glu Leu Arg Glu Thr Cys Glu Asp Val Val Leu Asn Arg Arg Ala645 650 655
Asp Ser Thr Glu Arg Leu Leu Glu Ile Ala Glu Arg Tyr Arg Gly Lys660 665 670Gly Gly Ser Gln Gly Lys Glu Lys Asp Leu Ala Trp Arg Glu Trp Pro675 680 685Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Glu Phe690 695 700Ile Glu Ala Asp Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu705 710 715 720His Val Ile Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp725 730 735Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala740 745 750Arg Val Met Lys Gln Ala Val Ala Val Leu Leu Pro His Met Glu Glu755 760 765Glu Lys Arg Ala Asn Gly Gly Gly Glu Ala Arg Glu Ser Ala Gly Lys770 775 780Ile Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn785 790 795 800Ile Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu Ile Ile Asp Leu805 810 815Gly Val Met Val Pro Ser Ala Lys Ile Leu Glu Val Ala Arg Glu Gln820 825 830Lys Val Asp Ile Val Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp835 840 845Glu Met Ala His Val Ala Ser Glu Leu Glu Arg Glu Gly Phe Asp Val850 855 860Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val865 870 875 880Lys Ile Asn Pro Arg Tyr Ser Leu Gly Gln Thr Val Tyr Val Thr Asp885 890 895Ala Ser Arg Ala Val Gly Val Val Ser Ser Leu Leu Ser Pro Glu Val900 905 910Arg Asp Ser Tyr Lys Lys Thr Val Arg Ala Glu Tyr Leu Lys Val Ala915 920 925Asp Ala His Ala Arg Asn Glu Ala Glu Lys Arg Arg Leu Pro Leu Ser930 935 940Gln Ala Arg Ala Asn Ala Phe Arg Ile Asp Trp Asp Ala His Gln Pro945 950 955 960Lys Val Pro Ser Phe Leu Gly Thr Arg Val Phe Glu Gly Trp Asp Leu965 970 975Ala Glu Leu Ala Arg Tyr Ile Asp Trp Thr Pro Phe Phe Gln Thr Trp
980 985 990
Glu Leu Lys Gly Val Phe Pro Lys Ile Leu Asp Asp Glu Arg Gln Gly
995 1000 1005
Ala Ala Ala Arg Gln Leu Phe Glu Asp Ala Gln Ala Met Val Glu Lys
1010 1015 1020
Ile Val Ala Glu Ala Trp Phe Ala Pro Lys Ala Val Ile Gly Phe Trp
1025 1030 1035 1040
Pro Ala Ala Ser Met Gly Asp Asp Val Arg Leu Phe Ala Asp Glu Val
1045 1050 1055
Arg Glu Ala Glu Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Met Val
1060 1065 1070
Lys Arg Asp Gly Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro
1075 1080 1085
Ala Ala Ser Gly Lys Arg Asp Tyr Val Gly Gly Phe Val Val Thr Ala
1090 1095 1100
Gly Ile Glu Glu Val Ala Ile Ala Glu Arg Phe Glu Arg Ala Asn Asp
1105 1110 1115 1120
Asp Tyr Ser Ser Ile Met Val Lys Ala Leu Ala Asp Arg Phe Ala Glu
1125 1130 1135
Ala Phe Ala Glu Arg Met His Glu Tyr Val Arg Lys Glu Leu Trp Gly
1140 1145 1150
Tyr Ala Pro Asp Glu Ala Phe Thr Pro Gln Glu Leu Ile Ala Glu Pro
1155 1160 1165
Tyr Ala Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His
1170 1175 1180
Thr Glu Lys Glu Thr Leu Phe Arg Leu Leu Asp Ala Glu Ala Ala Ile
1185 1190 1195 1200
Gly Val Arg Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val
1205 1210 1215
Ser Gly Leu Tyr Val Gly His Pro Asp Ser Tyr Tyr Phe Gly Val Ala
1220 1225 1230
Lys Ile Glu Arg Asp Gln Val Glu Asp Tyr Ala Asp Arg Lys Arg Met
1235 1240 1245
Ser Val Arg Glu Val Glu Arg Trp Leu Ser Pro Ile Leu Asn Tyr Val
1250 1255 1260
Pro Met Pro Glu Thr Glu Ala Ala Glu
1265 1270
<210> 19
<211> 3684
<212> DNA
<213> Escherichia coli
<220>
<221> CDS
<222> (1)..(3681)
<223> REC03905
<400> 19
gtg agc agc aaa gtg gaa caa ctg cgt gcg cag tta aat gaa cgt att 48
Val Ser Ser Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile
1 5 10 15
ctg gtg ctg gac ggc ggt atg ggc acc atg atc cag agt tat cga ctg 96
Leu Val Leu Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu
20 25 30
aac gaa gcc gat ttt cgt ggt gaa cgc ttt gcc gac tgg cca tgc gac 144
Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp
35 40 45
ctc aaa ggc aac aac gac ctg ctg gta ctc agt aaa ccg gaa gtg atc 192
Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile
50 55 60
gcc gct atc cac aac gcc tac ttt gaa gcg ggc gcg gat atc atc gaa 240
Ala Ala Ile His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu
65 70 75 80
acc aac acc ttc aac tcc acg acc att gcg atg gcg gat tac cag atg 288
Thr Asn Thr Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Gln Met
85 90 95
gaa tcc ctg tcg gcg gaa atc aac ttt gcg gcg gcg aaa ctg gcg cga 336
Glu Ser Leu Ser Ala Glu Ile Asn Phe Ala Ala Ala Lys Leu Ala Arg
100 105 110
gct tgt gct gac gag tgg acc gcg cgc acg cca gag aaa ccg cgc tac 384
Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr
115 120 125
gtt gcc ggt gtt ctc ggc ccg acc aac cgc acg gcg tct att tct ccg 432
Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro
130 135 140
gac gtc aac gat ccg gca ttt cgt aat atc act ttt gac ggg ctg gtg 480
Asp Val Asn Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gly Leu Val
145 150 155 160
gcg gct tat cga gag tcc acc aaa gcg ctg gtg gaa ggt ggc gcg gat 528
Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp
165 170 175
ctg atc ctg att gaa acc gtt ttc gac acc ctt aac gcc aaa gcg gcg 576
Leu Ile Leu Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala
180 185 190
gta ttt gcg gtg aaa acg gag ttt gaa gcg ctg ggc gtt gag ctg ccg 624
Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro
195 200 205
att atg atc tcc ggc acc atc acc gac gcc tcc ggg cgc acg ctc tcc 672
Ile Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser
210 215 220
ggg cag acc acc gaa gca ttt tac aac tca ttg cgc cac gcc gaa gct720Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala225 230 235 240ctg acc ttt ggc ctg aac tgt gcg ctg ggg ccc gat gaa ctg cgc cag768Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln245 250 255tac gtg cag gag ctg tca cgg att gcg gaa tgc tac gtc acc gcg cac816Tyr Val Gln Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His260 265 270ccg aac gcc ggg cta ccc aac gcc ttt ggt gag tac gat ctc gac gcc864Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala275 280 285gac acg atg gca aaa cag ata cgt gaa tgg gcg caa gcg ggt ttt ctc912Asp Thr Met Ala Lys Gln Ile Arg Glu Trp Ala Gln Ala Gly Phe Leu290 295 300aat atc gtc ggc ggc tgc tgt ggc acc acg cca caa cat att gca gcg960Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Gln His Ile Ala Ala305 310 315 320atg agt cgt gca gta gaa gga tta gcg ccg cgc aaa ctg ccg gaa att1008Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu Ile325 330 335ccc gta gcc tgc cgt ttg tcc ggc ctg gag ccg ctg aac att ggc gaa1056Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Glu340 345 350gat agc ctg ttt gtg aac gtg ggt gaa cgc acc aac gtc acc ggt tcc1104Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser355 360 365gct aag ttc aag cgc ctg atc aaa gaa gag aaa tac agc gag gcg ctg1152Ala Lys Phe Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu370 375 380gat gtc gcg cgt caa cag gtg gaa aac ggc gcg cag att atc gat atc1200Asp Val Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile385 390 395 400aac atg gat gaa ggg atg ctc gat gcc gaa gcg gcg atg gtg cgt ttt1248Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe405 410 415ctc aat ctg att gcc ggt gaa ccg gat atc gct cgc gtg ccg att atg1296Leu Asn Leu Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met420 425 430atc gac tcc tca aaa tgg gac gtc att gaa aaa ggt ctg aag tgt atc1344Ile Asp Ser Ser Lys Trp Asp Val Ile Glu Lys Gly Leu Lys Cys Ile435 440 445cag ggc aaa ggc att 
gtt aac tct atc tcg atg aaa gag ggc gtc gat 1392
Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Asp
450 455 460
gcc ttt atc cat cac gcg aaa ttg ttg cgt cgc tac ggt gcg gca gtg 1440
Ala Phe Ile His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val
465 470 475 480gtg gta atg gcc ttt gac gaa cag gga cag gcc gat act cgc gca cgg1488Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Ala Arg485 490 495aaa atc gag att tgc cgt cgg gcg tac aaa atc ctc acc gaa gag gtt1536Lys Ile Glu Ile Cys Arg Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val500 505 510ggc ttc ccg cca gaa gat atc atc ttc gac cca aac atc ttc gcg gtc1584Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val515 520 525gca act ggc att gaa gag cac aac aac tac gcg cag gac ttt atc ggc1632Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly530 535 540gcg tgt gaa gac atc aaa cgc gaa ctg ccg cac gcg ctg att tcc ggc1680Ala Cys Glu Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly545 550 555 560ggc gta tct aac gtt tct ttc tcg ttc cgt ggc aac gat ccg gtg cgc1728Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg565 570 575gaa gcc att cac gca gtg ttc ctc tac tac gct att cgc aat ggc atg1776Glu Ala Ile His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met580 585 590gat atg ggg atc gtc aac gcc ggg caa ctg gcg att tac gac gac cta1824Asp Met Gly Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asp Leu595 600 605ccc gct gaa ctg cgc gac gcg gtg gaa gat gtg att ctt aat cgt cgc1872Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg610 615 620gac gat ggc acc gag cgt tta ctg gag ctt gcc gag aaa tat cgc ggc1920Asp Asp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly625 630 635 640agc aaa acc gac gac acc gcc aac gcc cag cag gcg gag tgg cgc tcg1968Ser Lys Thr Asp Asp Thr Ala Asn Ala Gln Gln Ala Glu Trp Arg Ser645 650 655tgg gaa gtg aat aaa cgt ctg gaa tac tcg ctg gtc aaa ggc att acc2016Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr660 665 670gag ttt atc gag cag gat acc gaa gaa gcc cgc cag cag gct acg cgc2064Glu Phe Ile Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Thr Arg675 680 685ccg att gaa gtg att gaa ggc ccg ttg atg gac ggc atg aat gtg gtc2112Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val690 695 
700
ggc gac ctg ttt ggc gaa ggg aaa atg ttc ctg cca cag gtg gtc aaa 2160
Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys
705 710 715 720
tcg gcg cgc gtc atg aaa cag gcg gtg gcc tac ctc gaa ccg ttt att2208Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile725 730 735gaa gcc agc aaa gag cag ggc aaa acc aac ggc aag atg gtg atc gcc2256Glu Ala Ser Lys Glu Gln Gly Lys Thr Asn Gly Lys Met Val Ile Ala740 745 750acc gtg aag ggc gac gtc cac gac atc ggt aaa aat atc gtt ggt gtg2304Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val755 760 765gtg ctg caa tgt aac aac tac gaa att gtc gat ctc ggc gtt atg gtg2352Val Leu Gln Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val770 775 780cct gcg gaa aaa att ctc cgt acc gct aaa gaa gtg aat gct gat ctg2400Pro Ala Glu Lys Ile Leu Arg Thr Ala Lys Glu Val Asn Ala Asp Leu785 790 795 800att ggc ctt tcg ggg ctt atc acg ccg tcg ctg gac gag atg gtt aac2448Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn805 810 815gtg gcg aaa gag atg gag cgt cag ggc ttc act att ccg tta ctg att2496Val Ala Lys Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile820 825 830ggc ggc gcg acg acc tca aaa gcg cac acg gcg gtg aaa atc gag cag2544Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln835 840 845aac tac agc ggc ccg acg gtg tat gtg cag aat gcc tcg cgt acc gtt2592Asn Tyr Ser Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val850 855 860ggt gtg gtg gcg gcg ctg ctt tcc gat acc cag cgt gat gat ttt gtc2640Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val865 870 875 880gct cgt acc cgc aag gag tac gaa acc gta cgt att cag cac ggg cgc2688Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Gly Arg885 890 895aag aaa ccg cgc aca cca ccg gtc acg ctg gaa gcg gcg cgc gat aac2736Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn900 905 910gat ttc gct ttt gac tgg cag gct tac acg ccg ccg gtg gcg cac cgt2784Asp Phe Ala Phe Asp Trp Gln Ala Tyr Thr Pro Pro Val Ala His Arg915 920 925ctc ggc gtg cag gaa gtc gaa gcc agc atc gaa acg ctg cgt aat tac2832Leu Gly Val Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr930 935 940atc gac tgg aca 
ccg ttc ttt atg acc tgg tcg ctg gcc ggg aag tat 2880
Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr
945 950 955 960
ccg cgc att ctg gaa gat gaa gtg gtg ggc gtt gag gcg cag cgg ctg 2928
Pro Arg Ile Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu
965 970 975ttt aaa gac gcc aac gac atg ctg gat aaa tta agc gcc gag aaa acg2976Phe Lys Asp Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Thr980 985 990ctg aat ccg cgt ggc gtg gtg ggc ctg ttc ccg gca aac cgt gtg ggc3024Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly99510001005gat gac att gaa atc tac cgt gac gaa acg cgt acc cat gtg atc aac3072Asp Asp Ile Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Ile Asn101010151020gtc agc cac cat ctg cgt caa cag acc gaa aaa aca ggc ttc gct aac3120Val Ser His His Leu Arg Gln Gln Thr Glu Lys Thr Gly Phe Ala Asn1025 103010351040tac tgt ctc gct gac ttc gtt gcg ccg aag ctt tct ggt aaa gca gat3168Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp104510501055tac atc ggc gca ttt gcc gtg act ggc ggg ctg gaa gag gac gca ctg3216Tyr Ile Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu106010651070gct gat gcc ttt gaa gcg cag cac gat gat tac aac aaa atc atg gtg3264Ala Asp Ala Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val107510801085aaa gcg ctt gcc gac cgt tta gcc gaa gcc ttt gcg gag tat ctc cat3312Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His109010951100gag cgt gtg cgt aaa gtc tac tgg ggc tat gcg ccg aac gag aac ctc3360Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu1105 111011151120agc aac gaa gag ctg atc cgc gaa aac tac cag ggc atc cgt ccg gca3408Ser Asn Glu Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala112511301135ccg ggc tat ccg gcc tgc ccg gaa cat acg gaa aaa gcc acc atc tgg3456Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr Ile Trp114011451150gag ctg ctg gaa gtg gaa aaa cac act ggc atg aaa ctc aca gaa tct3504Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser115511601165ttc gcc atg tgg ccc ggt gca tcg gtt tcg ggt tgg tac ttc agc cac3552Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His117011751180ccg gac agc aag tac tac gct gta gca caa att cag cgc gat cag gtt3600Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gln Ile Gln Arg Asp Gln Val1185 
1190 1195 1200
gaa gat tat gcc cgc cgt aaa ggt atg agc gtt acc gaa gtt gag cgc 3648
Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg
1205 1210 1215
tgg ctg gca ccg aat ctg ggg tat gac gcg gac tga3684Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp12201225210202111227212PRT213大腸桿菌40020Val Ser Ser Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile1 5 10 15Leu Val Leu Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu20 25 30Asn Glu Ala Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp35 40 45Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile50 55 60Ala Ala Ile His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu65 70 75 80Thr Asn Thr Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Gln Met85 90 95Glu Ser Leu Ser Ala Glu Ile Asn Phe Ala Ala Ala Lys Leu Ala Arg100 105 110Ala Cys Ala Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Tyr115 120 125Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro130 135 140Asp Val Asn Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gly Leu Val145 150 155 160Ala Ala Tyr Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp165 170 175Leu Ile Leu Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala180 185 190Val Phe Ala Val Lys Thr Glu Phe Glu Ala Leu Gly Val Glu Leu Pro195 200 205Ile Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser210 215 220Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala225 230 235 240Leu Thr Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln245 250 255Tyr Val Gln Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His260 265 270
Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala275 280 285Asp Thr Met Ala Lys Gln Ile Arg Glu Trp Ala Gln Ala Gly Phe Leu290 295 300Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Gln His Ile Ala Ala305 310 315 320Met Ser Arg Ala Val Glu Gly Leu Ala Pro Arg Lys Leu Pro Glu Ile325 330 335Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Glu340 345 350Asp Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser355 360 365Ala Lys Phe Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu370 375 380Asp Val Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile385 390 395 400Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe405 410 415Leu Asn Leu Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met420 425 430Ile Asp Ser Ser Lys Trp Asp Val Ile Glu Lys Gly Leu Lys Cys Ile435 440 445Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Asp450 455 460Ala Phe Ile His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val465 470 475 480Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Ala Arg485 490 495Lys Ile Glu Ile Cys Arg Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val500 505 510Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val515 520 525Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly530 535 540Ala Cys Glu Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly545 550 555 560Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg565 570 575Glu Ala Ile His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met580 585 590Asp Met Gly Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asp Leu595 600 605
Pro Ala Glu Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg610 615 620Asp Asp Gly Thr Glu Arg Leu Leu Glu Leu Ala Glu Lys Tyr Arg Gly625 630 635 640Ser Lys Thr Asp Asp Thr Ala Asn Ala Gln Gln Ala Glu Trp Arg Ser645 650 655Trp Glu Val Asn Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr660 665 670Glu Phe Ile Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Thr Arg675 680 685Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val690 695 700Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys705 710 715 720Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile725 730 735Glu Ala Ser Lys Glu Gln Gly Lys Thr Asn Gly Lys Met Val Ile Ala740 745 750Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val755 760 765Val Leu Gln Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val770 775 780Pro Ala Glu Lys Ile Leu Arg Thr Ala Lys Glu Val Asn Ala Asp Leu785 790 795 800Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn805 810 815Val Ala Lys Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile820 825 830Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln835 840 845Asn Tyr Ser Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val850 855 860Gly Val Val Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val865 870 875 880Ala Arg Thr Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Gly Arg885 890 895Lys Lys Pro Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn900 905 910Asp Phe Ala Phe Asp Trp Gln Ala Tyr Thr Pro Pro Val Ala His Arg915 920 925Leu Gly Val Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr
930 935 940
Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr
945 950 955 960
Pro Arg Ile Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu
965 970 975
Phe Lys Asp Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Thr
980 985 990
Leu Asn Pro Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly
995 1000 1005
Asp Asp Ile Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Ile Asn
1010 1015 1020
Val Ser His His Leu Arg Gln Gln Thr Glu Lys Thr Gly Phe Ala Asn
1025 1030 1035 1040
Tyr Cys Leu Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp
1045 1050 1055
Tyr Ile Gly Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu
1060 1065 1070
Ala Asp Ala Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val
1075 1080 1085
Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His
1090 1095 1100
Glu Arg Val Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Asn Leu
1105 1110 1115 1120
Ser Asn Glu Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala
1125 1130 1135
Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Ala Thr Ile Trp
1140 1145 1150
Glu Leu Leu Glu Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser
1155 1160 1165
Phe Ala Met Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His
1170 1175 1180
Pro Asp Ser Lys Tyr Tyr Ala Val Ala Gln Ile Gln Arg Asp Gln Val
1185 1190 1195 1200
Glu Asp Tyr Ala Arg Arg Lys Gly Met Ser Val Thr Glu Val Glu Arg
1205 1210 1215
Trp Leu Ala Pro Asn Leu Gly Tyr Asp Ala Asp
1220 1225
<210> 21
<211> 3771
<212> DNA
<213> Salmonella typhimurium
<220>
<221> CDS
<222> (1)..(3768)
<223> RSY02996
<400> 21
atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac gct ttg tgc 48
Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gln His Ala Leu Cys
1 5 10 15
cag tat ggc tcg tta cgt gga gcg ttg tcg gga gcg agt gtg agc agc 96
Gln Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser
20 25 30
aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg 144
Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile Leu Val Leu
35 40 45
gac ggc ggt atg ggc act atg atc cag agc tat cgt cta cat gaa gaa 192
Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu His Glu Glu
50 55 60
gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc 240
Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly
65 70 75 80
aac aat gac ctg ctg gtc ctc agc aag ccg gag gtg atc gcc gct atc 288
Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile Ala Ala Ile
85 90 95
cac aac gcc tac ttt gag gct ggc gcg gat atc atc gaa acc aac acc 336
His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr
100 105 110
ttt aac tcg aca acc att gcg atg gcg gat tac cgg atg gaa tcc ctg 384
Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Arg Met Glu Ser Leu
115 120 125
tcg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc 432
Ser Ala Glu Ile Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala
130 135 140
gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc 480
Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly
145 150 155 160
gtg ctt ggt cca act aac cgc acg gcc tcc att tcg ccg gac gtc aac 528
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp Val Asn
165 170 175
gac ccg gcg ttt cgt aat atc acc ttc gat cag ctg gtg gcg gcc tac 576
Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gln Leu Val Ala Ala Tyr
180 185 190
cgt gaa tcc acc aaa gcg ctg gtg gaa ggt ggc gca gat ctg att ctg 624
Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu Ile Leu
195 200 205
att gag acc gtt ttt gac acc ctg aat gcg aaa gcg gcg gtg ttt gcg 672
Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala
210 215 220
gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gac ctg ccg atc atg att720Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro Ile Met Ile225 230 235 240tcc ggc acc atc acc gac gcc tct ggc cgt acg ctt tcc ggc cag act768Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr245 250 255acc gaa gcc ttt tat aac tcg ctg cgc cac gcc gag gcg ctc act ttt816Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe260 265 270ggc ctt aac tgc gca ctg ggg cca gat gaa ctg cgc cag tac gtc cag864Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln Tyr Val Gln275 280 285gaa ctg tcg cgg att gcc gaa tgc tac gtc acc gcg cac ccg aac gcc912Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala290 295 300ggc ctg ccg aac gct ttc ggc gag tat gac ctc gac gcc gac acc atg960Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met305 310 315 320gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat atc gtt1008Ala Lys Gln Ile Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn Ile Val325 330 335ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg agc cgc1056Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala Ala Met Ser Arg340 345 350gcc gtt gcc ggt ttg ctg ccg cgc cag ctg ccg gat atc ccg gtc gcc1104Ala Val Ala Gly Leu Leu Pro Arg Gln Leu Pro Asp Ile Pro Val Ala355 360 365tgc cgc ctt tcc ggc ctg gag ccg ctg aac att ggc gac gat agc ctg1152Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Asp Asp Ser Leu370 375 380ttt gtg aac gtc ggc gaa cgt act aac gtc acc ggc tcg gcc aaa ttt1200Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe385 390 395 400aaa cgg ctg atc aaa gaa gag aaa tac agc gaa gcg ctg gat gtc gcc1248Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala405 410 415cgt cag cag gta gaa agc ggc gcg cag att att gat atc aat atg gat1296Arg Gln Gln Val Glu Ser Gly Ala Gln Ile Ile Asp Ile Asn Met Asp420 425 430gag ggg atg ctc gac gcc gaa gcg gcg atg gtg cgt ttc ctc agc ctg1344Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu435 440 445att gcc ggt gag ccg 
gac att gcc cgt gta cca atc atg atc gac tcc1392Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met Ile Asp Ser450 455 460tcc aaa tgg gag gtt atc gaa aaa ggg ctg aag tgc att cag ggt aaa1440Ser Lys Trp Glu Val Ile Glu Lys Gly Leu Lys Cys Ile Gln Gly Lys
465 470 475 480ggc atc gtc aac tct att tcg atg aaa gag ggc gtg gaa gcc ttt att1488Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Glu Ala Phe Ile485 490 495cat cat gcg aag ctt ctg cgt cgc tac ggc gcg gca gtg gtg gtg atg1536His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met500 505 510gcc ttt gat gag cag ggg cag gcc gat acc cgc gcg cgt aaa atc gaa1584Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Ala Arg Lys Ile Glu515 520 525att tgc cgc cga gcc tac aaa att ctc acc gaa gag gtg ggt ttc ccg1632Ile Cys Arg Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val Gly Phe Pro530 535 540ccg gaa gac atc atc ttc gac ccg aat atc ttc gcc gtg gcg acc ggt1680Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala Thr Gly545 550 555 560att gaa gag cac aac aac tac gcg cag gac ttt atc ggc gct tgc gaa1728Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly Ala Cys Glu565 570 575gac atc aaa cgc gag ctg ccg cac gcg ctg atc tcc ggc ggc gtg tct1776Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly Gly Val Ser580 585 590aac gtg tcc ttc tcg ttc cgc ggc aac gac ccg gta cgt gag gct atc1824Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala Ile595 600 605cac gcg gta ttc ctc tac tac gcc atc cgc aac ggt atg gac atg ggc1872His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met Asp Met Gly610 615 620atc gtc aac gcc gga cag ctg gct atc tac gac gac ctg ccc gcc gag1920Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asp Leu Pro Ala Glu625 630 635 640ctg cgc gat gcg gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc1968Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg Asp Asp Gly645 650 655act gag cgt ttg ctg gat ttg gcg gag aaa tac cgc ggc agc aaa acc2016Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr660 665 670gac gaa gct gcc aac gcc cag cag gcg gaa tgg cgt agc tgg gac gtg2064Asp Glu Ala Ala Asn Ala Gln Gln Ala Glu Trp Arg Ser Trp Asp Val675 680 685aaa aag cgt ctc gaa tac tcg ctg gtg aaa ggc att acc gaa ttt atc2112Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr Glu Phe Ile690 695 
700gaa cag gat acc gaa gaa gcc cgt cag cag gcc gcc cgc ccg att gag2160Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Ile Glu705 710 715 720
gtg att gaa ggg ccg ttg atg gac ggc atg aac gtg gtc ggc gac ctg2208Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu725 730 735ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tcc gct cgc2256Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg740 745 750gtg atg aaa caa gcg gtg gcc tac ctg gag ccg ttt att gaa gcc agc2304Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile Glu Ala Ser755 760 765aaa gaa aaa ggc tcc agc aac ggc aag atg gtg atc gcc acc gtg aag2352Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val Ile Ala Thr Val Lys770 775 780ggc gat gtg cac gat att ggt aaa aat atc gtt ggc gtg gtg ctg caa2400Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln785 790 795 800tgt aac aac tac gaa atc gtc gat ctt ggc gtg atg gtg cca gcg gag2448Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val Pro Ala Glu805 810 815aaa atc ctc aga acg gcg cgt gaa gtg aat gcc gat ctg atc ggt ctt2496Lys Ile Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu Ile Gly Leu820 825 830tcc ggg ctg att acc ccg tcg ctg gac gaa atg gtc aac gtg gcg aaa2544Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys835 840 845gag atg gag cgt cag ggc ttt act atc ccg cta ctg atc ggc ggc gca2592Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile Gly Gly Ala850 855 860acc act tcg aaa gcg cat acg gcg gtg aaa atc gag cag aac tac agc2640Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln Asn Tyr Ser865 870 875 880ggt ccg acg gtc tac gtg cag aat gct tcg cgt acc gtg ggc gtg gtg2688Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val Gly Val Val885 890 895gcg gcg cta ctc tcc gac acc cag cgt gat gac ttt gtc gcc cgt acc2736Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val Ala Arg Thr900 905 910cgc aaa gag tac gaa acc gtg cgt att cag cac gcc cgc aaa aaa ccg2784Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Ala Arg Lys Lys Pro915 920 925cgc acg ccg ccg gtc acg ctg gag gcg gcg cgc gat aac gat ctg gca2832Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala930 935 940ttc gat tgg gaa 
cgc tac acc ccg ccg gta gcc cac cgt ctg ggc gtg2880Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val945 950 955 960cag gag gtg gaa gcc agc atc gaa act ctg cgc aac tat atc gac tgg2928Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr Ile Asp Trp
965 970 975acg ccg ttc ttt atg acc tgg tcg ctg gcc ggc aaa tac ccg cgc att2976Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg Ile980 985 990ctg gaa gat gag gtg gtg ggc gtt gag gcg cag cgt ctg ttt aaa gac3024Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu Phe Lys Asp99510001005gcc aat gat atg ctg gat aaa ctg agc gcc gag aaa ctg ttg aat ccg3072Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro101010151020cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac atc3120Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp Ile1025 103010351040gaa atc tat cgc gac gaa acc cgt act cat gtt ctg acg gtc agc cac3168Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His104510501055cac ctg cgc cag cag acc gag aaa gtc ggt ttt gcc aac tac tgt ctg3216His Leu Arg Gln Gln Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu106010651070gcg gat ttt gtc gcg ccg aaa ctg agc ggc aaa gcg gat tac atc ggt3264Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr Ile Gly107510801085gct ttc gcg gtg acc ggc ggt ctg gag gag gat gcg ctg gcg gac gcc3312Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala109010951100ttc gaa gcg caa cac gac gac tat aac aag atc atg gtg aaa gcg att3360Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val Lys Ala Ile1105 111011151120gcc gac cgt ctg gcg gaa gcg ttc gcg gaa tat ctg cat gag cgt gta3408Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val112511301135cgt aag gtt tac tgg gga tat gcg ccg aac gag agc ctg agt aac gac3456Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp114011451150gaa tta atc cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat3504Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr115511601165cct gcc tgc ccg gaa cat acc gaa aaa ggc act atc tgg cag cta ctg3552Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr Ile Trp Gln Leu Leu117011751180gat gtc gaa aaa cac acc ggg atg aag ctc acc gaa tct ttc gcc atg3600Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met1185 
119011951200tgg cca ggc gcg tcg gtc tcc ggc tgg tac ttc agc cat cct gag agc3648Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser120512101215
aaa tac ttc gcg gta gcg cag atc caa cgc gat cag gtg aca gat tat3696Lys Tyr Phe Ala Val Ala Gln Ile Gln Arg Asp Gln Val Thr Asp Tyr122012251230gct ttc cgt aaa gga atg agc gtt gag gat gtt gag cgg tgg ctc gcg3744Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala123512401245ccg aac ctg ggt tac gat gcg gac tga3771Pro Asn Leu Gly Tyr Asp Ala Asp12501255210222111256212PRT213鼠傷寒沙門氏菌40022Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gln His Ala Leu Cys1 5 10 15Gln Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser20 25 30Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile Leu Val Leu35 40 45Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu His Glu Glu50 55 60Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly65 70 75 80Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile Ala Ala Ile85 90 95His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr100 105 110Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Arg Met Glu Ser Leu115 120 125Ser Ala Glu Ile Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala130 135 140Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly145 150 155 160Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp Val Asn165 170 175Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gln Leu Val Ala Ala Tyr180 185 190Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu Ile Leu195 200 205Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala210 215 220Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro Ile Met Ile
225 230 235 240Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr245 250 255Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe260 265 270Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln Tyr Val Gln275 280 285Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala290 295 300Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met305 310 315 320Ala Lys Gln Ile Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn Ile Val325 330 335Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala Ala Met Ser Arg340 345 350Ala Val Ala Gly Leu Leu Pro Arg Gln Leu Pro Asp Ile Pro Val Ala355 360 365Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Asp Asp Ser Leu370 375 380Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe385 390 395 400Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala405 410 415Arg Gln Gln Val Glu Ser Gly Ala Gln Ile Ile Asp Ile Asn Met Asp420 425 430Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu435 440 445Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met Ile Asp Ser450 455 460Ser Lys Trp Glu Val Ile Glu Lys Gly Leu Lys Cys Ile Gln Gly Lys465 470 475 480Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Glu Ala Phe Ile485 490 495His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met500 505 510Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Ala Arg Lys Ile Glu515 520 525Ile Cys Arg Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val Gly Phe Pro530 535 540Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala Thr Gly545 550 555 560
Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly Ala Cys Glu565 570 575Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly Gly Val Ser580 585 590Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala Ile595 600 605His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met Asp Met Gly610 615 620Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asp Leu Pro Ala Glu625 630 635 640Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg Asp Asp Gly645 650 655Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr660 665 670Asp Glu Ala Ala Asn Ala Gln Gln Ala Glu Trp Arg Ser Trp Asp Val675 680 685Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr Glu Phe Ile690 695 700Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Ile Glu705 710 715 720Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu725 730 735Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg740 745 750Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile Glu Ala Ser755 760 765Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val Ile Ala Thr Val Lys770 775 780Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln785 790 795 800Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val Pro Ala Glu805 810 815Lys Ile Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu Ile Gly Leu820 825 830Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys835 840 845Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile Gly Gly Ala850 855 860Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln Asn Tyr Ser865 870 875 880Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val Gly Val Val885 890 895
Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val Ala Arg Thr900 905 910Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Ala Arg Lys Lys Pro915 920 925Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala930 935 940Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val945 950 955 960Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr Ile Asp Trp965 970 975Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg Ile980 985 990Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu Phe Lys Asp995 10001005Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro101010151020Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp Ile1025 103010351040Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His104510501055His Leu Arg Gln Gln Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu106010651070Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr Ile Gly107510801085Ala Phe Ala Val Thr Gly Gly Leu Glu Glu Asp Ala Leu Ala Asp Ala109010951100Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val Lys Ala Ile1105 111011151120Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val112511301135Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp114011451150Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr115511601165Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr Ile Trp Gln Leu Leu117011751180Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met1185 1190 1195 1200Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser120512101215Lys Tyr Phe Ala Val Ala Gln Ile Gln Arg Asp Gln Val Thr Asp Tyr
1220 12251230Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala123512401245Pro Asn Leu Gly Tyr Asp Ala Asp1250 1255210232113771212DNA213傷寒沙門氏菌(Salmonella typhi)220
221CDS222(1)..(3768)223RTY0368640023atg tct cat gtt gcc cgt tgt tct ctt ttc cgc cag cac gct ttg tgc48Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gln His Ala Leu Cys1 5 10 15cag tat ggc tcg tta cgt gga gcg ttg tcg gga gcg agt gtg agc agc96Gln Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser20 25 30aaa gtt gaa caa ctg cgt gcg cag tta aat gaa cgt att ctg gtg ctg144Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile Leu Val Leu35 40 45gac ggc ggt atg ggc acc atg atc cag agc tat cgt cta cat gaa gaa192Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu His Glu Glu50 55 60gat ttc cgc ggg gag cgc ttt gcc gac tgg ccc tgc gac ctg aaa ggc240Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly65 70 75 80aac aat gac ctg ctg gtc ctc agc aag ccg gag gtg atc gcc gct atc288Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile Ala Ala Ile85 90 95cac aac gcc tac ttt gag gct ggc gcg gat atc atc gaa acc aac acc336His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr100 105 110ttt aac tcg aca acc att gcg atg gcg gat tac cgg atg gaa tcc ctg384Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Arg Met Glu Ser Leu115 120 125tcg gcg gaa att aac tat gcg gcg gcc aaa ctg gcg cgc gcc tgc gcc432Ser Ala Glu Ile Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala130 135 140gat gaa tgg acg gcg cga aca cca gaa aaa cca cgc ttt gtt gcg ggc480Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly145 150 155 160gtg ctt ggt cca act aac cgc acg gcc tcc att tcg ccg gac gtc aac528Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp Val Asn
165 170 175gac ccg gcg ttt cgt aat atc acc ttc gat cag ctg gtg gcg gcc tac576Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gln Leu Val Ala Ala Tyr180 185 190cgt gaa tcc acc aaa gcg ctg gtg gaa ggc ggg gcg gac ctg atc ctg624Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu Ile Leu195 200 205att gaa act gtc ttc gac acc ctc aac gcc aaa gcg gcg gtg ttt gcg672Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala210 215 220gtg aaa gaa gag ttt gaa gcg ctg ggc gtt gat ctg ccg atc atg att720Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro Ile Met Ile225 230 235 240tcc ggc acc atc acc gac gcc tct ggc cgt acg ctt tcc ggc cag acg768Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr245 250 255acc gaa gcc ttt tat aac tcg ctg cgc cac gcc gag gcg ctc act ttt816Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe260 265 270ggc ctt aac tgc gcg ctg ggg cca gat gaa ctg cgc cag tac gtc cag864Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln Tyr Val Gln275 280 285gaa ctg tcg cgg att gcc gaa tgc tac gtc acc gcg cac ccg aac gcc912Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala290 295 300ggc ctg ccg aac gct ttc ggc gag tac gac ctc gac gcc gac acc atg960Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met305 310 315 320gcg aaa cag att cgc gaa tgg gcg gaa gcg ggc ttc ctg aat atc gtt1008Ala Lys Gln Ile Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn Ile Val325 330 335ggc ggc tgc tgc ggc acc acg ccg gag cat att gcg gcg atg agc cgc1056Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala Ala Met Ser Arg340 345 350gcc gtt gcc ggt ttg tcg ccg cgc cag ctg ccg gat atc ccg gtg gcc1104Ala Val Ala Gly Leu Ser Pro Arg Gln Leu Pro Asp Ile Pro Val Ala355 360 365tgc cgc ctt tcc ggc ctg gag ccg ctg aac att ggt gac gat agc ctg1152Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Asp Asp Ser Leu370 375 380ttt gtc aac gtc ggc gaa cgt act aac gtc acc ggc tcg gcc aaa ttt1200Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe385 390 395 400aaa cgc ttg 
atc aaa gaa gag aaa tac agc gaa gcg ctg gat gtc gcc1248Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala405 410 415
cgt cag cag gtc gaa agc ggc gcg cag att att gat atc aat atg gat1296Arg Gln Gln Val Glu Ser Gly Ala Gln Ile Ile Asp Ile Asn Met Asp420 425 430gag ggg atg ctc gac gcc gaa gcg gcg atg gtg cgt ttc ctc agc ctg1344Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu435 440 445att gcc ggt gag ccg gac att gcc cgt gta cca atc atg att gac tcc1392Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met Ile Asp Ser450 455 460tcc aaa tgg gag gtt atc gaa aaa ggg ctg aag tgc att cag ggt aaa1440Ser Lys Trp Glu Val Ile Glu Lys Gly Leu Lys Cys Ile Gln Gly Lys465 470 475 480ggc atc gtc aac tct att tcg atg aaa gag ggc gtg gaa gcc ttt att1488Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Glu Ala Phe Ile485 490 495cat cat gcg aag ttg cta cgt cgc tac ggc gcc gca gtg gtg gtg atg1536His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met500 505 510gct ttt gat gag cag ggg cag gcc gac acc cgc gaa cgt aaa atc gag1584Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Glu Arg Lys Ile Glu515 520 525att tgc cgc cgc gct tac aaa att ttg ctc gaa gag gta ggc ttt ccg1632Ile Cys Arg Arg Ala Tyr Lys Ile Leu Leu Glu Glu Val Gly Phe Pro530 535 540ccg gaa gac atc atc ttc gac ccg aat atc ttc gcc gtc gcc acc ggt1680Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala Thr Gly545 550 555 560att gaa gag cac aac aac tac gcg cag gac ttt atc ggc gct tgt gaa1728Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly Ala Cys Glu565 570 575gac atc aaa cgc gag ctg ccg cac gcg ctg atc tcc ggc ggc gtg tct1776Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly Gly Val Ser580 585 590aac gtg tcc ttc tcg ttt cgc ggc aac gac ccg gta cgt gag gct atc1824Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala Ile595 600 605cac gcg gta ttc ctc tac tac gcc atc cgc aac ggc atg gac atg ggc1872His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met Asp Met Gly610 615 620atc gtc aac gcc ggg caa ctg gcg att tat gac aac ctg cct gcc gaa1920Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asn Leu Pro Ala Glu625 630 635 640ctg cgc gat 
gca gtt gaa gat gtc att ctt aac cgt cgc gat gac ggc1968Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg Asp Asp Gly645 650 655acc gag cgt ttg ctg gat ttg gcg gag aaa tat cgc ggc agc aaa acc2016Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr
660 665 670gac gag gct gcc agt gcc cag cag gcg gaa tgg cgt agc tgg gac gtg2064Asp Glu Ala Ala Ser Ala Gln Gln Ala Glu Trp Arg Ser Trp Asp Val675 680 685aaa aag cgt ctc gaa tac tcg ctg gtg aaa ggc att acc gag ttt atc2112Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr Glu Phe Ile690 695 700gaa cag gat acc gaa gaa gcc cgt cag cag gcc gcc cgc ccg att gag2160Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Ile Glu705 710 715 720gtg att gaa ggg ccg ctg atg gac ggc atg aac gtg gtc ggc gac ctg2208Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu725 730 735ttc ggc gaa ggg aaa atg ttc ctg ccg cag gtg gtg aaa tcc gct cgc2256Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg740 745 750gtg atg aaa caa gcg gtg gcc tac ctg gag ccg ttt att gaa gcc agc2304Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile Glu Ala Ser755 760 765aaa gaa aaa ggc tcc agc aac ggc aag atg gtg att gcc acc gtg aag2352Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val Ile Ala Thr Val Lys770 775 780ggc gat gtg cac gac att ggc aag aac att gtc ggc gtg gtg ctg caa2400Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln785 790 795 800tgc aac aac tac gaa atc gtc gat ctt ggc gtg atg gtg cca gcg gag2448Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val Pro Ala Glu805 810 815aaa atc ctc aga acg gcg cgt gaa gtg aat gcc gat ctg att ggt ctt2496Lys Ile Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu Ile Gly Leu820 825 830tcc ggg ctt atc acc ccg tcg ctg gac gaa atg gtc aac gtg gcg aaa2544Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys835 840 845gag atg gag cgt cag ggc ttt act atc ccg cta ctg atc ggc ggc gca2592Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile Gly Gly Ala850 855 860acc act tcg aaa gcg cat acg gcg gtg aaa atc gag cag aac tac agc2640Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln Asn Tyr Ser865 870 875 880ggt ccg acg gtc tac gtg cag aat gct tcg cgt acc gtg ggc gtg gtg2688Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val Gly Val Val885 890 
895gcg gcg cta ctc tcc gac acc cag cgt gat gac ttt gtc gcc cgt acc2736Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val Ala Arg Thr900 905 910
cgc aaa gag tac gaa acc gtg cgt att cag cac gcc cgc aaa aaa ccg2784Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Ala Arg Lys Lys Pro915 920 925cgc acg ccg ccg gtc acg ctg gaa gcg gcg cgc gat aat gat ctg gca2832Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala930 935 940ttt gat tgg gaa cgc tac acc ccg ccg gta gcc cac cgt ctg ggc gtg2880Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val945 950 955 960cag gag gtg gaa gcc agc atc gaa acg ctg cgc aac tac atc gac tgg2928Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr Ile Asp Trp965 970 975acg ccg ttc ttt atg acc tgg tcg ctg gcc ggc aaa tac ccg cgc att2976Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg Ile980 985 990ctg gaa gat gag gtg gtg ggc gtt gag gcg cag cgt ctg ttt aaa gac3024Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu Phe Lys Asp99510001005gcc aat gat atg ctg gat aaa ctg agc gcc gag aaa ctg ttg aat ccg3072Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro101010151020cgt ggc gtg gtg ggc ctg ttc ccg gcg aac cgt gtg ggt gac gac atc3120Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp Ile1025 103010351040gaa atc tat cgc gac gaa acc cgt act cat gtt ctg acg gtc agc cac3168Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His104510501055cac ctg cgc cag cag acc gag aaa gtt ggt ttt gct aac tac tgt ctg3216His Leu Arg Gln Gln Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu106010651070gcg gat ttt gtc gcg ccg aaa ctg agc ggc aaa gcg gac tac atc ggt3264Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr Ile Gly107510801085gct ttc gcg gtg acc ggc ggt ctg aag gag gat gcg ctg gcg gac gcc3312Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala109010951100ttc gaa gcg caa cac gac gac tat aac aag atc atg gtg aaa gcg att3360Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val Lys Ala Ile1105 111011151120gcc gac cgt ctg gcg gaa gcg ttt gcc gag tat ctg cat gag cgt gta3408Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val112511301135cgt 
aag gtt tac tgg gga tat gcg ccg aac gag agc ctg agt aac gac3456Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp114011451150gaa tta atc cgc gaa aac tac cag ggg att cgc ccg gcg ccg ggt tat3504Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr
115511601165cct gcc tgc ccg gaa cat acc gaa aaa ggc act atc tgg cag cta ctg3552Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr Ile Trp Gln Leu Leu117011751180gat gtc gaa aaa cac acc ggg atg aag ctc acc gaa tct ttc gcc atg3600Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met1185 119011951200tgg cct ggc gcg tcg gtc tcc ggc tgg tac ttc agc cat cct gag agc3648Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser120512101215aaa tac ttc gcg gta gcg cag atc caa cgc gat cag gtg aca gat tat3696Lys Tyr Phe Ala Val Ala Gln Ile Gln Arg Asp Gln Val Thr Asp Tyr122012251230gct ttc cgt aaa gga atg agc gtt gag gac gtt gag cgg tgg ctc gcg3744Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala123512401245ccg aac ctg ggt tac gat gcg gac tga3771Pro Asn Leu Gly Tyr Asp Ala Asp12501255210242111256212PRT213傷寒沙門氏菌40024Met Ser His Val Ala Arg Cys Ser Leu Phe Arg Gln His Ala Leu Cys1 5 10 15Gln Tyr Gly Ser Leu Arg Gly Ala Leu Ser Gly Ala Ser Val Ser Ser20 25 30Lys Val Glu Gln Leu Arg Ala Gln Leu Asn Glu Arg Ile Leu Val Leu35 40 45Asp Gly Gly Met Gly Thr Met Ile Gln Ser Tyr Arg Leu His Glu Glu50 55 60Asp Phe Arg Gly Glu Arg Phe Ala Asp Trp Pro Cys Asp Leu Lys Gly65 70 75 80Asn Asn Asp Leu Leu Val Leu Ser Lys Pro Glu Val Ile Ala Ala Ile85 90 95His Asn Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn Thr100 105 110Phe Asn Ser Thr Thr Ile Ala Met Ala Asp Tyr Arg Met Glu Ser Leu115 120 125Ser Ala Glu Ile Asn Tyr Ala Ala Ala Lys Leu Ala Arg Ala Cys Ala130 135 140Asp Glu Trp Thr Ala Arg Thr Pro Glu Lys Pro Arg Phe Val Ala Gly145 150 155 160
Val Leu Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp Val Asn165 170 175Asp Pro Ala Phe Arg Asn Ile Thr Phe Asp Gln Leu Val Ala Ala Tyr180 185 190Arg Glu Ser Thr Lys Ala Leu Val Glu Gly Gly Ala Asp Leu Ile Leu195 200 205Ile Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Val Phe Ala210 215 220Val Lys Glu Glu Phe Glu Ala Leu Gly Val Asp Leu Pro Ile Met Ile225 230 235 240Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr245 250 255Thr Glu Ala Phe Tyr Asn Ser Leu Arg His Ala Glu Ala Leu Thr Phe260 265 270Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu Arg Gln Tyr Val Gln275 280 285Glu Leu Ser Arg Ile Ala Glu Cys Tyr Val Thr Ala His Pro Asn Ala290 295 300Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu Asp Ala Asp Thr Met305 310 315 320Ala Lys Gln Ile Arg Glu Trp Ala Glu Ala Gly Phe Leu Asn Ile Val325 330 335Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Ala Ala Met Ser Arg340 345 350Ala Val Ala Gly Leu Ser Pro Arg Gln Leu Pro Asp Ile Pro Val Ala355 360 365Cys Arg Leu Ser Gly Leu Glu Pro Leu Asn Ile Gly Asp Asp Ser Leu370 375 380Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Lys Phe385 390 395 400Lys Arg Leu Ile Lys Glu Glu Lys Tyr Ser Glu Ala Leu Asp Val Ala405 410 415Arg Gln Gln Val Glu Ser Gly Ala Gln Ile Ile Asp Ile Asn Met Asp420 425 430Glu Gly Met Leu Asp Ala Glu Ala Ala Met Val Arg Phe Leu Ser Leu435 440 445Ile Ala Gly Glu Pro Asp Ile Ala Arg Val Pro Ile Met Ile Asp Ser450 455 460Ser Lys Trp Glu Val Ile Glu Lys Gly Leu Lys Cys Ile Gln Gly Lys465 470 475 480Gly Ile Val Asn Ser Ile Ser Met Lys Glu Gly Val Glu Ala Phe Ile
485 490 495His His Ala Lys Leu Leu Arg Arg Tyr Gly Ala Ala Val Val Val Met500 505 510Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Arg Glu Arg Lys Ile Glu515 520 525Ile Cys Arg Arg Ala Tyr Lys Ile Leu Leu Glu Glu Val Gly Phe Pro530 535 540Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe Ala Val Ala Thr Gly545 550 555 560Ile Glu Glu His Asn Asn Tyr Ala Gln Asp Phe Ile Gly Ala Cys Glu565 570 575Asp Ile Lys Arg Glu Leu Pro His Ala Leu Ile Ser Gly Gly Val Ser580 585 590Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Pro Val Arg Glu Ala Ile595 600 605His Ala Val Phe Leu Tyr Tyr Ala Ile Arg Asn Gly Met Asp Met Gly610 615 620Ile Val Asn Ala Gly Gln Leu Ala Ile Tyr Asp Asn Leu Pro Ala Glu625 630 635 640Leu Arg Asp Ala Val Glu Asp Val Ile Leu Asn Arg Arg Asp Asp Gly645 650 655Thr Glu Arg Leu Leu Asp Leu Ala Glu Lys Tyr Arg Gly Ser Lys Thr660 665 670Asp Glu Ala Ala Ser Ala Gln Gln Ala Glu Trp Arg Ser Trp Asp Val675 680 685Lys Lys Arg Leu Glu Tyr Ser Leu Val Lys Gly Ile Thr Glu Phe Ile690 695 700Glu Gln Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Ile Glu705 710 715 720Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu725 730 735Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg740 745 750Val Met Lys Gln Ala Val Ala Tyr Leu Glu Pro Phe Ile Glu Ala Ser755 760 765Lys Glu Lys Gly Ser Ser Asn Gly Lys Met Val Ile Ala Thr Val Lys770 775 780Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln785 790 795 800Cys Asn Asn Tyr Glu Ile Val Asp Leu Gly Val Met Val Pro Ala Glu805 810 815
Lys Ile Leu Arg Thr Ala Arg Glu Val Asn Ala Asp Leu Ile Gly Leu820 825 830Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Lys835 840 845Glu Met Glu Arg Gln Gly Phe Thr Ile Pro Leu Leu Ile Gly Gly Ala850 855 860Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile Glu Gln Asn Tyr Ser865 870 875 880Gly Pro Thr Val Tyr Val Gln Asn Ala Ser Arg Thr Val Gly Val Val885 890 895Ala Ala Leu Leu Ser Asp Thr Gln Arg Asp Asp Phe Val Ala Arg Thr900 905 910Arg Lys Glu Tyr Glu Thr Val Arg Ile Gln His Ala Arg Lys Lys Pro915 920 925Arg Thr Pro Pro Val Thr Leu Glu Ala Ala Arg Asp Asn Asp Leu Ala930 935 940Phe Asp Trp Glu Arg Tyr Thr Pro Pro Val Ala His Arg Leu Gly Val945 950 955 960Gln Glu Val Glu Ala Ser Ile Glu Thr Leu Arg Asn Tyr Ile Asp Trp965 970 975Thr Pro Phe Phe Met Thr Trp Ser Leu Ala Gly Lys Tyr Pro Arg Ile980 985 990Leu Glu Asp Glu Val Val Gly Val Glu Ala Gln Arg Leu Phe Lys Asp99510001005Ala Asn Asp Met Leu Asp Lys Leu Ser Ala Glu Lys Leu Leu Asn Pro101010151020Arg Gly Val Val Gly Leu Phe Pro Ala Asn Arg Val Gly Asp Asp Ile1025 103010351040Glu Ile Tyr Arg Asp Glu Thr Arg Thr His Val Leu Thr Val Ser His104510501055His Leu Arg Gln Gln Thr Glu Lys Val Gly Phe Ala Asn Tyr Cys Leu106010651070Ala Asp Phe Val Ala Pro Lys Leu Ser Gly Lys Ala Asp Tyr Ile Gly107510801085Ala Phe Ala Val Thr Gly Gly Leu Lys Glu Asp Ala Leu Ala Asp Ala109010951100Phe Glu Ala Gln His Asp Asp Tyr Asn Lys Ile Met Val Lys Ala Ile1105 111011151120Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Tyr Leu His Glu Arg Val112511301135Arg Lys Val Tyr Trp Gly Tyr Ala Pro Asn Glu Ser Leu Ser Asn Asp114011451150
Glu Leu Ile Arg Glu Asn Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr115511601165Pro Ala Cys Pro Glu His Thr Glu Lys Gly Thr Ile Trp Gln Leu Leu117011751180Asp Val Glu Lys His Thr Gly Met Lys Leu Thr Glu Ser Phe Ala Met1185 119011951200Trp Pro Gly Ala Ser Val Ser Gly Trp Tyr Phe Ser His Pro Glu Ser120512101215Lys Tyr Phe Ala Val Ala Gln Ile Gln Arg Asp Gln Val Thr Asp Tyr122012251230Ala Phe Arg Lys Gly Met Ser Val Glu Asp Val Glu Arg Trp Leu Ala123512401245Pro Asn Leu Gly Tyr Asp Ala Asp12501255210252113711212DNA213螢光假單胞菌(Pseudomonas fluorescens)220
<221> CDS
<222> (1)..(3708)
<223> RPU03563
<400> 25
atg tcc gat cgc agc gtc cgc ctt caa gct ctc aag caa gct ctc aaa48Met Ser Asp Arg Ser Val Arg Leu Gln Ala Leu Lys Gln Ala Leu Lys1 5 10 15gag cgc atc ctg att ctc gac ggc ggc atg ggc acg atg atc cag agc96Glu Arg Ile Leu Ile Leu Asp Gly Gly Met Gly Thr Met Ile Gln Ser20 25 30tac aag ctc gaa gag cag gat tat cgc ggc aaa cgc ttc gcc gac tgg144Tyr Lys Leu Glu Glu Gln Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp35 40 45ccg agc gac gtc aag ggc aac aac gac ctg ttg gtg ctg acc cgc ccg192Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Arg Pro50 55 60gac gtg atc ggc ggc atc gag aaa gcc tat ctg gat gcc ggt gcc gac240Asp Val Ile Gly Gly Ile Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp65 70 75 80atc ctc gag acc aac acc ttc aac gcc acg cag att tcc atg gcc gac288Ile Leu Glu Thr Asn Thr Phe Asn Ala Thr Gln Ile Ser Met Ala Asp85 90 95tac ggc atg gaa gaa ctg gtc tac gaa ctc aac gta gaa ggc gcc cgt336Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly Ala Arg100 105 110
ctg gca cgc aag gtc gcc gac gcg aaa acc ctc gag acc ccc gac aag384Leu Ala Arg Lys Val Ala Asp Ala Lys Thr Leu Glu Thr Pro Asp Lys115 120 125ccg cgc ttc gtc gcc ggc gtt ctc ggc ccg acc agc cgc acc tgc tcg432Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser130 135 140ctg tcg ccg gac gtc aac aac ccg ggc tat cgc aac gtc acc ttc gat480Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp145 150 155 160gag ctg gtc gaa aac tac acc gag gcc acc aaa ggc ctg atc gag ggc528Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu Ile Glu Gly165 170 175ggc gcg gat ctg atc ctg atc gaa acc atc ttc gac acc ctc aac gcc576Gly Ala Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala180 185 190aaa gcc gcg atc ttc gcc gtg caa ggc gtg ttc gaa gaa ctg ggc ttc624Lys Ala Ala Ile Phe Ala Val Gln Gly Val Phe Glu Glu Leu Gly Phe195 200 205gaa ttg ccg atc atg atc tcc ggc acc atc acc gac gcc tcc ggc cgt672Glu Leu Pro Ile Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg210 215 220acc ctg tcg ggc cag acc acc gaa gcg ttc tgg aac tcc gtg gct cac720Thr Leu Ser Gly Gln Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His225 230 235 240gcc aaa ccg att tcc gtc ggt ctt aac tgc gcc ctc ggc gcc cgc gaa768Ala Lys Pro Ile Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu245 250 255ctg cgt ccg tac ctg gaa gag ctg tcg gac aag gcc agc acc cac gtt816Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val260 265 270tcg gcg cac ccg aac gcc ggc ctg ccg aac gaa ttc ggc gag tac gac864Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp275 280 285gag ctg ccg gtg gac acc gcc aag gtc atc gaa gag ttc gcc cag agc912Glu Leu Pro Val Asp Thr Ala Lys Val Ile Glu Glu Phe Ala Gln Ser290 295 300ggt ttc ctc aac atc gtc ggc ggt tgc tgc ggc acc acg ccg ggc cat960Gly Phe Leu Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His305 310 315 320atc gaa gcc atc gcc aaa gcc gtt gcc ggt tac gcg cca cgg cag att1008Ile Glu Ala Ile Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gln Ile325 330 335ccg gac att ccc aag gcc
tgc cgc ctg tcg ggt ctg gaa ccg ttc acc1056Pro Asp Ile Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro Phe Thr340 345 350att gat cgc agc tcg ctg ttc gtc aac gtc ggc gag cgg acc aac atc1104Ile Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Ile
355 360 365acc ggg tcc gcg aaa ttt gcc cgg ctg atc cgt gaa gac aac tac acc1152Thr Gly Ser Ala Lys Phe Ala Arg Leu Ile Arg Glu Asp Asn Tyr Thr370 375 380gaa gcc ctg gaa gtc gcc ctg cag cag gtc gag gcc ggc gcc cag gtg1200Glu Ala Leu Glu Val Ala Leu Gln Gln Val Glu Ala Gly Ala Gln Val385 390 395 400atc gac atc aac atg gac gaa ggg atg ctc gat tcg aag aag gcc atg1248Ile Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ser Lys Lys Ala Met405 410 415gtg acc ttc ctc aat ctg att gcc ggc gaa ccg gac atc tcc cgc gta1296Val Thr Phe Leu Asn Leu Ile Ala Gly Glu Pro Asp Ile Ser Arg Val420 425 430ccg atc atg atc gac tcc tcg aaa tgg gac gtg atc gaa gcc ggc ctc1344Pro Ile Met Ile Asp Ser Ser Lys Trp Asp Val Ile Glu Ala Gly Leu435 440 445aag tgc att cag ggc aag ggc atc gtc aac tcg atc agc atg aaa gaa1392Lys Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Met Lys Glu450 455 460ggc gtc gag cag ttc atc cac cac gcc aaa ctg tgc aag cgc tat ggc1440Gly Val Glu Gln Phe Ile His His Ala Lys Leu Cys Lys Arg Tyr Gly465 470 475 480gcc gcc gtg gtg gtg atg gcg ttc gac gaa gcc ggc cag gct gac acc1488Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gln Ala Asp Thr485 490 495gaa gcg cgc aag aaa gag atc tgc aaa cgc tcc tac gac att ctg gtc1536Glu Ala Arg Lys Lys Glu Ile Cys Lys Arg Ser Tyr Asp Ile Leu Val500 505 510aac gaa gtc ggc ttc ccg ccg gaa gac atc att ttc gac ccg aac atc1584Asn Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile515 520 525ttc gcc gtg gcc acc ggc atc gaa gaa cac aac aac tac gct gtg gac1632Phe Ala Val Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Val Asp530 535 540ttc atc aac gcc tgt gcc tac atc cgc gac gag ctg ccg tat gcc ctg1680Phe Ile Asn Ala Cys Ala Tyr Ile Arg Asp Glu Leu Pro Tyr Ala Leu545 550 555560agc tcc ggc ggc gtg tcc aac gtg tcg ttc tcg ttc cgc ggc aac aac1728Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn565 570 575ccg gtg cgc gag gcg atc cac tcg gtg ttc ctg ctg tac gcg atc cgc1776Pro Val Arg Glu Ala Ile His Ser Val Phe Leu Leu Tyr Ala Ile Arg580 585 590gcc 
ggc ctg acc atg ggt atc gtc aac gcc ggt cag ctg gag atc tac1824Ala Gly Leu Thr Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr595 600 605
gac cag atc ccg cag gaa ctg cgc gac gcc gtt gaa gac gtg atc ctc1872Asp Gln Ile Pro Gln Glu Leu Arg Asp Ala Val Glu Asp Val Ile Leu610 615 620aac cgc acg ccg gaa ggc acc gac gcc ctc ctc gcc atc gcc gac aag1920Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala Ile Ala Asp Lys625 630 635 640tac aag ggc gac ggc agc gtc aag gaa gcc gag acc gaa gaa tgg cgc1968Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg645 650 655ggc tgg gac gtc aac aaa cgt ctg gaa cat gcg ctg gtc aag ggc atc2016Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly Ile660 665 670acc acc cac atc gtc gaa gac acc gaa gaa tcc cgt cag tcc ttc gcc2064Thr Thr His Ile Val Glu Asp Thr Glu Glu Ser Arg Gln Ser Phe Ala675 680 685cgc ccg atc gaa gtg atc gaa ggc ccg ctg atg tcc ggc atg aac atc2112Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Ser Gly Met Asn Ile690 695 700gtc ggc gac ctg ttc ggc gcc ggc aaa atg ttc ctg ccg caa gtg gtg2160Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln Val Val705 710 715 720aaa tcc gcc cgc gtg atg aag cag gcc gtg gcg cac ctg att ccg ttc2208Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Ile Pro Phe725 730 735atc gaa ctg gaa aaa ggc gac aag ccg gaa gcc aag ggc aag atc ctg2256Ile Glu Leu Glu Lys Gly Asp Lys Pro Glu Ala Lys Gly Lys Ile Leu740 745 750atg gcc acg gtc aaa ggc gac gtg cac gac atc ggc aag aac atc gtc2304Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val755 760 765ggc gtg gtg ctg ggt tgc aac ggc tac gac atc gtc gac ctc ggc gtg2352Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Ile Val Asp Leu Gly Val770 775 780atg gtg ccg gcg gag aag atc ctg cag gtg gcc aag gag cag aag tgc2400Met Val Pro Ala Glu Lys Ile Leu Gln Val Ala Lys Glu Gln Lys Cys785 790 795 800gac atc atc ggc ctg tcc ggt ctg atc acc ccg tcg ctg gat gag atg2448Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met805 810 815gtc cat gtg gcc cgc gag atg cag cgc cag gac ttc cac ctg ccg ctg2496Val His Val Ala Arg Glu Met Gln Arg Gln Asp Phe His Leu Pro Leu820 825 830atg atc ggc 
ggc gcg acc acc tcc aag gcg cac acg gcg gtg aag atc2544Met Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile835 840 845gag ccc aag tac agc aac gac gca gtg gtc tac gtg acc gac gcc tcc2592Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser
850 855 860cgc gcc gtg ggc gtg gcg acg cag ttg ctg tcc aag gaa ctg aaa gcc2640Arg Ala Val Gly Val Ala Thr Gln Leu Leu Ser Lys Glu Leu Lys Ala865 870 875 880ggt ttc gtc cag aag acc cgc gaa gag tac atc gac gtc cgc gag cgc2688Gly Phe Val Gln Lys Thr Arg Glu Glu Tyr Ile Asp Val Arg Glu Arg885 890 895acc gcc aac cgc agc gcc cgc acc gaa cgc ctg agc tac gcc gcc gcg2736Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala900 905 910atc gcc aag aag ccg cag ttc gac tgg gcc act tac acc ccg gtc aaa2784Ile Ala Lys Lys Pro Gln Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys915 920 925ccg acc ttc acc ggc acc cgc gtg ctg gac aac atc gac ctc aac gtt2832Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn Ile Asp Leu Asn Val930 935 940ctc gcc gag tac atc gac tgg acg ccg ttc ttc atc tcc tgg gac ctg2880Leu Ala Glu Tyr Ile Asp Trp Thr Pro Phe Phe Ile Ser Trp Asp Leu945 950 955 960gcc ggc aag ttc ccg cgc atc ctc gaa gac gaa gtg gtc ggc gaa gcg2928Ala Gly Lys Phe Pro Arg Ile Leu Glu Asp Glu Val Val Gly Glu Ala965 970 975gcg acc gcg ctg tac aag gac gct cgc gag atg ctg acc aag ctg atc2976Ala Thr Ala Leu Tyr Lys Asp Ala Arg Glu Met Leu Thr Lys Leu Ile980 985 990gac gag aaa ctg atc agc gcc cgt gcg gtg ttc ggc ttc tgg ccg gcc3024Asp Glu Lys Leu Ile Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala99510001005aat cag gtg cac gac gac gat atc gag ctg tac ggc gat gac ggc aag3072Asn Gln Val His Asp Asp Asp Ile Glu Leu Tyr Gly Asp Asp Gly Lys101010151020cca atg gcg cgc ctg cat cac ctg cgc cag cag atc atc aag acc gac3120Pro Met Ala Arg Leu His His Leu Arg Gln Gln Ile Ile Lys Thr Asp1025 103010351040ggc aaa ccg aac ttc tcc ctc gcc gac ttc gtc gcg ccg aag gac agc3168Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Lys Asp Ser104510501055gaa gtg acc gac tac gtt ggt ggt ttc atc acc acc gcc ggg atc ggc3216Glu Val Thr Asp Tyr Val Gly Gly Phe Ile Thr Thr Ala Gly Ile Gly106010651070gcc gaa gaa gtg gcc aag gcc tat cag gac gcc ggc gac gat tac aac3264Ala Glu Glu Val Ala Lys Ala Tyr Gln Asp Ala Gly Asp Asp Tyr
Asn107510801085tcg atc atg gtc aag gcc ctg gcc gac cgt ctg gcc gag gcg tgc gcc3312Ser Ile Met Val Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Cys Ala109010951100
gag tgg ctg cac cag cag gtg cgc aaa gag cac tgg ggt tac gcc aag3360Glu Trp Leu His Gln Gln Val Arg Lys Glu His Trp Gly Tyr Ala Lys1105 111011151120gat gaa gcc ctc gat aac gag gcg ctg atc aaa gag cag tat tcc ggc3408Asp Glu Ala Leu Asp Asn Glu Ala Leu Ile Lys Glu Gln Tyr Ser Gly112511301135atc cgc cct gcc ccc ggc tac ccg gcg tgc ccg gat cac acc gag aag3456Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys114011451150gcc acc ctg ttc gcc ctg ctc gac cct gaa gca cag gaa atg cgc gcc3504Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gln Glu Met Arg Ala115511601165ggc cgc agc ggt gtg ttc ctc acc gag cac tac gcg atg ttc ccg gcg3552Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala117011751180gca gcc gtc agc ggc tgg tac ttc gcc cat ccg cag gcg cag tac ttc3600Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gln Ala Gln Tyr Phe1185 119011951200gcc gtg ggc aag gtc gac aag gat cag gtg cag agc tac acc tcg cgc3648Ala Val Gly Lys Val Asp Lys Asp Gln Val Gln Ser Tyr Thr Ser Arg120512101215aaa ggc cag gaa ctg agc ctg acc gag cgc tgg ctg gca ccc aat ctg3696Lys Gly Gln Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu122012251230ggc tac gac aac tga3711Gly Tyr Asp Asn1235
<210> 26
<211> 1236
<212> PRT
<213> Pseudomonas fluorescens
<400> 26
Met Ser Asp Arg Ser Val Arg Leu Gln Ala Leu Lys Gln Ala Leu Lys1 5 10 15Glu Arg Ile Leu Ile Leu Asp Gly Gly Met Gly Thr Met Ile Gln Ser20 25 30Tyr Lys Leu Glu Glu Gln Asp Tyr Arg Gly Lys Arg Phe Ala Asp Trp35 40 45Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Val Leu Thr Arg Pro50 55 60Asp Val Ile Gly Gly Ile Glu Lys Ala Tyr Leu Asp Ala Gly Ala Asp65 70 75 80Ile Leu Glu Thr Asn Thr Phe Asn Ala Thr Gln Ile Ser Met Ala Asp85 90 95Tyr Gly Met Glu Glu Leu Val Tyr Glu Leu Asn Val Glu Gly Ala Arg
100 105 110Leu Ala Arg Lys Val Ala Asp Ala Lys Thr Leu Glu Thr Pro Asp Lys115 120 125Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser Arg Thr Cys Ser130 135 140Leu Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn Val Thr Phe Asp145 150 155 160Glu Leu Val Glu Asn Tyr Thr Glu Ala Thr Lys Gly Leu Ile Glu Gly165 170 175Gly Ala Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala180 185 190Lys Ala Ala Ile Phe Ala Val Gln Gly Val Phe Glu Glu Leu Gly Phe195 200 205Glu Leu Pro Ile Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg210 215 220Thr Leu Ser Gly Gln Thr Thr Glu Ala Phe Trp Asn Ser Val Ala His225 230 235 240Ala Lys Pro Ile Ser Val Gly Leu Asn Cys Ala Leu Gly Ala Arg Glu245 250 255Leu Arg Pro Tyr Leu Glu Glu Leu Ser Asp Lys Ala Ser Thr His Val260 265 270Ser Ala His Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp275 280 285Glu Leu Pro Val Asp Thr Ala Lys Val Ile Glu Glu Phe Ala Gln Ser290 295 300Gly Phe Leu Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Gly His305 310 315 320Ile Glu Ala Ile Ala Lys Ala Val Ala Gly Tyr Ala Pro Arg Gln Ile325 330 335Pro Asp Ile Pro Lys Ala Cys Arg Leu Ser Gly Leu Glu Pro Phe Thr340 345 350Ile Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn Ile355 360 365Thr Gly Ser Ala Lys Phe Ala Arg Leu Ile Arg Glu Asp Asn Tyr Thr370 375 380Glu Ala Leu Glu Val Ala Leu Gln Gln Val Glu Ala Gly Ala Gln Val385 390 395 400Ile Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ser Lys Lys Ala Met405 410 415Val Thr Phe Leu Asn Leu Ile Ala Gly Glu Pro Asp Ile Ser Arg Val420 425 430
Pro Ile Met Ile Asp Ser Ser Lys Trp Asp Val Ile Glu Ala Gly Leu435 440 445Lys Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Met Lys Glu450 455 460Gly Val Glu Gln Phe Ile His His Ala Lys Leu Cys Lys Arg Tyr Gly465 470 475 480Ala Ala Val Val Val Met Ala Phe Asp Glu Ala Gly Gln Ala Asp Thr485 490 495Glu Ala Arg Lys Lys Glu Ile Cys Lys Arg Ser Tyr Asp Ile Leu Val500 505 510Asn Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile515 520 525Phe Ala Val Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Val Asp530 535 540Phe Ile Asn Ala Cys Ala Tyr Ile Arg Asp Glu Leu Pro Tyr Ala Leu545 550 555 560Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn565 570 575Pro Val Arg Glu Ala Ile His Ser Val Phe Leu Leu Tyr Ala Ile Arg580 585 590Ala Gly Leu Thr Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr595 600 605Asp Gln Ile Pro Gln Glu Leu Arg Asp Ala Val Glu Asp Val Ile Leu610 615 620Asn Arg Thr Pro Glu Gly Thr Asp Ala Leu Leu Ala Ile Ala Asp Lys625 630 635 640Tyr Lys Gly Asp Gly Ser Val Lys Glu Ala Glu Thr Glu Glu Trp Arg645 650 655Gly Trp Asp Val Asn Lys Arg Leu Glu His Ala Leu Val Lys Gly Ile660 665 670Thr Thr His Ile Val Glu Asp Thr Glu Glu Ser Arg Gln Ser Phe Ala675 680 685Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Ser Gly Met Asn Ile690 695 700Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln Val Val705 710 715 720Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Ile Pro Phe725 730 735Ile Glu Leu Glu Lys Gly Asp Lys Pro Glu Ala Lys Gly Lys Ile Leu740 745 750Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val755 760 765
Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Ile Val Asp Leu Gly Val770 775 780Met Val Pro Ala Glu Lys Ile Leu Gln Val Ala Lys Glu Gln Lys Cys785 790 795 800Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met805 810 815Val His Val Ala Arg Glu Met Gln Arg Gln Asp Phe His Leu Pro Leu820 825 830Met Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile835 840 845Glu Pro Lys Tyr Ser Asn Asp Ala Val Val Tyr Val Thr Asp Ala Ser850 855 860Arg Ala Val Gly Val Ala Thr Gln Leu Leu Ser Lys Glu Leu Lys Ala865 870 875 880Gly Phe Val Gln Lys Thr Arg Glu Glu Tyr Ile Asp Val Arg Glu Arg885 890 895Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser Tyr Ala Ala Ala900 905 910Ile Ala Lys Lys Pro Gln Phe Asp Trp Ala Thr Tyr Thr Pro Val Lys915 920 925Pro Thr Phe Thr Gly Thr Arg Val Leu Asp Asn Ile Asp Leu Asn Val930 935 940Leu Ala Glu Tyr Ile Asp Trp Thr Pro Phe Phe Ile Ser Trp Asp Leu945 950 955 960Ala Gly Lys Phe Pro Arg Ile Leu Glu Asp Glu Val Val Gly Glu Ala965 970 975Ala Thr Ala Leu Tyr Lys Asp Ala Arg Glu Met Leu Thr Lys Leu Ile980 985 990Asp Glu Lys Leu Ile Ser Ala Arg Ala Val Phe Gly Phe Trp Pro Ala99510001005Asn Gln Val His Asp Asp Asp Ile Glu Leu Tyr Gly Asp Asp Gly Lys101010151020Pro Met Ala Arg Leu His His Leu Arg Gln Gln Ile Ile Lys Thr Asp1025 103010351040Gly Lys Pro Asn Phe Ser Leu Ala Asp Phe Val Ala Pro Lys Asp Ser104510501055Glu Val Thr Asp Tyr Val Gly Gly Phe Ile Thr Thr Ala Gly Ile Gly106010651070Ala Glu Glu Val Ala Lys Ala Tyr Gln Asp Ala Gly Asp Asp Tyr Asn107510801085Ser Ile Met Val Lys Ala Leu Ala Asp Arg Leu Ala Glu Ala Cys Ala
1090 10951100Glu Trp Leu His Gln Gln Val Arg Lys Glu His Trp Gly Tyr Ala Lys1105 111011151120Asp Glu Ala Leu Asp Asn Glu Ala Leu Ile Lys Glu Gln Tyr Ser Gly112511301135Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys114011451150Ala Thr Leu Phe Ala Leu Leu Asp Pro Glu Ala Gln Glu Met Arg Ala115511601165Gly Arg Ser Gly Val Phe Leu Thr Glu His Tyr Ala Met Phe Pro Ala117011751180Ala Ala Val Ser Gly Trp Tyr Phe Ala His Pro Gln Ala Gln Tyr Phe1185 119011951200Ala Val Gly Lys Val Asp Lys Asp Gln Val Gln Ser Tyr Thr Ser Arg120512101215Lys Gly Gln Glu Leu Ser Leu Thr Glu Arg Trp Leu Ala Pro Asn Leu122012251230Gly Tyr Asp Asn1235
<210> 27
<211> 3705
<212> DNA
<213> Pseudomonas aeruginosa
<220>
<221> CDS
<222> (1)..(3702)
<223> RPA01772
<400> 27
atg tcc agc ccg ctc acc gat cgc agc gcc cgc ctg caa gcc ctc cag48Met Ser Ser Pro Leu Thr Asp Arg Ser Ala Arg Leu Gln Ala Leu Gln1 5 10 15cac gcc ctc agg gaa cgt atc ctg atc ctc gat ggc ggc atg ggc acc96His Ala Leu Arg Glu Arg Ile Leu Ile Leu Asp Gly Gly Met Gly Thr20 25 30atg atc cag agc tac aag ctg gaa gag gcc gac tac cgc ggc gag cgc144Met Ile Gln Ser Tyr Lys Leu Glu Glu Ala Asp Tyr Arg Gly Glu Arg35 40 45ttc gcc gac tgg ccg agc gac gtg aaa ggc aac aac gac ctc ttg ctg192Phe Ala Asp Trp Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Leu50 55 60ctg agc cgc ccg gac gtg atc cag gcc atc gag aag gcc tac ctc gac240Leu Ser Arg Pro Asp Val Ile Gln Ala Ile Glu Lys Ala Tyr Leu Asp65 70 75 80gcc ggc gcc gac atc ctc gag acc aac acc ttc aac gcc acc cag gtg288
Ala Gly Ala Asp Ile Leu Glu Thr Asn Thr Phe Asn Ala Thr Gln Val85 90 95tcc cag gcc gac tac ggc atg cag tcg ctg gcc tac gaa ctc aac gtc336Ser Gln Ala Asp Tyr Gly Met Gln Ser Leu Ala Tyr Glu Leu Asn Val100 105 110gaa ggg gcg cgc ctg gcc cgc cag gtg gcg gac gcg aag acc gcc gag384Glu Gly Ala Arg Leu Ala Arg Gln Val Ala Asp Ala Lys Thr Ala Glu115 120 125acc ccg gac aag ccg cgt ttc gtc gcc ggc gtg ctc ggc ccg acc agc432Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser130 135 140cgc acc tgc tcg att tcc ccg gac gtg aac aac ccc ggc tac cgc aac480Arg Thr Cys Ser Ile Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn145 150 155 160gtc acc ttc gac gaa ctg gtg gag aac tac gtc gag gcg acc cga ggc528Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly165 170 175ctg atc gaa ggc ggc gcc gac ctg atc ctg atc gag acc atc ttc gac576Leu Ile Glu Gly Gly Ala Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp180 185 190acc ctc aac gcc aag gcg gcg atc ttc gcc gtc cag ggc gtg ttc gag624Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Val Gln Gly Val Phe Glu195 200 205gaa ctc ggc gtg gag ctg ccg atc atg atc tcc gga acc atc acc gac672Glu Leu Gly Val Glu Leu Pro Ile Met Ile Ser Gly Thr Ile Thr Asp210 215 220gcc tcc ggc cgc acc ctg tcg ggc cag acc acc gag gcc ttc tgg aac720Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Thr Glu Ala Phe Trp Asn225 230 235 240tcg gtg cgg cat gcc cgg ccg atc tcg gta ggc ctg aac tgc gcc ctc768Ser Val Arg His Ala Arg Pro Ile Ser Val Gly Leu Asn Cys Ala Leu245 250 255ggc gcc aag gaa ttg cgg ccg tac atc gag gaa ctg tcg acc aag gcc816Gly Ala Lys Glu Leu Arg Pro Tyr Ile Glu Glu Leu Ser Thr Lys Ala260 265 270gac act cat gtc tcg gcc cac ccc aac gcc ggc ctg ccg aac gcc ttc864Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe275 280 285ggc gaa tac gac gaa tcg ccg gcg gaa atg gcc gtg gtg gtc gag gaa912Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu290 295 300ttc gcc gcc gcc ggc ttc ctc aat atc gtc ggc ggc tgc tgc ggc acc960Phe Ala Ala Ala Gly Phe Leu Asn 
Ile Val Gly Gly Cys Cys Gly Thr305 310 315 320acc ccg gcg cac atc gag gcg atc gcc aag gca gtg gcc aag tac ccg1008Thr Pro Ala His Ile Glu Ala Ile Ala Lys Ala Val Ala Lys Tyr Pro325 330 335
ccg cgg gcc atc ccg gag att ccc cgg gcc tgt cgc ctg tcc ggc ctg1056Pro Arg Ala Ile Pro Glu Ile Pro Arg Ala Cys Arg Leu Ser Gly Leu340 345 350gag ccg ttc acc atc gac cgc agc tcg ctg ttc gtc aac gtc ggc gag1104Glu Pro Phe Thr Ile Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu355 360 365cgc acc aac atc acc ggt tcg gcc aag ttc gcc cgg ctg atc cgc gag1152Arg Thr Asn Ile Thr Gly Ser Ala Lys Phe Ala Arg Leu Ile Arg Glu370 375 380gaa aac tac gcg gaa gct ctc gag gtc gcc cag cag cag gtg gaa gcc1200Glu Asn Tyr Ala Glu Ala Leu Glu Val Ala Gln Gln Gln Val Glu Ala385 390 395 400ggc gcc cag gtg atc gac atc aac atg gac gaa ggc atg ctg gac tcg1248Gly Ala Gln Val Ile Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ser405 410 415aag gcg gcc atg gtc acc ttc ctc aac ctg atc gcc tcc gag ccc gac1296Lys Ala Ala Met Val Thr Phe Leu Asn Leu Ile Ala Ser Glu Pro Asp420 425 430atc tcg cgc gtg ccg atc atg atc gac tcc tcc aag tgg gaa gtg atc1344Ile Ser Arg Val Pro Ile Met Ile Asp Ser Ser Lys Trp Glu Val Ile435 440 445gag gcc ggc ctg aag tgc atc cag ggc aag ggc atc gtc aac tcg atc1392Glu Ala Gly Leu Lys Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile450 455 460tcg atg aag gaa ggc gtc gag gcc ttc aag cac cat gcc cgc ctg tgc1440Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys465 470 475 480aag cgc tac ggc gcc gcg gtg gtg gtg atg gcc ttc gac gag gac ggc1488Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly485 490 495cag gcc gac acc cag gcg cgc aag gaa gaa atc tgc aag cgc tcc tac1536Gln Ala Asp Thr Gln Ala Arg Lys Glu Glu Ile Cys Lys Arg Ser Tyr500 505 510gac atc ctg gtc gac gaa gtc ggc ttc cca ccg gaa gac atc atc ttc1584Asp Ile Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe515 520 525gat gcg aac atc ttc gcc atc gcc acc ggc atc gag gaa cac aac aac1632Asp Ala Asn Ile Phe Ala Ile Ala Thr Gly Ile Glu Glu His Asn Asn530 535 540tac gcg gtc gat ttc atc aac gcc tgc gcc tac atc cgc gac aac ctc1680Tyr Ala Val Asp Phe Ile Asn Ala Cys Ala Tyr Ile Arg Asp Asn Leu545 550 555 560ccc tac gcc 
ctg agc tcg ggc ggg gtg tcc aac gtg tcc ttc tcg ttc1728Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe565 570 575cgc ggc aac aac ccg gta cgc gag gcg atc cac tcg gtg ttc ctc tac1776
Arg Gly Asn Asn Pro Val Arg Glu Ala Ile His Ser Val Phe Leu Tyr580 585 590tac gcg atc cgc aac ggc ctg acc atg ggc atc gtc aac gcc ggc cag1824Tyr Ala Ile Arg Asn Gly Leu Thr Met Gly Ile Val Asn Ala Gly Gln595 600 605ctg gaa atc tac gac gag att ccg aaa gcg ctg cgc gac cgg gtc gag1872Leu Glu Ile Tyr Asp Glu Ile Pro Lys Ala Leu Arg Asp Arg Val Glu610 615 620gac gtg gtg ctc aac cgc acg ccc gag gcc acc gag gcc ctg ctg gcg1920Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala625 630 635 640atc gcc gac gac tac aag ggc ggc ggc gcg gtc aag gag gcc gag gac1968Ile Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp645 650 655gag gaa tgg cgc agc tac agc gtc gag aag cgc ctc gag cat gcg ctg2016Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu660 665 670gtc aag ggc atc acc acc tgg atc gtc gag gac acc gag gaa tgc cgc2064Val Lys Gly Ile Thr Thr Trp Ile Val Glu Asp Thr Glu Glu Cys Arg675 680 685cag cag tgt gcg cgt ccc atc gag gtc atc gaa ggt ccg ctg atg tcc2112Gln Gln Cys Ala Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Ser690 695 700ggg atg aac gtg gtc ggc gac ctg ttc ggc gcc ggc aag atg ttc ctc2160Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu705 710 715 720ccg cag gtg gtc aag tcc gcg cga gtg atg aag cag gcg gtg gcc cac2208Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His725 730 735ctg att ccc ttc atc gag gcg gag aaa ggc gac aag ccg gaa gcc aag2256Leu Ile Pro Phe Ile Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys740 745 750ggc aag atc ctg atg gcc acg gtg aag ggc gac gtg cac gac atc ggc2304Gly Lys Ile Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly755 760 765aag aac atc gtc ggc gtg gtg ctc ggc tgc aac ggc tat gac gtg gtc2352Lys Asn Ile Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val770 775 780gac ctc ggc gtg atg gtg ccg gcg gag aag atc ctg cag acc gcc atc2400Asp Leu Gly Val Met Val Pro Ala Glu Lys Ile Leu Gln Thr Ala Ile785 790 795 800gcc gag aaa tgc gac atc atc ggc ctg tct ggc ctg atc acg ccg tcg2448Ala Glu Lys 
Cys Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser805 810 815ctg gac gag atg gtc cac gtc gcc aag gaa atg cag cgg cag aat ttc2496Leu Asp Glu Met Val His Val Ala Lys Glu Met Gln Arg Gln Asn Phe820 825 830
cag ttg ccg ctg atg atc ggc ggc gcc act acc tcg aag gcg cat acc2544Gln Leu Pro Leu Met Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr835 840 845gcg gtg aag atc gat ccg cag tac agc aac gac gcg gtg gtc tac gtc2592Ala Val Lys Ile Asp Pro Gln Tyr Ser Asn Asp Ala Val Val Tyr Val850 855 860acc gac gcc tcg cgc gcg gta ggc gtg gcc acc agc ctg ctg tcc aag2640Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys865 870 875 880gag ctg aag gcc gac tac gtg gcc cgc acc cgc gcc gac tac gcg gtg2688Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val885 890 895gtc cgc gaa cgc acg gcc aac cgc agc gcc cgc acc gag cgg ctg agc2736Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser900 905 910tac gaa cag gcg atc gcc aac aag ccg gcg ttc gac tgg gcc ggc tac2784Tyr Glu Gln Ala Ile Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr915 920 925cag gcg ccg acg cct tcc ttc acc ggc gtc agg gtg ctc gac gag atc2832Gln Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu Ile930 935 940gac ctc gcg gtg ctc gcc gag tac atc gac tgg acg ccg ttc ttc att2880Asp Leu Ala Val Leu Ala Glu Tyr Ile Asp Trp Thr Pro Phe Phe Ile945 950 955 960tcc tgg gac ctg gcc ggc aag tac ccg cgc atc ctc acc gac gag gtg2928Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg Ile Leu Thr Asp Glu Val965 970 975gtc ggc gag gcc gcc acc tcg ttg ttc aac gac gcc cag gcg atg ctg2976Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gln Ala Met Leu980 985 990aag aag ctg atc gac gag aag ctg atc aag gcc cgc gcg gtg ttc ggc3024Lys Lys Leu Ile Asp Glu Lys Leu Ile Lys Ala Arg Ala Val Phe Gly99510001005ttc tgg ccg gcc aac cag gtc gag cac gac gac ctg gag gtc tac ggc3072Phe Trp Pro Ala Asn Gln Val Glu His Asp Asp Leu Glu Val Tyr Gly101010151020gcc gat ggc gag acc ctc gcc acc ctg cac cac ctg cgg cag cag acg3120Ala Asp Gly Glu Thr Leu Ala Thr Leu His His Leu Arg Gln Gln Thr1025 103010351040atc aag ccg gac ggc aag ccg aac ctg tcg ctg gcc gat ttc gtc gcg3168Ile Lys Pro Asp Gly Lys Pro Asn Leu Ser Leu Ala Asp Phe Val Ala104510501055ccg aag 
gaa agc ggc gtg cgc gac tac atc ggc ggc ttc atc acc acc3216Pro Lys Glu Ser Gly Val Arg Asp Tyr Ile Gly Gly Phe Ile Thr Thr106010651070gcc ggg atc ggc gcc gag gaa gtg gcc aag gcg tac gaa gcc aag ggc3264
Ala Gly Ile Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly107510801085gac gac tac aac agc atc atg gtc aag gcg ctc gcc gac cgc ctc gcc3312Asp Asp Tyr Asn Ser Ile Met Val Lys Ala Leu Ala Asp Arg Leu Ala109010951100gaa gcc tgc gcc gag tgg ctg cac gag cgg gtg cgc aag gag tac tgg3360Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Trp1105 111011151120ggc tac gcc cgc gac gaa cac ctc gac aac gag gcc ttg atc aag gag3408Gly Tyr Ala Arg Asp Glu His Leu Asp Asn Glu Ala Leu Ile Lys Glu112511301135caa tac gtc ggc atc cgc ccg gca ccg ggc tac ccg gcc tgc ccc gac3456Gln Tyr Val Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp114011451150cat acc gag aaa ggc act ctg ttc gaa ctg ctc gat ccg cag ggc ctg3504His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gln Gly Leu115511601165tcc ggc gtc agc ctg acc gag cac tac gcg atg ttc ccg gcc gcg gcg3552Ser Gly Val Ser Leu Thr Glu His Tyr Ala Met Phe Pro Ala Ala Ala117011751180gtc agc ggt tgg tat ttc gcc cac ccg cag gcg cag tac ttc gcg gtc3600Val Ser Gly Trp Tyr Phe Ala His Pro Gln Ala Gln Tyr Phe Ala Val1185 119011951200ggc aag atc gac aag gac cag gtg gaa cgc tac agc cag cgc aag ggc3648Gly Lys Ile Asp Lys Asp Gln Val Glu Arg Tyr Ser Gln Arg Lys Gly120512101215cag gaa gcc agc gtc agc gag cgc tgg ctg gcg ccg aac ctt ggc tac3696Gln Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr122012251230gat gac tga3705Asp Asp
<210> 28
<211> 1234
<212> PRT
<213> Pseudomonas aeruginosa
<400> 28
Met Ser Ser Pro Leu Thr Asp Arg Ser Ala Arg Leu Gln Ala Leu Gln1 5 10 15His Ala Leu Arg Glu Arg Ile Leu Ile Leu Asp Gly Gly Met Gly Thr20 25 30Met Ile Gln Ser Tyr Lys Leu Glu Glu Ala Asp Tyr Arg Gly Glu Arg35 40 45Phe Ala Asp Trp Pro Ser Asp Val Lys Gly Asn Asn Asp Leu Leu Leu50 55 60
Leu Ser Arg Pro Asp Val Ile Gln Ala Ile Glu Lys Ala Tyr Leu Asp65 70 75 80Ala Gly Ala Asp Ile Leu Glu Thr Asn Thr Phe Asn Ala Thr Gln Val85 90 95Ser Gln Ala Asp Tyr Gly Met Gln Ser Leu Ala Tyr Glu Leu Asn Val100 105 110Glu Gly Ala Arg Leu Ala Arg Gln Val Ala Asp Ala Lys Thr Ala Glu115 120 125Thr Pro Asp Lys Pro Arg Phe Val Ala Gly Val Leu Gly Pro Thr Ser130 135 140Arg Thr Cys Ser Ile Ser Pro Asp Val Asn Asn Pro Gly Tyr Arg Asn145 150 155 160Val Thr Phe Asp Glu Leu Val Glu Asn Tyr Val Glu Ala Thr Arg Gly165 170 175Leu Ile Glu Gly Gly Ala Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp180 185 190Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Val Gln Gly Val Phe Glu195 200 205Glu Leu Gly Val Glu Leu Pro Ile Met Ile Ser Gly Thr Ile Thr Asp210 215 220Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Thr Glu Ala Phe Trp Asn225 230 235 240Ser Val Arg His Ala Arg Pro Ile Ser Val Gly Leu Asn Cys Ala Leu245 250 255Gly Ala Lys Glu Leu Arg Pro Tyr Ile Glu Glu Leu Ser Thr Lys Ala260 265 270Asp Thr His Val Ser Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe275 280 285Gly Glu Tyr Asp Glu Ser Pro Ala Glu Met Ala Val Val Val Glu Glu290 295 300Phe Ala Ala Ala Gly Phe Leu Asn Ile Val Gly Gly Cys Cys Gly Thr305 310 315 320Thr Pro Ala His Ile Glu Ala Ile Ala Lys Ala Val Ala Lys Tyr Pro325 330 335Pro Arg Ala Ile Pro Glu Ile Pro Arg Ala Cys Arg Leu Ser Gly Leu340 345 350Glu Pro Phe Thr Ile Asp Arg Ser Ser Leu Phe Val Asn Val Gly Glu355 360 365Arg Thr Asn Ile Thr Gly Ser Ala Lys Phe Ala Arg Leu Ile Arg Glu370 375 380Glu Asn Tyr Ala Glu Ala Leu Glu Val Ala Gln Gln Gln Val Glu Ala385 390 395 400
Gly Ala Gln Val Ile Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ser405 410 415Lys Ala Ala Met Val Thr Phe Leu Asn Leu Ile Ala Ser Glu Pro Asp420 425 430Ile Ser Arg Val Pro Ile Met Ile Asp Ser Ser Lys Trp Glu Val Ile435 440 445Glu Ala Gly Leu Lys Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile450 455 460Ser Met Lys Glu Gly Val Glu Ala Phe Lys His His Ala Arg Leu Cys465 470 475 480Lys Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Asp Gly485 490 495Gln Ala Asp Thr Gln Ala Arg Lys Glu Glu Ile Cys Lys Arg Ser Tyr500 505 510Asp Ile Leu Val Asp Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe515 520 525Asp Ala Asn Ile Phe Ala Ile Ala Thr Gly Ile Glu Glu His Asn Asn530 535 540Tyr Ala Val Asp Phe Ile Asn Ala Cys Ala Tyr Ile Arg Asp Asn Leu545 550 555 560Pro Tyr Ala Leu Ser Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe565 570 575Arg Gly Asn Asn Pro Val Arg Glu Ala Ile His Ser Val Phe Leu Tyr580 585 590Tyr Ala Ile Arg Asn Gly Leu Thr Met Gly Ile Val Asn Ala Gly Gln595 600 605Leu Glu Ile Tyr Asp Glu Ile Pro Lys Ala Leu Arg Asp Arg Val Glu610 615 620Asp Val Val Leu Asn Arg Thr Pro Glu Ala Thr Glu Ala Leu Leu Ala625 630 635 640Ile Ala Asp Asp Tyr Lys Gly Gly Gly Ala Val Lys Glu Ala Glu Asp645 650 655Glu Glu Trp Arg Ser Tyr Ser Val Glu Lys Arg Leu Glu His Ala Leu660 665 670Val Lys Gly Ile Thr Thr Trp Ile Val Glu Asp Thr Glu Glu Cys Arg675 680 685Gln Gln Cys Ala Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Ser690 695 700Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu705 710 715 720Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His
725 730 735Leu Ile Pro Phe Ile Glu Ala Glu Lys Gly Asp Lys Pro Glu Ala Lys740 745 750Gly Lys Ile Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly755 760 765Lys Asn Ile Val Gly Val Val Leu Gly Cys Asn Gly Tyr Asp Val Val770 775 780Asp Leu Gly Val Met Val Pro Ala Glu Lys Ile Leu Gln Thr Ala Ile785 790 795 800Ala Glu Lys Cys Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser805 810 815Leu Asp Glu Met Val His Val Ala Lys Glu Met Gln Arg Gln Asn Phe820 825 830Gln Leu Pro Leu Met Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr835 840 845Ala Val Lys Ile Asp Pro Gln Tyr Ser Asn Asp Ala Val Val Tyr Val850 855 860Thr Asp Ala Ser Arg Ala Val Gly Val Ala Thr Ser Leu Leu Ser Lys865 870 875 880Glu Leu Lys Ala Asp Tyr Val Ala Arg Thr Arg Ala Asp Tyr Ala Val885 890 895Val Arg Glu Arg Thr Ala Asn Arg Ser Ala Arg Thr Glu Arg Leu Ser900 905 910Tyr Glu Gln Ala Ile Ala Asn Lys Pro Ala Phe Asp Trp Ala Gly Tyr915 920 925Gln Ala Pro Thr Pro Ser Phe Thr Gly Val Arg Val Leu Asp Glu Ile930 935 940Asp Leu Ala Val Leu Ala Glu Tyr Ile Asp Trp Thr Pro Phe Phe Ile945 950 955 960Ser Trp Asp Leu Ala Gly Lys Tyr Pro Arg Ile Leu Thr Asp Glu Val965 970 975Val Gly Glu Ala Ala Thr Ser Leu Phe Asn Asp Ala Gln Ala Met Leu980 985 990Lys Lys Leu Ile Asp Glu Lys Leu Ile Lys Ala Arg Ala Val Phe Gly99510001005Phe Trp Pro Ala Asn Gln Val Glu His Asp Asp Leu Glu Val Tyr Gly101010151020Ala Asp Gly Glu Thr Leu Ala Thr Leu His His Leu Arg Gln Gln Thr1025 103010351040Ile Lys Pro Asp Gly Lys Pro Asn Leu Ser Leu Ala Asp Phe Val Ala104510501055
Pro Lys Glu Ser Gly Val Arg Asp Tyr Ile Gly Gly Phe Ile Thr Thr
1060 1065 1070
Ala Gly Ile Gly Ala Glu Glu Val Ala Lys Ala Tyr Glu Ala Lys Gly
1075 1080 1085
Asp Asp Tyr Asn Ser Ile Met Val Lys Ala Leu Ala Asp Arg Leu Ala
1090 1095 1100
Glu Ala Cys Ala Glu Trp Leu His Glu Arg Val Arg Lys Glu Tyr Trp
1105 1110 1115 1120
Gly Tyr Ala Arg Asp Glu His Leu Asp Asn Glu Ala Leu Ile Lys Glu
1125 1130 1135
Gln Tyr Val Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp
1140 1145 1150
His Thr Glu Lys Gly Thr Leu Phe Glu Leu Leu Asp Pro Gln Gly Leu
1155 1160 1165
Ser Gly Val Ser Leu Thr Glu His Tyr Ala Met Phe Pro Ala Ala Ala
1170 1175 1180
Val Ser Gly Trp Tyr Phe Ala His Pro Gln Ala Gln Tyr Phe Ala Val
1185 1190 1195 1200
Gly Lys Ile Asp Lys Asp Gln Val Glu Arg Tyr Ser Gln Arg Lys Gly
1205 1210 1215
Gln Glu Ala Ser Val Ser Glu Arg Trp Leu Ala Pro Asn Leu Gly Tyr
1220 1225 1230
Asp Asp
<210> 29
<211> 3714
<212> DNA
<213> Nitrosomonas europaea
<220>
221CDS222(1)..(3711)223RNE0173240029atg aca atg cat gaa cgt gct gat ttg ctg aaa cgg ttg ctt gcc gag 48Met Thr Met His Glu Arg Ala Asp Leu Leu Lys Arg Leu Leu Ala Glu1 5 10 15cgt atc ctg atg ctc gac ggt gcc atg ggt acg atg atc cag agc tac 96Arg Ile Leu Met Leu Asp Gly Ala Met Gly Thr Met Ile Gln Ser Tyr20 25 30aaa ctg acc gag tcg gat tat cgg ggg gaa cgt ttt gcc gat ttt ccg144Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro35 40 45cat gat ctc aaa ggc aac aat gat ctg ctc tgc ctg acc aga ccg gaa192His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu
50 55 60gtc atc cgc tcc att cat cgt gct tac ctc gaa gcc ggg tcg gat atc240Val Ile Arg Ser Ile His Arg Ala Tyr Leu Glu Ala Gly Ser Asp Ile65 70 75 80atc gag acc aac acg ttc aac tcg aat gcg ccg tcg atg gcg gac tac288Ile Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr85 90 95cac atg cag gat ctg gtg tat gaa ctg aat gtg gcg ggt gcg cgc ctg336His Met Gln Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu100 105 110gcg tgt gag gaa gcg cgg gca atg gaa acg cag caa cct gac cgg ccc384Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gln Gln Pro Asp Arg Pro115 120 125cgt ttc gtt gcc ggt gtg atc ggg cct acc acc aaa acg gct tca ctc432Arg Phe Val Ala Gly Val Ile Gly Pro Thr Thr Lys Thr Ala Ser Leu130 135 140tca ccg gat gtc aat gat cct gga ttc cgg gcc att acc ttc gat gat480Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala Ile Thr Phe Asp Asp145 150 155 160ctg gtg gaa agc tat acc gag tcg gtg cgc ggg ctg atc gac gga ggc528Leu Val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu Ile Asp Gly Gly165 170 175gcg gat att ctg ctg gtc gaa acc att ttt gac acc ttg aat gcc aaa576Ala Asp Ile Leu Leu Val Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys180 185 190gcc gca ttg ttt gcc atc gat cag tat ttc gaa acg cat gga tta cgt624Ala Ala Leu Phe Ala Ile Asp Gln Tyr Phe Glu Thr His Gly Leu Arg195 200 205ctg ccg gtg atg ata tcg gtc acg att acc gat gct tcg gga cgt aat672Leu Pro Val Met Ile Ser Val Thr Ile Thr Asp Ala Ser Gly Arg Asn210 215 220ctt tcc ggg cag aca ccg gaa gct ttc tgg aat tcg gta cgg cat gca720Leu Ser Gly Gln Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala225 230 235 240cgt ccg ctt tcg gtg gga atc aac tgc gcg ttg ggt gcg gag ttg atg768Arg Pro Leu Ser Val Gly Ile Asn Cys Ala Leu Gly Ala Glu Leu Met245 250 255cgc ccc tac gtg gaa gag ttg tcc aat gtg gct gag gtt ttc acc agc816Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser260 265 270gcc cat ccc aat gcc ggc ttg cct aat ccc ttg gcg gaa acc ggt tat864Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly Tyr275 280 285gac gaa acg ccg gaa tat 
acc gcc cgt ctg atc aag gat ttt gcg caa912Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu Ile Lys Asp Phe Ala Gln290 295 300
tcc ggg ttc gtc aac att gtc ggc ggc tgc tgt ggc act aca ccg aaa 960Ser Gly Phe Val Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Lys305 310 315 320cat atc gcg gcc att gca gaa gcg gta cgg gac atc cct ccg cgc cca1008His Ile Ala Ala Ile Ala Glu Ala Val Arg Asp Ile Pro Pro Arg Pro325 330 335ctg ccc gat att cct aaa aaa ctg agg ctt tcc ggc ctc gag ccg ctc1056Leu Pro Asp Ile Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu340 345 350aat atc gat gaa cat tcc ctg ttc gta aac gtg ggt gaa cgt acc aat1104Asn Ile Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn355 360 365gtc acc ggc tcc aag gca ttt gcc cgg ctg att ctc aat ggc ggt tat1152Val Thr Gly Ser Lys Ala Phe Ala Arg Leu Ile Leu Asn Gly Gly Tyr370 375 380gct gaa ggg ctg gtg atc gcg cgc agc cag gtg gag aac ggc gca caa1200Ala Glu Gly Leu Val Ile Ala Arg Ser Gln Val Glu Asn Gly Ala Gln385 390 395 400atc atc gat atc aac atg gat gaa gcg atg ctg gat tca cag aag gcg1248Ile Ile Asp Ile Asn Met Asp Glu Ala Met Leu Asp Ser Gln Lys Ala405 410 415atg gtg acc ttt ctg aat ctg ctc gct gcc gaa ccg gat atc agc cgg1296Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Asp Ile Ser Arg420 425 430ctg ccg atc atg ctc gat tcc agc aaa tgg tcg gtg atc gaa gcc gga1344Leu Pro Ile Met Leu Asp Ser Ser Lys Trp Ser Val Ile Glu Ala Gly435 440 445ctg aaa tgt gtc cag ggt aag gcg gtc atc aat tcc atc agc ctc aag1392Leu Lys Cys Val Gln Gly Lys Ala Val Ile Asn Ser Ile Ser Leu Lys450 455 460gaa ggt gaa gcg gag ttt tta cat cat gcc agg ctg gcg cgt cgt tat1440Glu Gly Glu Ala Glu Phe Leu His His Ala Arg Leu Ala Arg Arg Tyr465 470 475 480ggg gcc gcg gtg att gtc atg gct ttc gac gaa acc ggg cag gcc gat1488Gly Ala Ala Val Ile Val Met Ala Phe Asp Glu Thr Gly Gln Ala Asp485 490 495acc ttg cag cgc aag gtg gaa atc tgc acg cgt tgt tac cat aca ctg1536Thr Leu Gln Arg Lys Val Glu Ile Cys Thr Arg Cys Tyr His Thr Leu500 505 510att gaa cag gcc gat ttc cca ccc gag gat atc att ttc gac ccc aat1584Ile Glu Gln Ala Asp Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn515 520 525att ttt gcc 
att gct acg ggt atc gaa gaa cac agt aac tat gca gtg1632Ile Phe Ala Ile Ala Thr Gly Ile Glu Glu His Ser Asn Tyr Ala Val530 535 540gat ttt atc gag gcg aca cac gtc atc cgg caa acg ctg cct tat gcc1680Asp Phe Ile Glu Ala Thr His Val Ile Arg Gln Thr Leu Pro Tyr Ala
545 550 555 560aaa gtc agc ggg ggt gtt tcc aat gtt tcc ttc tcg ttc cgg ggt aac1728Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn565 570 575gaa ccg atc cgc gaa gcc att cat acc gca ttc ctg tat cac gcg gtc1776Glu Pro Ile Arg Glu Ala Ile His Thr Ala Phe Leu Tyr His Ala Val580 585 590aag gca ggc atg acc atg ggt atc gtc aac gca ggt cag ctt ggg gtt1824Lys Ala Gly Met Thr Met Gly Ile Val Asn Ala Gly Gln Leu Gly Val595 600 605tat tcc gac att ccg ccc gat ctg ctg gaa cat gtc gag gat gta ctg1872Tyr Ser Asp Ile Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu610 615 620ctg aac cgg cgg cct gat gca acc gaa cgt ctg gtg gag ttt gcg gaa1920Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu625 630 635 640cat ttc aag gga cag aaa aag gag cag atc gaa gat ctg tcc tgg cgt1968His Phe Lys Gly Gln Lys Lys Glu Gln Ile Glu Asp Leu Ser Trp Arg645 650 655gat gaa ccg gtg cgg cag cgc ctg att cat gca ctg gtc agg ggt atc2016Asp Glu Pro Val Arg Gln Arg Leu Ile His Ala Leu Val Arg Gly Ile660 665 670agc acc tac atc gtc gag gat acc gag ctc gtc cgg cag gag atc gac2064Ser Thr Tyr Ile Val Glu Asp Thr Glu Leu Val Arg Gln Glu Ile Asp675 680 685agc cag gga ggc aag ccg atc gag gtg atc gaa ggc ccg ctc atg gac2112Ser Gln Gly Gly Lys Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp690 695 700ggc atg aat gta gtg ggg gat ctg ttt ggc gca ggc aag atg ttt ctg2160Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu705 710 715 720cca cag gtg gtc aag tcg gca cgg gtg atg aag cag gcg gtt gcc tat2208Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr725 730 735ctg ttg ccg tac atc gag gca gag aaa aaa att tcc ggc gac agc aag2256Leu Leu Pro Tyr Ile Glu Ala Glu Lys Lys Ile Ser Gly Asp Ser Lys740 745 750ccc aag ggc aag gtg gtg atc gct acc gtc aaa ggg gat gtg cat gat2304Pro Lys Gly Lys Val Val Ile Ala Thr Val Lys Gly Asp Val His Asp755 760 765att ggc aag aat atc gtt tcc gtc gtg ttg cag tgt aat aac ttt gaa2352Ile Gly Lys Asn Ile Val Ser Val Val Leu Gln Cys Asn Asn Phe Glu770 775 
780gtc atc aac atg ggg gtg atg gtc ccc agt gca cag att ctg gaa aca2400Val Ile Asn Met Gly Val Met Val Pro Ser Ala Gln Ile Leu Glu Thr785 790 795 800
gca cgc cgt gaa cag gtc gat atg atc ggt ctg tcc ggc ctg atc acc  2448
Ala Arg Arg Glu Gln Val Asp Met Ile Gly Leu Ser Gly Leu Ile Thr
805 810 815
cct tcg ctg gaa gaa atg gcg cat gtt gcc cgg gaa atg gag cgt gaa  2496
Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu
820 825 830
caa ttc acc gtt ccg ctg ctg atc ggt ggc gcc acc act tcg cgg atg  2544
Gln Phe Thr Val Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Met
835 840 845
cat acg gca gtc aaa atc gca ccc cat tac ggt ggg gtg acc gta tgg  2592
His Thr Ala Val Lys Ile Ala Pro His Tyr Gly Gly Val Thr Val Trp
850 855 860
gtg ccg gat gcc agc cgg gca gtc ggg gtg tgc agc aat ctg atg tca  2640
Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser
865 870 875 880
cag gat ctg cgt gat gac tat gtc cgg cag gtc aag gcc gag cag gag  2688
Gln Asp Leu Arg Asp Asp Tyr Val Arg Gln Val Lys Ala Glu Gln Glu
885 890 895
aag agc cgg gtg cag cac cgc aac aag aaa ggg cca tcc aag ctc ctc  2736
Lys Ser Arg Val Gln His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu
900 905 910
act ttc gag gaa gcc cgg gcc aac gca ctc aag acg gat tgg gct cgt  2784
Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg
915 920 925
tat act cca cca gct ccg gat ttc ctg ggg ttg cgc acc ctc aac aac  2832
Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn
930 935 940
tat ccg ctg gaa aca ctg gtg ccg cac atc gac tgg aca cct ttc ttc  2880
Tyr Pro Leu Glu Thr Leu Val Pro His Ile Asp Trp Thr Pro Phe Phe
945 950 955 960
cag gca tgg gaa ctg cac ggg cgc tat cct gcc atc ctg cag gat gaa  2928
Gln Ala Trp Glu Leu His Gly Arg Tyr Pro Ala Ile Leu Gln Asp Glu
965 970 975
ctc gtc ggg gaa gca gcc agc aat ctg ttt cgc gat gcc cag aat atg  2976
Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gln Asn Met
980 985 990
ctc aga aaa atc gtc gag caa aaa tgg ctc acc gcc aac gcc gtt atc  3024
Leu Arg Lys Ile Val Glu Gln Lys Trp Leu Thr Ala Asn Ala Val Ile
995 1000 1005
ggc ctg ttc ccg gcc aat acc gtc aat gga gat gat atc gag att tat  3072
Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp Ile Glu Ile Tyr
1010 1015 1020
gct gac cgt agt
cgc agt cag gtg atc atg acc tgg cac acc ttg cgg3120Ala Asp Arg Ser Arg Ser Gln Val Ile Met Thr Trp His Thr Leu Arg1025103010351040cag cag acg gcc aaa ccg gca ggg cgt ccc aat ctg gca ctg gct gat3168Gln Gln Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Asp
104510501055ttc att gcg ccg cgt gaa acc gga ctg gac gat acc atc ggt ttg ttt3216Phe Ile Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr Ile Gly Leu Phe106010651070gcc gtc agc gcc ggt ttc ggt atc gat gaa cgc ata cgc gct ttt gaa3264Ala Val Ser Ala Gly Phe Gly Ile Asp Glu Arg Ile Arg Ala Phe Glu107510801085gct gca aac gat gat tac agt gcc atc atc ctg aaa gca ctg gct gat3312Ala Ala Asn Asp Asp Tyr Ser Ala Ile Ile Leu Lys Ala Leu Ala Asp109010951100cgt ctg gct gaa gcg ttt gca gaa cac atg cat gca cgg gtg cgg cga3360Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg1105111011151120gaa ttc tgg ggc tat gtg aaa gat gag agt ctg gac aat gaa cag ttg3408Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gln Leu112511301135atc gac gag caa tac ctg gga atc cgt cca gca cca ggt tat cct gcc3456Ile Asp Glu Gln Tyr Leu Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala114011451150tgc cct gat cat acc gaa aag ggg cca ttg ttc gct ctg ctg gaa gcg3504Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala115511601165gaa aaa cgc agc gga atc gtc ata acg gaa tca ttt gcc atg gtg ccg3552Glu Lys Arg Ser Gly Ile Val Ile Thr Glu Ser Phe Ala Met Val Pro117011751180act gca gca gta tcc ggc ttc tat ctc tct tac cct gaa tcc agc tat3600Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr1185119011951200ttt gct gtt gga aaa atc gga aaa gat cag gtc gag gat tat gca aga3648Phe Ala Val Gly Lys Ile Gly Lys Asp Gln Val Glu Asp Tyr Ala Arg120512101215cgc aaa ggg tgg acg ctg gaa gaa gca gaa agg tgg ctt gcg cct gtc3696Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val122012251230ttg gcg tat gag cgt taa3714Leu Ala Tyr Glu Arg1235210302111237212PRT213歐洲亞硝化單胞菌40030Met Thr Met His Glu Arg Ala Asp Leu Leu Lys Arg Leu Leu Ala Glu1 5 10 15Arg Ile Leu Met Leu Asp Gly Ala Met Gly Thr Met Ile Gln Ser Tyr20 25 30
Lys Leu Thr Glu Ser Asp Tyr Arg Gly Glu Arg Phe Ala Asp Phe Pro35 40 45His Asp Leu Lys Gly Asn Asn Asp Leu Leu Cys Leu Thr Arg Pro Glu50 55 60Val Ile Arg Ser Ile His Arg Ala Tyr Leu Glu Ala Gly Ser Asp Ile65 70 75 80Ile Glu Thr Asn Thr Phe Asn Ser Asn Ala Pro Ser Met Ala Asp Tyr85 90 95His Met Gln Asp Leu Val Tyr Glu Leu Asn Val Ala Gly Ala Arg Leu100 105 110Ala Cys Glu Glu Ala Arg Ala Met Glu Thr Gln Gln Pro Asp Arg Pro115 120 125Arg Phe Val Ala Gly Val Ile Gly Pro Thr Thr Lys Thr Ala Ser Leu130 135 140Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Ala Ile Thr Phe Asp Asp145 150 155 160Leu Val Glu Ser Tyr Thr Glu Ser Val Arg Gly Leu Ile Asp Gly Gly165 170 175Ala Asp Ile Leu Leu Val Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys180 185 190Ala Ala Leu Phe Ala Ile Asp Gln Tyr Phe Glu Thr His Gly Leu Arg195 200 205Leu Pro Val Met Ile Ser Val Thr Ile Thr Asp Ala Ser Gly Arg Asn210 215 220Leu Ser Gly Gln Thr Pro Glu Ala Phe Trp Asn Ser Val Arg His Ala225 230 235 240Arg Pro Leu Ser Val Gly Ile Asn Cys Ala Leu Gly Ala Glu Leu Met245 250 255Arg Pro Tyr Val Glu Glu Leu Ser Asn Val Ala Glu Val Phe Thr Ser260 265 270Ala His Pro Asn Ala Gly Leu Pro Asn Pro Leu Ala Glu Thr Gly Tyr275 280 285Asp Glu Thr Pro Glu Tyr Thr Ala Arg Leu Ile Lys Asp Phe Ala Gln290 295 300Ser Gly Phe Val Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Lys305 310 315 320His Ile Ala Ala Ile Ala Glu Ala Val Arg Asp Ile Pro Pro Arg Pro325 330 335Leu Pro Asp Ile Pro Lys Lys Leu Arg Leu Ser Gly Leu Glu Pro Leu340 345 350Asn Ile Asp Glu His Ser Leu Phe Val Asn Val Gly Glu Arg Thr Asn
355 360 365Val Thr Gly Ser Lys Ala Phe Ala Arg Leu Ile Leu Asn Gly Gly Tyr370 375 380Ala Glu Gly Leu Val Ile Ala Arg Ser Gln Val Glu Asn Gly Ala Gln385 390 395 400Ile Ile Asp Ile Asn Met Asp Glu Ala Met Leu Asp Ser Gln Lys Ala405 410 415Met Val Thr Phe Leu Asn Leu Leu Ala Ala Glu Pro Asp Ile Ser Arg420 425 430Leu Pro Ile Met Leu Asp Ser Ser Lys Trp Ser Val Ile Glu Ala Gly435 440 445Leu Lys Cys Val Gln Gly Lys Ala Val Ile Asn Ser Ile Ser Leu Lys450 455 460Glu Gly Glu Ala Glu Phe Leu His His Ala Arg Leu Ala Arg Arg Tyr465 470 475 480Gly Ala Ala Val Ile Val Met Ala Phe Asp Glu Thr Gly Gln Ala Asp485 490 495Thr Leu Gln Arg Lys Val Glu Ile Cys Thr Arg Cys Tyr His Thr Leu500 505 510Ile Glu Gln Ala Asp Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn515 520 525Ile Phe Ala Ile Ala Thr Gly Ile Glu Glu His Ser Asn Tyr Ala Val530 535 540Asp Phe Ile Glu Ala Thr His Val Ile Arg Gln Thr Leu Pro Tyr Ala545 550 555 560Lys Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn565 570 575Glu Pro Ile Arg Glu Ala Ile His Thr Ala Phe Leu Tyr His Ala Val580 585 590Lys Ala Gly Met Thr Met Gly Ile Val Asn Ala Gly Gln Leu Gly Val595 600 605Tyr Ser Asp Ile Pro Pro Asp Leu Leu Glu His Val Glu Asp Val Leu610 615 620Leu Asn Arg Arg Pro Asp Ala Thr Glu Arg Leu Val Glu Phe Ala Glu625 630 635 640His Phe Lys Gly Gln Lys Lys Glu Gln Ile Glu Asp Leu Ser Trp Arg645 650 655Asp Glu Pro Val Arg Gln Arg Leu Ile His Ala Leu Val Arg Gly Ile660 665 670Ser Thr Tyr Ile Val Glu Asp Thr Glu Leu Val Arg Gln Glu Ile Asp675 680 685
Ser Gln Gly Gly Lys Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp690 695 700Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu705 710 715 720Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala Tyr725 730 735Leu Leu Pro Tyr Ile Glu Ala Glu Lys Lys Ile Ser Gly Asp Ser Lys740 745 750Pro Lys Gly Lys Val Val Ile Ala Thr Val Lys Gly Asp Val His Asp755 760 765Ile Gly Lys Asn Ile Val Ser Val Val Leu Gln Cys Asn Asn Phe Glu770 775 780Val Ile Asn Met Gly Val Met Val Pro Ser Ala Gln Ile Leu Glu Thr785 790 795 800Ala Arg Arg Glu Gln Val Asp Met Ile Gly Leu Ser Gly Leu Ile Thr805 810 815Pro Ser Leu Glu Glu Met Ala His Val Ala Arg Glu Met Glu Arg Glu820 825 830Gln Phe Thr Val Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Met835 840 845His Thr Ala Val Lys Ile Ala Pro His Tyr Gly Gly Val Thr Val Trp850 855 860Val Pro Asp Ala Ser Arg Ala Val Gly Val Cys Ser Asn Leu Met Ser865 870 875 880Gln Asp Leu Arg Asp Asp Tyr Val Arg Gln Val Lys Ala Glu Gln Glu885 890 895Lys Ser Arg Val Gln His Arg Asn Lys Lys Gly Pro Ser Lys Leu Leu900 905 910Thr Phe Glu Glu Ala Arg Ala Asn Ala Leu Lys Thr Asp Trp Ala Arg915 920 925Tyr Thr Pro Pro Ala Pro Asp Phe Leu Gly Leu Arg Thr Leu Asn Asn930 935 940Tyr Pro Leu Glu Thr Leu Val Pro His Ile Asp Trp Thr Pro Phe Phe945 950 955 960Gln Ala Trp Glu Leu His Gly Arg Tyr Pro Ala Ile Leu Gln Asp Glu965 970 975Leu Val Gly Glu Ala Ala Ser Asn Leu Phe Arg Asp Ala Gln Asn Met980 985 990Leu Arg Lys Ile Val Glu Gln Lys Trp Leu Thr Ala Asn Ala Val Ile99510001005Gly Leu Phe Pro Ala Asn Thr Val Asn Gly Asp Asp Ile Glu Ile Tyr101010151020
Ala Asp Arg Ser Arg Ser Gln Val Ile Met Thr Trp His Thr Leu Arg1025 103010351040Gln Gln Thr Ala Lys Pro Ala Gly Arg Pro Asn Leu Ala Leu Ala Asp104510501055Phe Ile Ala Pro Arg Glu Thr Gly Leu Asp Asp Thr Ile Gly Leu Phe106010651070Ala Val Ser Ala Gly Phe Gly Ile Asp Glu Arg Ile Arg Ala Phe Glu107510801085Ala Ala Asn Asp Asp Tyr Ser Ala Ile Ile Leu Lys Ala Leu Ala Asp109010951100Arg Leu Ala Glu Ala Phe Ala Glu His Met His Ala Arg Val Arg Arg1105 111011151120Glu Phe Trp Gly Tyr Val Lys Asp Glu Ser Leu Asp Asn Glu Gln Leu112511301135Ile Asp Glu Gln Tyr Leu Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala114011451150Cys Pro Asp His Thr Glu Lys Gly Pro Leu Phe Ala Leu Leu Glu Ala115511601165Glu Lys Arg Ser Gly Ile Val Ile Thr Glu Ser Phe Ala Met Val Pro117011751180Thr Ala Ala Val Ser Gly Phe Tyr Leu Ser Tyr Pro Glu Ser Ser Tyr1185 119011951200Phe Ala Val Gly Lys Ile Gly Lys Asp Gln Val Glu Asp Tyr Ala Arg120512101215Arg Lys Gly Trp Thr Leu Glu Glu Ala Glu Arg Trp Leu Ala Pro Val122012251230Leu Ala Tyr Glu Arg1235210312113774212DNA213百日咳博德特氏菌(Bordetella pertussis)220
<221> CDS
<222> (1)..(3771)
<223> RBP00104
<220>
<221> unsure
<222> 205..205
<223> All n denote any nucleotide
<220>
<221> unsure
222277..277223所有的n表示任何核苷酸40031gtg cct tat ccc cgt atc ccc ttc ccg ctg tcc gcc tac acg cat ggc48Val Pro Tyr Pro Arg Ile Pro Phe Pro Leu Ser Ala Tyr Thr His Gly1 5 10 15ggc gag ttc gtc cgc caa ctg gac aag cgc atc ctg atc ctg gat ggt96Gly Glu Phe Val Arg Gln Leu Asp Lys Arg Ile Leu Ile Leu Asp Gly20 25 30gcc atg ggc acg atg atc cag cgc tac aag ctg ggc gag gcc gat ttc144Ala Met Gly Thr Met Ile Gln Arg Tyr Lys Leu Gly Glu Ala Asp Phe35 40 45cgt ggc gag cgc ttc gcc gag cac cac aag gat ctc aag ggc gac aac192Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn50 55 60gaa ctg ctg tcg ntg gtg cgc ccg gac gtg atc gcg gaa atc cac cgg240Glu Leu Leu Ser Xaa Val Arg Pro Asp Val Ile Ala Glu Ile His Arg65 70 75 80cag tac ctc gag gcc ggc gcc gac gtg atc gag acc nac acc ttc ggc288Gln Tyr Leu Glu Ala Gly Ala Asp Val Ile Glu Thr Xaa Thr Phe Gly85 90 95gcc acg tcg atc gcc cag ggc gat tac gac ctg ccg gag ctg gcc tac336Ala Thr Ser Ile Ala Gln Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr100 105 110gag atg aac ctg gag tcg gcc cgc ctg gcg cgc gcc gcc tgc gac gcc384Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala115 120 125tac agc acg ccc gag cat ccg cgc ttc gtg gcc ggg gcg ctg ggg ccg432Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro130 135 140cag ccc aag acc gcg tcc atc tcg ccc gac gtc aac gac ccg ggg gcg480Gln Pro Lys Thr Ala Ser Ile Ser Pro Asp Val Asn Asp Pro Gly Ala145 150 155 160cgc aac gtc acc ttc gac gag ctg cgc gcg gcc tat gtc gag cag ctc528Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gln Leu165 170 175aat ggc ctg ctc gac ggc ggc atc gac atc gtc ctg atc gaa acc atc576Asn Gly Leu Leu Asp Gly Gly Ile Asp Ile Val Leu Ile Glu Thr Ile180 185 190ttc gat acg ctc aac gcc aag gcg gcc atc ttc gcc gtc gag gaa gcg624Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Val Glu Glu Ala195 200 205ttc gag gcg cgc ggc gtg cgc ctg ccg gtg atg att tcg ggc acc gtg672Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met Ile Ser Gly Thr Val210 215 220acc gat gcg tcg ggc 
cgc atc ctg tcc ggc cag acc gtc gag gcg ttc720Thr Asp Ala Ser Gly Arg Ile Leu Ser Gly Gln Thr Val Glu Ala Phe
225 230 235 240tgg aac tcg gtg cgc cat gcg cgg ccg gtc acc atc ggc ctg aac tgc768Trp Asn Ser Val Arg His Ala Arg Pro Val Thr Ile Gly Leu Asn Cys245 250 255gcg ctg ggc gcg gcg ctg atg cgt ccg tat gtg gcc gag ctg tcc aag816Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lys260 265 270atc tgc gac acc tat gtg tgc gtc tat ccc aac gcc ggc ctg ccc aat864Ile Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn275 280 285ccc atg gcc gag acg ggc ttt gac gaa acg ccg gcc gat acc tcg gcc912Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala290 295 300ctg ctg gaa gag ttc gcc cag gcc ggg ctg gtc aac atg gcc ggc ggc960Leu Leu Glu Glu Phe Ala Gln Ala Gly Leu Val Asn Met Ala Gly Gly305 310 315 320tgt tgc ggc acc acg ccc gag cac atc cgc gcc atc gcc ggc aag gtg1008Cys Cys Gly Thr Thr Pro Glu His Ile Arg Ala Ile Ala Gly Lys Val325 330 335gcc gcg ctg acg ccg cgc gcg gtg ccc gag gtg ccg gtc aag acc cgc1056Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg340 345 350ctg tcg ggc ctg gag gcg ctc aac atc gac gac gag act ctg ttc gtc1104Leu Ser Gly Leu Glu Ala Leu Asn Ile Asp Asp Glu Thr Leu Phe Val355 360 365aac gtg ggc gag cgc acc aac gtg acg ggc agc aag atg ttc gcc cgc1152Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lys Met Phe Ala Arg370 375 380ctg gtc cgc gag gag aaa tac gac gag gcg ctg gcc gtg gcg cgc cag1200Leu Val Arg Glu Glu Lys Tyr Asp Glu Ala Leu Ala Val Ala Arg Gln385 390 395 400cag gtc gag aac ggg gcc cag atc atc gac gtc aac atg gac gag gcg1248Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Val Asn Met Asp Glu Ala405 410 415atg ctg gac tcg gtg gcc tgt atg cac cgc ttc ctc aac ctg atc gcg1296Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu Ile Ala420 425 430tcc gag ccc gac atc gcg cgg gtg ccg gtg atg atc gac agt tcc aag1344Ser Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys435 440 445tgg gaa gtg atc gag acc ggc ctg aag tgc gtg cag ggc aag gcc gtg1392Trp Glu Val Ile Glu Thr Gly Leu Lys Cys Val Gln Gly Lys Ala Val450 455 460gtc aac 
tcg atc tcc atg aag gaa ggc gag gag ccg ttc cgc cat cat1440Val Asn Ser Ile Ser Met Lys Glu Gly Glu Glu Pro Phe Arg His His465 470 475 480
gcg cgc ctg tgc cgc cgc tac ggc gcg gcc atg gtg gtc atg gcc ttc1488Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe485 490 495gac gaa cag ggg cag gcc gac tcg ctg gag cgc cgc aag gaa atc tgc1536Asp Glu Gln Gly Gln Ala Asp Ser Leu Glu Arg Arg Lys Glu Ile Cys500 505 510ggc cgc gcc tac cgt atc ctg gtc gag gaa gag ggc ttc ccg ccc gag1584Gly Arg Ala Tyr Arg Ile Leu Val Glu Glu Glu Gly Phe Pro Pro Glu515 520 525gac atc atc ttc gat ccc aac gtg ttc gcg gtg gcc acc ggc atc gac1632Asp Ile Ile Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly Ile Asp530 535 540gaa cac aat cac tac gcc gtc gat ttc atc gaa ggc gcg cgc tgg atc1680Glu His Asn His Tyr Ala Val Asp Phe Ile Glu Gly Ala Arg Trp Ile545 550 555 560cgc gcg aac ctg ccg cat gcc cgc att tcg ggc ggc atc tcg aac gtc1728Arg Ala Asn Leu Pro His Ala Arg Ile Ser Gly Gly Ile Ser Asn Val565 570 575agc ttc tcg ttc cgc ggc aac gag ccg atg cgc gag gcg atc cat acc1776Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala Ile His Thr580 585 590gtc ttc ctg tac tac gcc atc gag gcc ggc ctg acg atg ggc atc gtc1824Val Phe Leu Tyr Tyr Ala Ile Glu Ala Gly Leu Thr Met Gly Ile Val595 600 605aac gcg ggc cag ctg ggc gta tat gcc gac ctg gcg ccg cac ctg cgc1872Asn Ala Gly Gln Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg610 615 620gac ctg gtc gag gac gtc atc ctg gac cgc ccc gag ccg gtg ggc cgc1920Asp Leu Val Glu Asp Val Ile Leu Asp Arg Pro Glu Pro Val Gly Arg625 630 635 640agc gac tcg gcc gac gag cgc tcg ccc acc gaa cgg ctg gtg cag ttt1968Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gln Phe645 650 655gcc gag acc gtc aag ggc tcg ggc gcg aag aag gaa gaa gac ctg acc2016Ala Glu Thr Val Lys Gly Ser Gly Ala Lys Lys Glu Glu Asp Leu Thr660 665 670tgg cgc acc ggc tcg gtc gag cag cgc ctg gcg cat gcc ctg gtg cac2064Trp Arg Thr Gly Ser Val Glu Gln Arg Leu Ala His Ala Leu Val His675 680 685ggc atc acc acc ttc atc gtc gag gac acc gag gaa gtg cgc cag cag2112Gly Ile Thr Thr Phe Ile Val Glu Asp Thr Glu Glu Val Arg Gln Gln690 695 700gtc gcc gcg cgc 
ggc ggg cgc acc atc gaa gtg atc gaa ggt ccg ctg2160Val Ala Ala Arg Gly Gly Arg Thr Ile Glu Val Ile Glu Gly Pro Leu705 710 715 720atg gac ggc atg aac gtg gtc ggc gac ctg ttc ggc gcg ggc aag atg2208Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met
725 730 735ttc ctg ccg caa gtg gtg aag tcg gcg cgc gtg atg aag cag gcg gtg2256Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val740 745 750gcg cac ctg att ccc ttc atc gag gag gaa aag cgc cag atc gcg gcc2304Ala His Leu Ile Pro Phe Ile Glu Glu Glu Lys Arg Gln Ile Ala Ala755 760 765gcg ggc ggc gat gtg cgc gcc aag ggc aag atc gtg atc gcc acc gtc2352Ala Gly Gly Asp Val Arg Ala Lys Gly Lys Ile Val Ile Ala Thr Val770 775 780aag ggc gac gtg cac gac atc ggc aag aac atc gtg tcg gtg gtc ttg2400Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser Val Val Leu785 790 795 800cag tgc aat aac ttc gaa gtc gtg aac atg ggc gtg atg gtg ccg tgc2448Gln Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys805 810 815gcc cag atc ctg cag aag gcc aag gac gag aac gcc gac atg atc ggc2496Ala Gln Ile Leu Gln Lys Ala Lys Asp Glu Asn Ala Asp Met Ile Gly820 825 830ctg tcc ggc ctg atc acg ccc agc ctc gaa gag atg gcc tac gtg gct2544Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala835 840 845tca gaa atg cag cgc gac ccc tat ttc cgc gag cgc gcc atg ccg ctg2592Ser Glu Met Gln Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu850 855 860atg ata ggc ggg gcg acc acc agc cgg gtc cat acg gcg gtc aag atc2640Met Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys Ile865 870 875 880gcg ccc aac tac gac ggt ccg gtg atc tac gtg ccc gat gcc agc cgt2688Ala Pro Asn Tyr Asp Gly Pro Val Ile Tyr Val Pro Asp Ala Ser Arg885 890 895tcg gtc ggc gtg gcg acc agc ctc atg tcc gac cag gcc ccg gcc tat2736Ser Val Gly Val Ala Thr Ser Leu Met Ser Asp Gln Ala Pro Ala Tyr900 905 910ttg gcg gag ctg gcg cag gag tac gag gat gtg cgc cgc tgc cat gcc2784Leu Ala Glu Leu Ala Gln Glu Tyr Glu Asp Val Arg Arg Cys His Ala915 920 925aac cgc aag gcg gtg ccg ctg gtg tcg ctg gcc gag gcg cgc gcg gcg2832Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu Ala Arg Ala Ala930 935 940cgc ccg cag atc gac tgg tcc ggc tac cag ccg ccg cgc ccc aag ttc2880Arg Pro Gln Ile Asp Trp Ser Gly Tyr Gln Pro Pro Arg Pro Lys Phe945 950 955 
960ctg ggc cgg cgc gcc ttc aag agc tac gac ctg gcc gag atc gcg cgc2928Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu Ile Ala Arg965 970 975
tat atc gac tgg ggg ccg ttc ttc cag acg tgg agc ctg ttc ggc ccg2976Tyr Ile Asp Trp Gly Pro Phe Phe Gln Thr Trp Ser Leu Phe Gly Pro980 985 990ttc ccc gcc atc ctg gac gac aag gtg gtg ggc gag cag gcg cgc aag3024Phe Pro Ala Ile Leu Asp Asp Lys Val Val Gly Glu Gln Ala Arg Lys99510001005gtc tac gag gaa ggc cag gcc atg ctc aag cgc atc atc gac ggg cgc3072Val Tyr Glu Glu Gly Gln Ala Met Leu Lys Arg Ile Ile Asp Gly Arg101010151020tgg ctg acc gcc agc ggc gtg gtc ggc ttc tat ccg gcc aac cgc gtc3120Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val1025103010351040aat gac gaa gac atc gag gtc tac gcg gac gag acg cgc agc gag atg3168Asn Asp Glu Asp Ile Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met104510501055ctg ttc acc tac cgc aac ctg cgc cag cag ggc gtc aag cgc gaa ggc3216Leu Phe Thr Tyr Arg Asn Leu Arg Gln Gln Gly Val Lys Arg Glu Gly106010651070gtc agc aac aag tgc ctg gcc gac tac atc gcg ccg cgc gac agc ggc3264Val Ser Asn Lys Cys Leu Ala Asp Tyr Ile Ala Pro Arg Asp Ser Gly107510801085ctg ctc gac tac atc ggc atg ttc gcc gtg acc gcg ggc ctg ggc atc3312Leu Leu Asp Tyr Ile Gly Met Phe Ala Val Thr Ala Gly Leu Gly Ile109010951100gag aag aaa gag gcc gag ttc cag gcg gcg ctg gac gac tac tcc agc3360Glu Lys Lys Glu Ala Glu Phe Gln Ala Ala Leu Asp Asp Tyr Ser Ser1105111011151120atc atg ctg aag tcg ctg gcc gac cgg ctg gcc gag gcg ttc gcc gaa3408Ile Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu112511301135tgc atg cac gcg cgc gtg cgc cgc gac ctg tgg ggc tac gcg gcg gac3456Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp114011451150gag gcg ctg tcc aac gat gag ctg atc gcc gag aag tac agc ggc atc3504Glu Ala Leu Ser Asn Asp Glu Leu Ile Ala Glu Lys Tyr Ser Gly Ile115511601165cgg ccg gcg ccc ggc tat ccg gcc tgc ccg gag cac gtg gtc aag acg3552Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Val Val Lys Thr117011751180gac ctg ttc cgc gtg ctg gac gcc gcc gac gtc gga atg gag ctg acc3600Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu 
Thr1185119011951200gac agc tac gcc atg ttc ccg gcc tcc agc gtc tcg ggg ttc tat ttc3648Asp Ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe120512101215agc cac ccc gag tcg cag tat ttc aac gtg ggc aac atc ggc gcc gac3696Ser His Pro Glu Ser Gln Tyr Phe Asn Val Gly Asn Ile Gly Ala Asp
1220 1225 1230
cag ctg gcc gac tac gtg gcg cgc agc ggc cgc gcc gaa gag gac gtg  3744
Gln Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val
1235 1240 1245
cgc cgc acc ctg gcg ccg aac ctg ggc tag  3774
Arg Arg Thr Leu Ala Pro Asn Leu Gly
1250 1255
<210> 32
<211> 1257
<212> PRT
<213> Bordetella pertussis
<220>
<221> unsure
<222> 69..69
<223> All Xaa denote any amino acid
<220>
221unsure22293..93223所有的Xaa表示任何胺基酸40032Val Pro Tyr Pro Arg Ile Pro Phe Pro Leu Ser Ala Tyr Thr His Gly1 5 10 15Gly Glu Phe Val Arg Gln Leu Asp Lys Arg Ile Leu Ile Leu Asp Gly20 25 30Ala Met Gly Thr Met Ile Gln Arg Tyr Lys Leu Gly Glu Ala Asp Phe35 40 45Arg Gly Glu Arg Phe Ala Glu His His Lys Asp Leu Lys Gly Asp Asn50 55 60Glu Leu Leu Ser Xaa Val Arg Pro Asp Val Ile Ala Glu Ile His Arg65 70 75 80Gln Tyr Leu Glu Ala Gly Ala Asp Val Ile Glu Thr Xaa Thr Phe Gly85 90 95Ala Thr Ser Ile Ala Gln Gly Asp Tyr Asp Leu Pro Glu Leu Ala Tyr100 105 110Glu Met Asn Leu Glu Ser Ala Arg Leu Ala Arg Ala Ala Cys Asp Ala115 120 125Tyr Ser Thr Pro Glu His Pro Arg Phe Val Ala Gly Ala Leu Gly Pro130 135 140Gln Pro Lys Thr Ala Ser Ile Ser Pro Asp Val Asn Asp Pro Gly Ala145 150 155 160Arg Asn Val Thr Phe Asp Glu Leu Arg Ala Ala Tyr Val Glu Gln Leu165 170 175Asn Gly Leu Leu Asp Gly Gly Ile Asp Ile Val Leu Ile Glu Thr Ile
180 185 190Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Val Glu Glu Ala195 200 205Phe Glu Ala Arg Gly Val Arg Leu Pro Val Met Ile Ser Gly Thr Val210 215 220Thr Asp Ala Ser Gly Arg Ile Leu Ser Gly Gln Thr Val Glu Ala Phe225 230 235 240Trp Asn Ser Val Arg His Ala Arg Pro Val Thr Ile Gly Leu Asn Cys245 250 255Ala Leu Gly Ala Ala Leu Met Arg Pro Tyr Val Ala Glu Leu Ser Lys260 265 270Ile Cys Asp Thr Tyr Val Cys Val Tyr Pro Asn Ala Gly Leu Pro Asn275 280 285Pro Met Ala Glu Thr Gly Phe Asp Glu Thr Pro Ala Asp Thr Ser Ala290 295 300Leu Leu Glu Glu Phe Ala Gln Ala Gly Leu Val Asn Met Ala Gly Gly305 310 315 320Cys Cys Gly Thr Thr Pro Glu His Ile Arg Ala Ile Ala Gly Lys Val325 330 335Ala Ala Leu Thr Pro Arg Ala Val Pro Glu Val Pro Val Lys Thr Arg340 345 350Leu Ser Gly Leu Glu Ala Leu Asn Ile Asp Asp Glu Thr Leu Phe Val355 360 365Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Lys Met Phe Ala Arg370 375 380Leu Val Arg Glu Glu Lys Tyr Asp Glu Ala Leu Ala Val Ala Arg Gln385 390 395 400Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Val Asn Met Asp Glu Ala405 410 415Met Leu Asp Ser Val Ala Cys Met His Arg Phe Leu Asn Leu Ile Ala420 425 430Ser Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys435 440 445Trp Glu Val Ile Glu Thr Gly Leu Lys Cys Val Gln Gly Lys Ala Val450 455 460Val Asn Ser Ile Ser Met Lys Glu Gly Glu Glu Pro Phe Arg His His465 470 475 480Ala Arg Leu Cys Arg Arg Tyr Gly Ala Ala Met Val Val Met Ala Phe485 490 495Asp Glu Gln Gly Gln Ala Asp Ser Leu Glu Arg Arg Lys Glu Ile Cys500 505 510
Gly Arg Ala Tyr Arg Ile Leu Val Glu Glu Glu Gly Phe Pro Pro Glu515 520 525Asp Ile Ile Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly Ile Asp530 535 540Glu His Asn His Tyr Ala Val Asp Phe Ile Glu Gly Ala Arg Trp Ile545 550 555 560Arg Ala Asn Leu Pro His Ala Arg Ile Ser Gly Gly Ile Ser Asn Val565 570 575Ser Phe Ser Phe Arg Gly Asn Glu Pro Met Arg Glu Ala Ile His Thr580 585 590Val Phe Leu Tyr Tyr Ala Ile Glu Ala Gly Leu Thr Met Gly Ile Val595 600 605Asn Ala Gly Gln Leu Gly Val Tyr Ala Asp Leu Ala Pro His Leu Arg610 615 620Asp Leu Val Glu Asp Val Ile Leu Asp Arg Pro Glu Pro Val Gly Arg625 630 635 640Ser Asp Ser Ala Asp Glu Arg Ser Pro Thr Glu Arg Leu Val Gln Phe645 650 655Ala Glu Thr Val Lys Gly Ser Gly Ala Lys Lys Glu Glu Asp Leu Thr660 665 670Trp Arg Thr Gly Ser Val Glu Gln Arg Leu Ala His Ala Leu Val His675 680 685Gly Ile Thr Thr Phe Ile Val Glu Asp Thr Glu Glu Val Arg Gln Gln690 695 700Val Ala Ala Arg Gly Gly Arg Thr Ile Glu Val Ile Glu Gly Pro Leu705 710 715 720Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly Lys Met725 730 735Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val740 745 750Ala His Leu Ile Pro Phe Ile Glu Glu Glu Lys Arg Gln Ile Ala Ala755 760 765Ala Gly Gly Asp Val Arg Ala Lys Gly Lys Ile Val Ile Ala Thr Val770 775 780Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser Val Val Leu785 790 795 800Gln Cys Asn Asn Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys805 810 815Ala Gln Ile Leu Gln Lys Ala Lys Asp Glu Asn Ala Asp Met Ile Gly820 825 830Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala835 840 845
Ser Glu Met Gln Arg Asp Pro Tyr Phe Arg Glu Arg Ala Met Pro Leu850 855 860Met Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys Ile865 870 875 880Ala Pro Asn Tyr Asp Gly Pro Val Ile Tyr Val Pro Asp Ala Ser Arg885 890 895Ser Val Gly Val Ala Thr Ser Leu Met Ser Asp Gln Ala Pro Ala Tyr900 905 910Leu Ala Glu Leu Ala Gln Glu Tyr Glu Asp Val Arg Arg Cys His Ala915 920 925Asn Arg Lys Ala Val Pro Leu Val Ser Leu Ala Glu Ala Arg Ala Ala930 935 940Arg Pro Gln Ile Asp Trp Ser Gly Tyr Gln Pro Pro Arg Pro Lys Phe945 950 955 960Leu Gly Arg Arg Ala Phe Lys Ser Tyr Asp Leu Ala Glu Ile Ala Arg965 970 975Tyr Ile Asp Trp Gly Pro Phe Phe Gln Thr Trp Ser Leu Phe Gly Pro980 985 990Phe Pro Ala Ile Leu Asp Asp Lys Val Val Gly Glu Gln Ala Arg Lys99510001005Val Tyr Glu Glu Gly Gln Ala Met Leu Lys Arg Ile Ile Asp Gly Arg101010151020Trp Leu Thr Ala Ser Gly Val Val Gly Phe Tyr Pro Ala Asn Arg Val1025 1030 10351040Asn Asp Glu Asp Ile Glu Val Tyr Ala Asp Glu Thr Arg Ser Glu Met104510501055Leu Phe Thr Tyr Arg Asn Leu Arg Gln Gln Gly Val Lys Arg Glu Gly106010651070Val Ser Asn Lys Cys Leu Ala Asp Tyr Ile Ala Pro Arg Asp Ser Gly107510801085Leu Leu Asp Tyr Ile Gly Met Phe Ala Val Thr Ala Gly Leu Gly Ile109010951100Glu Lys Lys Glu Ala Glu Phe Gln Ala Ala Leu Asp Asp Tyr Ser Ser1105 111011151120Ile Met Leu Lys Ser Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu112511301135Cys Met His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Ala Asp114011451150Glu Ala Leu Ser Asn Asp Glu Leu Ile Ala Glu Lys Tyr Ser Gly Ile115511601165Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Val Val Lys Thr
1170 1175 1180 Asp Leu Phe Arg Val Leu Asp Ala Ala Asp Val Gly Met Glu Leu Thr 1185 1190 1195 1200 Asp Ser Tyr Ala Met Phe Pro Ala Ser Ser Val Ser Gly Phe Tyr Phe 1205 1210 1215 Ser His Pro Glu Ser Gln Tyr Phe Asn Val Gly Asn Ile Gly Ala Asp 1220 1225 1230 Gln Leu Ala Asp Tyr Val Ala Arg Ser Gly Arg Ala Glu Glu Asp Val 1235 1240 1245 Arg Arg Thr Leu Ala Pro Asn Leu Gly 1250 1255
<210> 33
<211> 3645
<212> DNA
<213> Chlorobium tepidum
<220>
<221> CDS
<222> (1)..(3642)
<223> RCL00420
<400> 33
gtg ctc gac ggg gcc atg ggc acc atg atc cag agg cat ggc ctc gac 48 Val Leu Asp Gly Ala Met Gly Thr Met Ile Gln Arg His Gly Leu Asp 1 5 10 15 gaa cag gac tac cgg ggc gag cgt ttc gct tcg cat gac cat ccg ctg 96 Glu Gln Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 20 25 30 aag ggc aac aac gac ctt ctt gtc atc acc cgg ccc gac atc atc cgt 144 Lys Gly Asn Asn Asp Leu Leu Val Ile Thr Arg Pro Asp Ile Ile Arg 35 40 45 tcg atc cac tgc gac ttc ctc gac gcg ggt gcg gac atc atc gag acc 192 Ser Ile His Cys Asp Phe Leu Asp Ala Gly Ala Asp Ile Ile Glu Thr 50 55 60 tgc acc ttc aac gcc aac ccg atc tcg cag tcg gac tac cag ttg cag 240 Cys Thr Phe Asn Ala Asn Pro Ile Ser Gln Ser Asp Tyr Gln Leu Gln 65 70 75 80 gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gcc cgc tcg 288 Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys Ile Ala Arg Ser 85 90 95 gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg 336 Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 100 105 110 gcc ggt tcc atc gga ccg acc aac aag acg ctc tcg ctc tcg ccg gac 384 Ala Gly Ser Ile Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 115 120 125 gtg aac aac ccc ggc ttc cgc gcc gtc acc ttc cag gag atg gtc gat 432
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gln Glu Met Val Asp130 135 140aac tac act gcc cag ctc gaa ggc ttg cac gag ggc ggt gtc gat ctc480Asn Tyr Thr Ala Gln Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu145 150 155 160ttg ctc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg ctc528Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu165 170 175tac gct atc gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg576Tyr Ala Ile Glu Glu Tyr Ala Val Lys Thr Gly Trp Gln Val Pro Val180 185 190atg gtc tcc ggc acg gtg gtg gac gcg agc ggc cgc acc ctc tcc ggc624Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly195 200 205caa acc acc gag gcg ttc tgg att tcg att tcg cac atg ccg agt ctg672Gln Thr Thr Glu Ala Phe Trp Ile Ser Ile Ser His Met Pro Ser Leu210 215 220ctc tcg gtc ggc ctg aac tgc gca ctc ggc tcc aag cag atg cgc ccc720Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gln Met Arg Pro225 230 235 240ttc atc gag gcg ctc tcg aac atc gcc gaa agc tac gtc agc gtc tat768Phe Ile Glu Ala Leu Ser Asn Ile Ala Glu Ser Tyr Val Ser Val Tyr245 250 255ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac gac gac tcc ccc816Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro260 265 270gag tac atg gcc gcg cag atc gcg ggc ttc gcc gaa tca ggc ttc gtg864Glu Tyr Met Ala Ala Gln Ile Ala Gly Phe Ala Glu Ser Gly Phe Val275 280 285aac atc gtc ggc ggc tgc tgc ggc acc acg ccg acg cac atc cgc gcc912Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His Ile Arg Ala290 295 300att gcc gaa gcg gtc aag act ctc ccg ccg aga aag cgc ccc gcc aac960Ile Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn305 310 315 320aag cac gtg ctg agg ctc tcc ggc ctc gaa ccg ctc gtg gtt gac gaa1008Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu325 330 335acc acc ggc ttc atc aac gtc ggc gag cgc acc aac gtc acc ggt tcg1056Thr Thr Gly Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser340 345 350cgc aag ttc gcc cgc ctc atc aag gag gcc aat tac gac gaa gcg ctc1104Arg Lys Phe Ala Arg Leu 
Ile Lys Glu Ala Asn Tyr Asp Glu Ala Leu355 360 365tcc att gcc cgc cag cag gtc gag aac ggc gcg cag gtg atc gac gtg1152Ser Ile Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile Asp Val370 375 380
aac ctc gac gaa gga atg ctc gac tcc gaa aag gtg atc gtc gaa ttc1200Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val Ile Val Glu Phe385 390 395 400ctg aac ctc atc gcc tcc gag cct gag atc gcc aag gtg ccg gtg atg1248Leu Asn Leu Ile Ala Ser Glu Pro Glu Ile Ala Lys Val Pro Val Met405 410 415atc gac tcg tcg aaa tgg tcg gtc atc gaa aac ggc ctg cgc tgc acc1296Ile Asp Ser Ser Lys Trp Ser Val Ile Glu Asn Gly Leu Arg Cys Thr420 425 430cag ggc aag agc atc gtc aac tcg atc agc ctc aag gag ggc gag gag1344Gln Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu435 440 445ctg ttc aag gag cgc gct cgc aag atc atg caa tac ggc gcg gcg gcg1392Leu Phe Lys Glu Arg Ala Arg Lys Ile Met Gln Tyr Gly Ala Ala Ala450 455 460gtg gtc atg gcc ttc gac gag cag ggc cag gcc gac agc ctg cac cgc1440Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Ser Leu His Arg465 470 475 480cgc atc gag att tgc agc cgc gcc tac aaa att ctc acc gaa gag gtg1488Arg Ile Glu Ile Cys Ser Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val485 490 495ggc ttc ccg ccg gag gac atc atc ttt gac ccg aac gtg ctg acc gtg1536Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val500 505 510gcc acc ggc atc gac gag cac aac aac tac gcg ctc gac ttc atc gaa1584Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile Glu515 520 525agc gtg cgc tgg atc aag cag aac ctg ccg cac gcg aag gtc tcc ggc1632Ser Val Arg Trp Ile Lys Gln Asn Leu Pro His Ala Lys Val Ser Gly530 535 540ggc atc agc aac gtt tcg ttc tcc ttc cgc ggc aac gag ccg gtg cgc1680Gly Ile Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg545 550 555 560gag gcg atg cac acc gcg ttc ctc tac cac gcc atc cac gcc ggt ctc1728Glu Ala Met His Thr Ala Phe Leu Tyr His Ala Ile His Ala Gly Leu565 570 575gac atg ggc atc gtc aac gcc gcc cag ctt ggc atc tac gaa gag atc1776Asp Met Gly Ile Val Asn Ala Ala Gln Leu Gly Ile Tyr Glu Glu Ile580 585 590gac ccg gag ctt ctt gtc tat gtc gag gac gtg ctg ctg aac cgc cgc1824Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg595 600 605gac gac gcc 
acc gag cgg ctc gtg gcg ttc gct gaa acg atc cgc gac1872Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr Ile Arg Asp610 615 620ggc ggc gaa aag gcc gag gcc aag aac gcc gaa tgg cgc aac gcc ccg1920
Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro 625 630 635 640 gtc gag gag cgg ctg aaa cac gcg ctc gtc aag ggc atc gtt gac tac 1968 Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly Ile Val Asp Tyr 645 650 655 atc gac gag gac acc gaa gag gcc cgc cag ctc tac ccg agt ccg ctg 2016 Ile Asp Glu Asp Thr Glu Glu Ala Arg Gln Leu Tyr Pro Ser Pro Leu 660 665 670 gag gtg atc gag ggg ccg ctc atg aac ggc atg aac cac gtc ggc gac 2064 Glu Val Ile Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp 675 680 685 ctc ttc gcc gaa ggc aag atg ttc ctg cca cag gtg gtc aaa agc gcc 2112 Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala 690 695 700 cgc gtc atg aag cgc tcg gta gct gcg ctg att ccc tat atc gag gag 2160 Arg Val Met Lys Arg Ser Val Ala Ala Leu Ile Pro Tyr Ile Glu Glu 705 710 715 720 gag aag tcg aaa aac tgc gac acg agc gcc aaa gcc aag gtg ctg ctc 2208 Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu 725 730 735 gcc acg gtg aag ggc gac gtg cac gac atc ggc aag aac atc gtg tcg 2256 Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser 740 745 750 gtg gtg ctt gcc tgc aac aac ttc gac gtg atc gac atc ggc gtc atg 2304 Val Val Leu Ala Cys Asn Asn Phe Asp Val Ile Asp Ile Gly Val Met 755 760 765 atg cca tgc gac aag att ctc gaa gcg ctg gca gaa cac aag ccc gac 2352 Met Pro Cys Asp Lys Ile Leu Glu Ala Leu Ala Glu His Lys Pro Asp 770 775 780 gtg ctc ggc ctc tcc ggc ctc atc acc ccg tcg ctc gaa gag atg gcg 2400 Val Leu Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala 785 790 795 800 cac gtg gcc aaa gag atg gag cgg ctc ggc atg aac att ccg ctc atc 2448 His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn Ile Pro Leu Ile 805 810 815 atc ggc ggc gcg acc acc tcg aag gtg cac acg gcg gtg aaa ctc gcg 2496 Ile Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala 820 825 830 ccc tgc tac ccc agc ggc gcg gta gta cac gtg ctc gac gcc tcg cgc 2544 Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg 835 840 845 agc gtg ccg gtg gtc agc aac ctc tgc aac ccc gcc cag cgc gac agc 2592 Ser Val Pro
Val Val Ser Asn Leu Cys Asn Pro Ala Gln Arg Asp Ser850 855 860tat atc gcg gcg ctg aag gat gag cag gag gcg atg cgc aag agc cac2640Tyr Ile Ala Ala Leu Lys Asp Glu Gln Glu Ala Met Arg Lys Ser His865 870 875 880
gcc gag cgc atg gcg gca aaa aag tac gtc tcg ctc gac gcc gcc cgc2688Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg885 890 895gac aac cgc ctc acc att gac tgg gag gcc gaa acc atc gac aag ccc2736Asp Asn Arg Leu Thr Ile Asp Trp Glu Ala Glu Thr Ile Asp Lys Pro900 905 910gcc cag act ggc gtc acc gtg ctg gag gat gtc acc gtc ggc gcg ctc2784Ala Gln Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu915 920 925cgc ccg tat atc gac tgg gca mcc ttc ttc tgg agc tgg gag ctg cac2832Arg Pro Tyr Ile Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His930 935 940ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca2880Gly Val Tyr Pro Gln Ile Leu Glu Asp Glu Lys Val Gly Glu Glu Ala945 950 955 960acc aaa ctc ttc aac gac gcc acc gct ctg ctc gac cgg atc gac agc2928Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg Ile Asp Ser965 970 975gaa aag ctg ctc ggc atc aaa ggc gtg gcg ggc atc ttc ccg gcc aac2976Glu Lys Leu Leu Gly Ile Lys Gly Val Ala Gly Ile Phe Pro Ala Asn980 985 990agc atc ggc gac gac atc ttc gtc tat gcg gat gac gag cgc tcg ata3024Ser Ile Gly Asp Asp Ile Phe Val Tyr Ala Asp Asp Glu Arg Ser Ile99510001005atc cgc acc gtg ctg cac acc ctg cgc cag caa ggc gaa aag cac ggc3072Ile Arg Thr Val Leu His Thr Leu Arg Gln Gln Gly Glu Lys His Gly101010151020gaa gcg aac ctc gcg ctg gcg gac ttc gtg gcc ccg cgc gaa agc ggc3120Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly1025103010351040gtc aac gac tgg atc ggc tgc ttc acc gta acc gcc gga ctc ggc atc3168Val Asn Asp Trp Ile Gly Cys Phe Thr Val Thr Ala Gly Leu Gly Ile104510501055cag aat ttg ctc gac gag ttc aca gca gag aac gac gac tac cac cgc3216Gln Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg106010651070atc atg aca cag gcg ctc gcc gac cga ctg gcc gaa gcg ttc gca gag3264Ile Met Thr Gln Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu107510801085atg ctg cac gaa aag gtg cgc cgc gaa ctc tgg ggc tac gcg ccc ggc3312Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly109010951100gaa atc ctc 
ggc aac gaa gag ctg atc gcc gaa aag tac cga ggc atc3360Glu Ile Leu Gly Asn Glu Glu Leu Ile Ala Glu Lys Tyr Arg Gly Ile1105111011151120cgc ccc gcc ccc ggc tac ccc gcc tgc ccg gat cac acc gaa aag gca3408
Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala 1125 1130 1135 atc atc ttc gac ctg ctc aac gct gaa gcg gcc acc ggc gtc acg ctg 3456 Ile Ile Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 1140 1145 1150 acg gaa act ttc gcg atg aac ccc gca gcc tca gtc tgc ggc ctc tac 3504 Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 1155 1160 1165 ttc gcc aac ccg gcc tcg aaa tac ttc gta ctc ggc aag att ggt aag 3552 Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys Ile Gly Lys 1170 1175 1180 gat cag gtc gaa gac tac gcc aac cgc aaa ggg ctg gaa gta gca gaa 3600 Asp Gln Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 1185 1190 1195 1200 gcc gag aag tgg ctc gcg ccc tcg ctg aac tac gat cca gcg 3642 Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 1205 1210 taa 3645
<210> 34
<211> 1214
<212> PRT
<213> Chlorobium tepidum
<220>
<221> unsure
<222> 936..936
<223> All Xaa denote any amino acid
<400> 34
Val Leu Asp Gly Ala Met Gly Thr Met Ile Gln Arg His Gly Leu Asp 1 5 10 15 Glu Gln Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu 20 25 30 Lys Gly Asn Asn Asp Leu Leu Val Ile Thr Arg Pro Asp Ile Ile Arg 35 40 45 Ser Ile His Cys Asp Phe Leu Asp Ala Gly Ala Asp Ile Ile Glu Thr 50 55 60 Cys Thr Phe Asn Ala Asn Pro Ile Ser Gln Ser Asp Tyr Gln Leu Gln 65 70 75 80 Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys Ile Ala Arg Ser 85 90 95 Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val 100 105 110 Ala Gly Ser Ile Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp 115 120 125 Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gln Glu Met Val Asp
130 135 140Asn Tyr Thr Ala Gln Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu145 150 155 160Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu165 170 175Tyr Ala Ile Glu Glu Tyr Ala Val Lys Thr Gly Trp Gln Val Pro Val180 185 190Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly195 200 205Gln Thr Thr Glu Ala Phe Trp Ile Ser Ile Ser His Met Pro Ser Leu210 215 220Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gln Met Arg Pro225 230 235 240Phe Ile Glu Ala Leu Ser Asn Ile Ala Glu Ser Tyr Val Ser Val Tyr245 250 255Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro260 265 270Glu Tyr Met Ala Ala Gln Ile Ala Gly Phe Ala Glu Ser Gly Phe Val275 280 285Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His Ile Arg Ala290 295 300Ile Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn305 310 315 320Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu325 330 335Thr Thr Gly Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser340 345 350Arg Lys Phe Ala Arg Leu Ile Lys Glu Ala Asn Tyr Asp Glu Ala Leu355 360 365Ser Ile Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile Asp Val370 375 380Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val Ile Val Glu Phe385 390 395 400Leu Asn Leu Ile Ala Ser Glu Pro Glu Ile Ala Lys Val Pro Val Met405 410 415Ile Asp Ser Ser Lys Trp Ser Val Ile Glu Asn Gly Leu Arg Cys Thr420 425 430Gln Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu435 440 445Leu Phe Lys Glu Arg Ala Arg Lys Ile Met Gln Tyr Gly Ala Ala Ala450 455 460
Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Ser Leu His Arg465 470 475 480Arg Ile Glu Ile Cys Ser Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val485 490 495Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val500 505 510Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile Glu515 520 525Ser Val Arg Trp Ile Lys Gln Asn Leu Pro His Ala Lys Val Ser Gly530 535 540Gly Ile Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg545 550 555 560Glu Ala Met His Thr Ala Phe Leu Tyr His Ala Ile His Ala Gly Leu565 570 575Asp Met Gly Ile Val Asn Ala Ala Gln Leu Gly Ile Tyr Glu Glu Ile580 585 590Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg595 600 605Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr Ile Arg Asp610 615 620Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro625 630 635 640Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly Ile Val Asp Tyr645 650 655Ile Asp Glu Asp Thr Glu Glu Ala Arg Gln Leu Tyr Pro Ser Pro Leu660 665 670Glu Val Ile Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp675 680 685Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala690 695 700Arg Val Met Lys Arg Ser Val Ala Ala Leu Ile Pro Tyr Ile Glu Glu705 710 715 720Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu725 730 735Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser740 745 750Val Val Leu Ala Cys Asn Asn Phe Asp Val Ile Asp Ile Gly Val Met755 760 765Met Pro Cys Asp Lys Ile Leu Glu Ala Leu Ala Glu His Lys Pro Asp770 775 780Val Leu Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala785 790 795 800
His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn Ile Pro Leu Ile805 810 815Ile Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala820 825 830Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg835 840 845Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gln Arg Asp Ser850 855 860Tyr Ile Ala Ala Leu Lys Asp Glu Gln Glu Ala Met Arg Lys Ser His865 870 875 880Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg885 890 895Asp Asn Arg Leu Thr Ile Asp Trp Glu Ala Glu Thr Ile Asp Lys Pro900 905 910Ala Gln Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu915 920 925Arg Pro Tyr Ile Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His930 935 940Gly Val Tyr Pro Gln Ile Leu Glu Asp Glu Lys Val Gly Glu Glu Ala945 950 955 960Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg Ile Asp Ser965 970 975Glu Lys Leu Leu Gly Ile Lys Gly Val Ala Gly Ile Phe Pro Ala Asn980 985 990Ser Ile Gly Asp Asp Ile Phe Val Tyr Ala Asp Asp Glu Arg Ser Ile99510001005Ile Arg Thr Val Leu His Thr Leu Arg Gln Gln Gly Glu Lys His Gly101010151020Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly1025 103010351040Val Asn Asp Trp Ile Gly Cys Phe Thr Val Thr Ala Gly Leu Gly Ile104510501055Gln Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg106010651070Ile Met Thr Gln Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu107510801085Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly109010951100Glu Ile Leu Gly Asn Glu Glu Leu Ile Ala Glu Lys Tyr Arg Gly Ile1105 111011151120Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala
1125 1130 1135 Ile Ile Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu 1140 1145 1150 Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr 1155 1160 1165 Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys Ile Gly Lys 1170 1175 1180 Asp Gln Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu 1185 1190 1195 1200 Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala 1205 1210
<210> 35
<211> 3777
<212> DNA
<213> Deinococcus radiodurans
<220>
<221> CDS
<222> (1)..(3774)
<223> RDR02645
<400> 35
atg agc cat cac cca gaa gcg tcg gct tcc gcc aat ccg tcc atc aac 48 Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser Ile Asn 1 5 10 15 cat caa ccg tcc acc atc acc gag gcc gcc cgc cag cgc atc ctg att 96 His Gln Pro Ser Thr Ile Thr Glu Ala Ala Arg Gln Arg Ile Leu Ile 20 25 30 ctc gac ggc gcc tgg ggt acg cag ctt cag cga gcc aac ctc acc gaa 144 Leu Asp Gly Ala Trp Gly Thr Gln Leu Gln Arg Ala Asn Leu Thr Glu 35 40 45 gcg gac ttc cgc tgg gac gaa gcc gac ccc acg cgg atg tac cgg ggc 192 Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Gly 50 55 60 aac ttc gac ctg ctg caa ctg acc aag cct gac gtg att cgc gcc gtg 240 Asn Phe Asp Leu Leu Gln Leu Thr Lys Pro Asp Val Ile Arg Ala Val 65 70 75 80 cac cgc gcc tat ttc gag gcc gga gcg gac atc gcc agc acc aat acc 288 His Arg Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ala Ser Thr Asn Thr 85 90 95 ttc aac tcc acg acc atc tcg cag gcg gat tac ggc acc gag gca ctg 336 Phe Asn Ser Thr Thr Ile Ser Gln Ala Asp Tyr Gly Thr Glu Ala Leu 100 105 110 gcc tac gcc atg aac cgc gag ggg gca agg ctg gcc cgc gaa gtc gcc 384 Ala Tyr Ala Met Asn Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala 115 120 125 gac gag ttc gag gcg cgc gac ggc aaa aag cgc tgg gtg gcg ggg agt 432
Asp Glu Phe Glu Ala Arg Asp Gly Lys Lys Arg Trp Val Ala Gly Ser130 135 140gtc ggt ccc acc aac cgc acc gcg acc ctt tct ccc gac gtg gag cgg480Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg145 150 155 160ccc gag ttc cgc aac gtg acc tac gac gac ctc gtg gcg gcg tac tcg528Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser165 170 175gag gcc atc acc ggg ttg atg gaa ggt ggc gcg gac ctg ctg ctc att576Glu Ala Ile Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu Ile180 185 190gaa acg gtg ttt gac acg ctg aac gcc aaa gcc gcg ctg ttt gcc gcg624Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala195 200 205cag gac gtg ttc gcg gcg cag ggg cgc gag ctg ccg gtc atg ctc tcg672Gln Asp Val Phe Ala Ala Gln Gly Arg Glu Leu Pro Val Met Leu Ser210 215 220ggc acc atc acc gac gcc tcg ggc cgc acg ctg agc ggg cag acg ccc720Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Pro225 230 235 240gaa gcc ttc gcg gtg agc acc gag cac gcc ggc ctc ttt tcg ctg ggc768Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly245 250 255ctg aac tgc gcg ctg ggc gcc gac ctg ctg cgg ccc cac ctg cgc gca816Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro His Leu Arg Ala260 265 270att gcg gcg aac acg gag gcg ctg gtg tcg gtt cac ccc aac gcg ggc864Ile Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly275 280 285ctc ccc aac gcc ttc ggg gaa tac gac gaa acg ccc gaa cac acg gcg912Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala290 295 300gcg gtg ctg gcc gac ttc gcc cgc gag ggg ctg gtc aac atc gtg ggc960Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn Ile Val Gly305 310 315 320ggc tgc tgc ggc acc aca ccc gag cac atc aaa gcg att gcg gag gcg1008Gly Cys Cys Gly Thr Thr Pro Glu His Ile Lys Ala Ile Ala Glu Ala325 330 335gtg aag gac att ccc ccg cgc cag gcg ctg caa ctg ccg cct tac ctg1056Val Lys Asp Ile Pro Pro Arg Gln Ala Leu Gln Leu Pro Pro Tyr Leu340 345 350cgc ctc agc ggc ctc gaa gcc ttc acc ctg acg ccg gaa acc aac ttc1104Arg Leu Ser Gly Leu Glu 
Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe355 360 365gtc aac gtg ggc gag cgc acc aac gtg acc ggc agt ccc aag ttc agc1152Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser370 375 380
aag gcg att ctg gcc ggc gac tac gac gca ggg ctc aag att gcc cgc1200Lys Ala Ile Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lys Ile Ala Arg385 390 395 400cag cag gtg acg aac ggc gcg caa atc gtg gac atc aac ttc gac gag1248Gln Gln Val Thr Asn Gly Ala Gln Ile Val Asp Ile Asn Phe Asp Glu405 410 415ggg atg ctc gac ggc gaa gga gcg atg gtc aag ttc ctc aac ctg ctc1296Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu420 425 430gcc ggg gag ccg gac atc tcg cgc gtg ccc ctg atg ctc gac tcg tcc1344Ala Gly Glu Pro Asp Ile Ser Arg Val Pro Leu Met Leu Asp Ser Ser435 440 445aag tgg gag att ctg gaa gcg ggg ctg cgg cgg gtg cag ggc aag gca1392Lys Trp Glu Ile Leu Glu Ala Gly Leu Arg Arg Val Gln Gly Lys Ala450 455 460gtc gtc aac tcc atc tcg ctc aag gac ggc gag gcc agg ttt ctg gaa1440Val Val Asn Ser Ile Ser Leu Lys Asp Gly Glu Ala Arg Phe Leu Glu465 470 475 480cgc gcc cgg ctg ctg cgg cgc tac ggg gcg gcg gcg gtg gtc atg gcc1488Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala485 490 495ttc gac gaa cag gga cag gcc gac aac ctc gcc cga cgc cgg gag att1536Phe Asp Glu Gln Gly Gln Ala Asp Asn Leu Ala Arg Arg Arg Glu Ile500 505 510ctg ggc cgc gcg tat agg ctg ctg acc gag cag gcg gac ttt ccg ccg1584Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gln Ala Asp Phe Pro Pro515 520 525cag gac atc att ttc gac ccc aac gtg ctg acc gtt gcc acc ggc atc1632Gln Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly Ile530 535 540gag gaa cac gac cgc tac gcg ctg gac ttt atc gag gcg acg cgc tgg1680Glu Glu His Asp Arg Tyr Ala Leu Asp Phe Ile Glu Ala Thr Arg Trp545 550 555 560att aaa gaa aac ctg ccg gcg gcg aag gtg tcg ggc ggg att tcc aac1728Ile Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly Ile Ser Asn565 570 575gtc tcg ttc agc ttc cgg ggc aac aac cac gtg cgc gag gcg atg cac1776Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His580 585 590gcg gtg ttt ctg tac cac gcc atc cgc gcc ggg ctg gac atg ggc atc1824Ala Val Phe Leu Tyr His Ala Ile Arg Ala Gly Leu Asp Met Gly Ile595 600 605gtg aac gcg 
ggg atg ctg gcg gtg tac gag gac atc gag ccg gag ctg1872Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp Ile Glu Pro Glu Leu610 615 620cgc gag gcc gtc gag gac gtc att ctg gct cgc cgt ccg gac gcc acc1920
Arg Glu Ala Val Glu Asp Val Ile Leu Ala Arg Arg Pro Asp Ala Thr625 630 635 640gag cgt ttg ctg acg ctg gcc gac cgc tac aag gac atc aag cgc gaa1968Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp Ile Lys Arg Glu645 650 655agt gcc gcc cag agc gcc tgg cgc gac ctg ccg gtg cag gaa cgg ctg2016Ser Ala Ala Gln Ser Ala Trp Arg Asp Leu Pro Val Gln Glu Arg Leu660 665 670cgg cac gca ctg gtg cag ggc gtc gcc gac cac gtg gat gag gac gcc2064Arg His Ala Leu Val Gln Gly Val Ala Asp His Val Asp Glu Asp Ala675 680 685gag gcc gcc tat cag gaa ctc ggc agc ccg ctg gcc gtc atc gaa ggc2112Glu Ala Ala Tyr Gln Glu Leu Gly Ser Pro Leu Ala Val Ile Glu Gly690 695 700ccg ctg atg gac ggc atg aac gtg gtg ggc gac ctc ttc ggc gcg ggg2160Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly705 710 715 720aaa atg ttc ctg ccg cag gtg gtc aaa tcc gcc cgc gtg atg aaa aag2208Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Lys725 730 735gca gtg gcc tac ctc acg ccc tat ctg gaa gcg gag aag gcg gaa agc2256Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser740 745 750tcc agc aag ggc aag gta ctg ctg gcg acc gtc aag ggc gat gtg cac2304Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His755 760 765gac atc ggc aag aac atc gtg ggc gtg gtg ctc gcc tgc aac ggc tat2352Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Ala Cys Asn Gly Tyr770 775 780cag gtg acc gac ctc ggc gtg atg gtg ccg ggc gag aag att ctg gac2400Gln Val Thr Asp Leu Gly Val Met Val Pro Gly Glu Lys Ile Leu Asp785 790 795 800gaa gcc gag cgg ctc ggt gcc gac gtg atc ggt ctg agc ggg ctg att2448Glu Ala Glu Arg Leu Gly Ala Asp Val Ile Gly Leu Ser Gly Leu Ile805 810 815acg cct tcc tta gac gaa atg gtg aac gtg gcc cgc gag atg acg cgc2496Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg820 825 830cgg ggc gtg aaa act cca ctg ctg atc ggc ggc gcg acg acc agc cgg2544Arg Gly Val Lys Thr Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg835 840 845gcg cac acg gcg gtc aag att gac ccg gcc tac gac ggg acg gta gtg2592Ala His Thr 
Ala Val Lys Ile Asp Pro Ala Tyr Asp Gly Thr Val Val850 855 860cac gtg ctg gac gcc agc cgc gcc gtg acc gtg acc aac gac ctg ctg2640His Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu865 870 875 880
acc gac gag gcc gcc tac gct ggg cgc gtg cag ggc gag tat gac acc2688Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gln Gly Glu Tyr Asp Thr885 890 895ttg cgc gag cgc cac ggc gag cgg cag gtg cgg ctg att gcg ctg gca2736Leu Arg Glu Arg His Gly Glu Arg Gln Val Arg Leu Ile Ala Leu Ala900 905 910gaa gcc cgc gcc cgc gcc ccg caa ctg agt gcc gcc gtg ccc ccc gcg2784Glu Ala Arg Ala Arg Ala Pro Gln Leu Ser Ala Ala Val Pro Pro Ala915 920 925ccg cac gat ctg ggc cgt cag gtg gtc gaa cag ccc att gcc gag ctg2832Pro His Asp Leu Gly Arg Gln Val Val Glu Gln Pro Ile Ala Glu Leu930 935 940ctg ccc ttc atc gac tgg acg ccc ttt ttc atc gcc tgg gag atg aag2880Leu Pro Phe Ile Asp Trp Thr Pro Phe Phe Ile Ala Trp Glu Met Lys945 950 955 960ggc atc tac ccg ggc atc ctg acc gac cct ctg cgt ggc gag gag gcc2928Gly Ile Tyr Pro Gly Ile Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala965 970 975cgc aag ctg ttt gcc gac gcg cag gcg ctg ctg gag cag gtt atc gcc2976Arg Lys Leu Phe Ala Asp Ala Gln Ala Leu Leu Glu Gln Val Ile Ala980 985 990gac ggc tcg ctg cgg gcg cgc ggc gtc atc ggg ctg tgg ccc gcg cac3024Asp Gly Ser Leu Arg Ala Arg Gly Val Ile Gly Leu Trp Pro Ala His99510001005ggc gac gac atc gtg ctg gac gat gcg gcg atg ggg cgt ggc gag acg3072Gly Asp Asp Ile Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr101010151020ctg gat ttc gag acg cac gaa ctc gcc gcc ggg cgc gag ccg ctg ccg3120Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro1025 103010351040aac atg ccg cgc ctg cac acg ctg cgg cag cag cgc gac cag acc acg3168Asn Met Pro Arg Leu His Thr Leu Arg Gln Gln Arg Asp Gln Thr Thr104510501055ccg aac act gcg ctg gct gac ttt gtg gcg gaa gga ggc gac cac atc3216Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp His Ile106010651070ggc gcc ttc gcc acg gcc atc ttc ggc gcc gag gag ttg gcg cag cag3264Gly Ala Phe Ala Thr Ala Ile Phe Gly Ala Glu Glu Leu Ala Gln Gln107510801085ttc gag gcg cag cac gac gac tac aac tcg att ctg gtc aag gcg gtg3312Phe Glu Ala Gln His Asp Asp Tyr Asn Ser Ile Leu Val Lys Ala Val109010951100gcc gac 
cga ctg gcc gag gcc ttt gcc gag aag ctg cac cgc gac gtg3360Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lys Leu His Arg Asp Val110511101115 1120cgc gtg cgg cac tgg ggt tac gcc gag ggc gag gcg ctc gac aac acc3408
Arg Val Arg His Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr 1125 1130 1135 gac ctc atc aag gag cgc tat cag ggc atc cgc cct gcg ccc ggc tac 3456 Asp Leu Ile Lys Glu Arg Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr 1140 1145 1150 ccc gcg cag ccc gac cac acc gag aaa cgc acc ctg ttt gag ctg ctg 3504 Pro Ala Gln Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu 1155 1160 1165 gac gcg gaa agc atc ggc ctg cgc ctc acc gag tcg tgt gcc atg acc 3552 Asp Ala Glu Ser Ile Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr 1170 1175 1180 ccg gcg gcg gcg gtg tcg ggg ctg tac ttc gcg cat ccg gag gcc cgt 3600 Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg 1185 1190 1195 1200 tat ttc gca gtg ggc cgc atc ggg cgc gac cag gtg gag aac tac gcc 3648 Tyr Phe Ala Val Gly Arg Ile Gly Arg Asp Gln Val Glu Asn Tyr Ala 1205 1210 1215 gcc cgt aag ggt tgg act gtg cag gaa gcc gag cgc tgg ctg ggg ccg 3696 Ala Arg Lys Gly Trp Thr Val Gln Glu Ala Glu Arg Trp Leu Gly Pro 1220 1225 1230 ctg ctg gcg tac agc gcc ggg ccg ggg cca gaa gca agc cag aaa gcc 3744 Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gln Lys Ala 1235 1240 1245 ctc ggc gca gag ctg aca gga gcg caa tcg tga 3777 Leu Gly Ala Glu Leu Thr Gly Ala Gln Ser 1250 1255
<210> 36
<211> 1258
<212> PRT
<213> Deinococcus radiodurans
<400> 36
Met Ser His His Pro Glu Ala Ser Ala Ser Ala Asn Pro Ser Ile Asn 1 5 10 15 His Gln Pro Ser Thr Ile Thr Glu Ala Ala Arg Gln Arg Ile Leu Ile 20 25 30 Leu Asp Gly Ala Trp Gly Thr Gln Leu Gln Arg Ala Asn Leu Thr Glu 35 40 45 Ala Asp Phe Arg Trp Asp Glu Ala Asp Pro Thr Arg Met Tyr Arg Gly 50 55 60 Asn Phe Asp Leu Leu Gln Leu Thr Lys Pro Asp Val Ile Arg Ala Val 65 70 75 80 His Arg Ala Tyr Phe Glu Ala Gly Ala Asp Ile Ala Ser Thr Asn Thr 85 90 95 Phe Asn Ser Thr Thr Ile Ser Gln Ala Asp Tyr Gly Thr Glu Ala Leu 100 105 110
Ala Tyr Ala Met Asn Arg Glu Gly Ala Arg Leu Ala Arg Glu Val Ala115 120 125Asp Glu Phe Glu Ala Arg Asp Gly Lys Lys Arg Trp Val Ala Gly Ser130 135 140Val Gly Pro Thr Asn Arg Thr Ala Thr Leu Ser Pro Asp Val Glu Arg145 150 155 160Pro Glu Phe Arg Asn Val Thr Tyr Asp Asp Leu Val Ala Ala Tyr Ser165 170 175Glu Ala Ile Thr Gly Leu Met Glu Gly Gly Ala Asp Leu Leu Leu Ile180 185 190Glu Thr Val Phe Asp Thr Leu Asn Ala Lys Ala Ala Leu Phe Ala Ala195 200 205Gln Asp Val Phe Ala Ala Gln Gly Arg Glu Leu Pro Val Met Leu Ser210 215 220Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Pro225 230 235 240Glu Ala Phe Ala Val Ser Thr Glu His Ala Gly Leu Phe Ser Leu Gly245 250 255Leu Asn Cys Ala Leu Gly Ala Asp Leu Leu Arg Pro His Leu Arg Ala260 265 270Ile Ala Ala Asn Thr Glu Ala Leu Val Ser Val His Pro Asn Ala Gly275 280 285Leu Pro Asn Ala Phe Gly Glu Tyr Asp Glu Thr Pro Glu His Thr Ala290 295 300Ala Val Leu Ala Asp Phe Ala Arg Glu Gly Leu Val Asn Ile Val Gly305 310 315 320Gly Cys Cys Gly Thr Thr Pro Glu His Ile Lys Ala Ile Ala Glu Ala325 330 335Val Lys Asp Ile Pro Pro Arg Gln Ala Leu Gln Leu Pro Pro Tyr Leu340 345 350Arg Leu Ser Gly Leu Glu Ala Phe Thr Leu Thr Pro Glu Thr Asn Phe355 360 365Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Pro Lys Phe Ser370 375 380Lys Ala Ile Leu Ala Gly Asp Tyr Asp Ala Gly Leu Lys Ile Ala Arg385 390 395 400Gln Gln Val Thr Asn Gly Ala Gln Ile Val Asp Ile Asn Phe Asp Glu405 410 415Gly Met Leu Asp Gly Glu Gly Ala Met Val Lys Phe Leu Asn Leu Leu420 425 430Ala Gly Glu Pro Asp Ile Ser Arg Val Pro Leu Met Leu Asp Ser Ser
435 440 445Lys Trp Glu Ile Leu Glu Ala Gly Leu Arg Arg Val Gln Gly Lys Ala450 455 460Val Val Asn Ser Ile Ser Leu Lys Asp Gly Glu Ala Arg Phe Leu Glu465 470 475 480Arg Ala Arg Leu Leu Arg Arg Tyr Gly Ala Ala Ala Val Val Met Ala485 490 495Phe Asp Glu Gln Gly Gln Ala Asp Asn Leu Ala Arg Arg Arg Glu Ile500 505 510Leu Gly Arg Ala Tyr Arg Leu Leu Thr Glu Gln Ala Asp Phe Pro Pro515 520 525Gln Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val Ala Thr Gly Ile530 535 540Glu Glu His Asp Arg Tyr Ala Leu Asp Phe Ile Glu Ala Thr Arg Trp545 550 555 560Ile Lys Glu Asn Leu Pro Ala Ala Lys Val Ser Gly Gly Ile Ser Asn565 570 575Val Ser Phe Ser Phe Arg Gly Asn Asn His Val Arg Glu Ala Met His580 585 590Ala Val Phe Leu Tyr His Ala Ile Arg Ala Gly Leu Asp Met Gly Ile595 600 605Val Asn Ala Gly Met Leu Ala Val Tyr Glu Asp Ile Glu Pro Glu Leu610 615 620Arg Glu Ala Val Glu Asp Val Ile Leu Ala Arg Arg Pro Asp Ala Thr625 630 635 640Glu Arg Leu Leu Thr Leu Ala Asp Arg Tyr Lys Asp Ile Lys Arg Glu645 650 655Ser Ala Ala Gln Ser Ala Trp Arg Asp Leu Pro Val Gln Glu Arg Leu660 665 670Arg His Ala Leu Val Gln Gly Val Ala Asp His Val Asp Glu Asp Ala675 680 685Glu Ala Ala Tyr Gln Glu Leu Gly Ser Pro Leu Ala Val Ile Glu Gly690 695 700Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala Gly705 710 715 720Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys Lys725 730 735Ala Val Ala Tyr Leu Thr Pro Tyr Leu Glu Ala Glu Lys Ala Glu Ser740 745 750Ser Ser Lys Gly Lys Val Leu Leu Ala Thr Val Lys Gly Asp Val His755 760 765
Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Ala Cys Asn Gly Tyr770 775 780Gln Val Thr Asp Leu Gly Val Met Val Pro Gly Glu Lys Ile Leu Asp785 790 795 800Glu Ala Glu Arg Leu Gly Ala Asp Val Ile Gly Leu Ser Gly Leu Ile805 810 815Thr Pro Ser Leu Asp Glu Met Val Asn Val Ala Arg Glu Met Thr Arg820 825 830Arg Gly Val Lys Thr Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg835 840 845Ala His Thr Ala Val Lys Ile Asp Pro Ala Tyr Asp Gly Thr Val Val850 855 860His Val Leu Asp Ala Ser Arg Ala Val Thr Val Thr Asn Asp Leu Leu865 870 875 880Thr Asp Glu Ala Ala Tyr Ala Gly Arg Val Gln Gly Glu Tyr Asp Thr885 890 895Leu Arg Glu Arg His Gly Glu Arg Gln Val Arg Leu Ile Ala Leu Ala900 905 910Glu Ala Arg Ala Arg Ala Pro Gln Leu Ser Ala Ala Val Pro Pro Ala915 920 925Pro His Asp Leu Gly Arg Gln Val Val Glu Gln Pro Ile Ala Glu Leu930 935 940Leu Pro Phe Ile Asp Trp Thr Pro Phe Phe Ile Ala Trp Glu Met Lys945 950 955 960Gly Ile Tyr Pro Gly Ile Leu Thr Asp Pro Leu Arg Gly Glu Glu Ala965 970 975Arg Lys Leu Phe Ala Asp Ala Gln Ala Leu Leu Glu Gln Val Ile Ala980 985 990Asp Gly Ser Leu Arg Ala Arg Gly Val Ile Gly Leu Trp Pro Ala His995 1000 1005Gly Asp Asp Ile Val Leu Asp Asp Ala Ala Met Gly Arg Gly Glu Thr101010151020Leu Asp Phe Glu Thr His Glu Leu Ala Ala Gly Arg Glu Pro Leu Pro1025 103010351040Asn Met Pro Arg Leu His Thr Leu Arg Gln Gln Arg Asp Gln Thr Thr104510501055Pro Asn Thr Ala Leu Ala Asp Phe Val Ala Glu Gly Gly Asp His Ile106010651070Gly Ala Phe Ala Thr Ala Ile Phe Gly Ala Glu Glu Leu Ala Gln Gln107510801085Phe Glu Ala Gln His Asp Asp Tyr Asn Ser Ile Leu Val Lys Ala Val109010951100
Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Lys Leu His Arg Asp Val1105 111011151120Arg Val Arg His Trp Gly Tyr Ala Glu Gly Glu Ala Leu Asp Asn Thr112511301135Asp Leu Ile Lys Glu Arg Tyr Gln Gly Ile Arg Pro Ala Pro Gly Tyr114011451150Pro Ala Gln Pro Asp His Thr Glu Lys Arg Thr Leu Phe Glu Leu Leu115511601165Asp Ala Glu Ser Ile Gly Leu Arg Leu Thr Glu Ser Cys Ala Met Thr117011751180Pro Ala Ala Ala Val Ser Gly Leu Tyr Phe Ala His Pro Glu Ala Arg1185 119011951200Tyr Phe Ala Val Gly Arg Ile Gly Arg Asp Gln Val Glu Asn Tyr Ala120512101215Ala Arg Lys Gly Trp Thr Val Gln Glu Ala Glu Arg Trp Leu Gly Pro122012251230Leu Leu Ala Tyr Ser Ala Gly Pro Gly Pro Glu Ala Ser Gln Lys Ala123512401245Leu Gly Ala Glu Leu Thr Gly Ala Gln Ser12501255210372113642212DNA213丙酮丁醇梭菌(Clostridium acetobutylicum)220
221CDS222(1)..(3639)223RCA0126540037ctt atg aat tct tca cta aag aat ttg tta aat aac aaa att tta gtt48Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys Ile Leu Val1 5 10 15tta gat ggt gct atg gga aca tgt att caa tcc ttt aat cta gat gaa96Leu Asp Gly Ala Met Gly Thr Cys Ile Gln Ser Phe Asn Leu Asp Glu20 25 30ggc gac ttt aaa ggt tcc tta tct tgt aca tgt cat tcc aat caa aaa144Gly Asp Phe Lys Gly Ser Leu Ser Cys Thr Cys His Ser Asn Gln Lys35 40 45gga aac aat gat gtt tta aat tta acc aag cca gaa ata ata aaa gaa192Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu Ile Ile Lys Glu50 55 60atc cac aag aga tac ctt gaa gct ggc gca gat ata ata gaa aca aac240Ile His Lys Arg Tyr Leu Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn65 70 75 80
act ttt aac gct act gaa ata tca caa aaa gat tat aat atg caa gat288Thr Phe Asn Ala Thr Glu Ile Ser Gln Lys Asp Tyr Asn Met Gln Asp85 90 95aaa ata tat gat att aat ttt aag ggg gca aaa ctc gca aag gaa gct336Lys Ile Tyr Asp Ile Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala100 105 110tgt act tac tac aca aaa cta aat cct aat aag cct aga ttt gct gct384Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala115 120 125ggt tct att ggg cct aca aat aga act gct tct cta tct cca gat gtt432Gly Ser Ile Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val130 135 140gaa aat cct ggt ttt aga aat gta acc ttt gat gag cta tgt aat gcc480Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala145 150 155 160tat aaa cat caa ata gag gct cta ata gat gga ggt gta gac ctt ctt528Tyr Lys His Gln Ile Glu Ala Leu Ile Asp Gly Gly Val Asp Leu Leu165 170 175tta att gaa act ata ttt gat act tta aac gct aga gca gca atc ttt576Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Arg Ala Ala Ile Phe180 185 190gca gca gaa aca gta ttt gaa aat aaa aaa ata aaa ctt cct att ata624Ala Ala Glu Thr Val Phe Glu Asn Lys Lys Ile Lys Leu Pro Ile Ile195 200 205att tca ggg aca ata gct gat aaa agt gga aga ata tta tcc ggt caa672Ile Ser Gly Thr Ile Ala Asp Lys Ser Gly Arg Ile Leu Ser Gly Gln210 215 220act ctt gac gct ttt gca gaa agt tta aaa aac gaa aat ata att gct720Thr Leu Asp Ala Phe Ala Glu Ser Leu Lys Asn Glu Asn Ile Ile Ala225 230 235 240ata ggg ctt aat tgt tcc ttt ggt gct gaa gaa ctt ata cct ttt ata768Ile Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu Ile Pro Phe Ile245 250 255aaa aga ctc tct gaa aca caa aat aga tat ata tcc ttt cat cca aac816Lys Arg Leu Ser Glu Thr Gln Asn Arg Tyr Ile Ser Phe His Pro Asn260 265 270gca gga ctt cca aac tcc ctt ggt gaa tat gaa gaa ctg cca gag gaa864Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu275 280 285act gct agc att gta aaa aaa tta gca ctt gaa gga cat tta aat ata912Thr Ala Ser Ile Val Lys Lys Leu Ala Leu Glu Gly His Leu Asn Ile290 295 300gtt gga ggc tgc tgt ggc act aca 
cca gaa cat ata aga gca ata agc960Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Arg Ala Ile Ser305 310 315 320agc gta gtt aaa ggc att tct cca aga aaa gtt cca aac ttg gaa ccc1008
Ser Val Val Lys Gly Ile Ser Pro Arg Lys Val Pro Asn Leu Glu Pro325 330 335aaa aca att tac agc gga cta gaa aac ata aaa att gat aag aac agt1056Lys Thr Ile Tyr Ser Gly Leu Glu Asn Ile Lys Ile Asp Lys Asn Ser340 345 350aac ttc ata aat ata ggc gaa aga aca aat gta gcg ggc tca aga aaa1104Asn Phe Ile Asn Ile Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lys355 360 365ttc gca agg ctt ata cgt gaa aaa aat tat gag gag gct cta acc att1152Phe Ala Arg Leu Ile Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr Ile370 375 380gca aga cat cag gtt gaa aat ggt gcc caa att ata gat ata aat ttt1200Ala Arg His Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile Asn Phe385 390 395 400gat gat gca ctt tta gat gct cgc tct gaa atg gaa aca ttt tta aga1248Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg405 410 415ctt att gca agt gaa cct gaa ata tca aaa gtt cca gtt atg ata gac1296Leu Ile Ala Ser Glu Pro Glu Ile Ser Lys Val Pro Val Met Ile Asp420 425 430tcc tct aat ttt gaa gtt tta aaa gtt gga tta aag tct att caa ggt1344Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser Ile Gln Gly435 440 445aaa gcc ata gta aat tcc ata agt ctt aag gtt gga gaa gaa aag ttc1392Lys Ala Ile Val Asn Ser Ile Ser Leu Lys Val Gly Glu Glu Lys Phe450 455 460att gaa gag gca aaa ttt ata aag aac ttt ggc gct ggc gta gtt gta1440Ile Glu Glu Ala Lys Phe Ile Lys Asn Phe Gly Ala Gly Val Val Val465 470 475 480atg gcc ttt gac gaa gaa ggt caa gca gct act tat gaa aga aaa att1488Met Ala Phe Asp Glu Glu Gly Gln Ala Ala Thr Tyr Glu Arg Lys Ile485 490 495gaa atc tgc aag aga gct tat act att ctc aca gaa aaa gtt gag ttt1536Glu Ile Cys Lys Arg Ala Tyr Thr Ile Leu Thr Glu Lys Val Glu Phe500 505 510cca cct gaa aat ata ata ttt gat cca aat ata cta tct ata gcg aca1584Pro Pro Glu Asn Ile Ile Phe Asp Pro Asn Ile Leu Ser Ile Ala Thr515 520 525gga att gaa gaa cat gac aac tat gca gtt aat tac ata aaa gct gtt1632Gly Ile Glu Glu His Asp Asn Tyr Ala Val Asn Tyr Ile Lys Ala Val530 535 540aaa tgg ata aaa gag aat cta cca tac gct aaa gtc agc ggt gga gtt1680Lys Trp Ile Lys 
Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val545 550 555 560agc aac ctc tcc ttt tct ttt agg ggt aat gac gca ata aga aga gct1728Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala Ile Arg Arg Ala565 570 575
atg cat tct gtt ttc ctt tac cat gca ata aac gct gga atg gat atg1776Met His Ser Val Phe Leu Tyr His Ala Ile Asn Ala Gly Met Asp Met580 585 590ggt att gtt aat cca gca atg att gat tta tat gac gat ata gat aag1824Gly Ile Val Asn Pro Ala Met Ile Asp Leu Tyr Asp Asp Ile Asp Lys595 600 605gat ctt ctc gaa aag gtt gag aat gtt gta cta aat aaa tca tct aac1872Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn610 615 620gct tct gaa tca tta cta gaa ttt gct caa acg tat aaa aag acg act1920Ala Ser Glu Ser Leu Leu Glu Phe Ala Gln Thr Tyr Lys Lys Thr Thr625 630 635 640gaa acc tta gaa aag cac gag gat gaa tgg cga caa aaa agc cca agt1968Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gln Lys Ser Pro Ser645 650 655gaa agg ttg agt tat gct tta gtt aaa gga aat gtt gaa ttt att gaa2016Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe Ile Glu660 665 670gaa gat ata gaa gaa gca aga aaa gag tat aca aat gca ctt gaa att2064Glu Asp Ile Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu Ile675 680 685ata gag gtt cct tta atg aat gga atg aaa aaa gtg ggt aaa ctt ttt2112Ile Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe690 695 700gga gag gga aaa atg ttt ctt cct caa gta gta aaa agt gct aga gtt2160Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val705 710 715 720atg aaa aag gct gtt gaa tgt ctt ctt ccc tat ata aac gaa gaa aag2208Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr Ile Asn Glu Glu Lys725 730 735tct aaa aat cac aat aaa agt gct ggt aag gtt gta ttt gca act gtt2256Ser Lys Asn His Asn Lys Ser Ala Gly Lys Val Val Phe Ala Thr Val740 745 750aaa ggc gat gtt cat gac ata ggc aaa aat atc gta tct gta gtt ctt2304Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser Val Val Leu755 760 765tcc tgc aac aat ttt gaa gtt ata gat tta gga gta atg gtt ccc cct2352Ser Cys Asn Asn Phe Glu Val Ile Asp Leu Gly Val Met Val Pro Pro770 775 780gaa acc ata ctt gaa acg gca aaa cgt gaa aat gca gat atc att gct2400Glu Thr Ile Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp Ile Ile Ala785 790 795 800tta agt ggt 
tta att aca cct tct ctt aat gaa atg gct tat gta gct2448Leu Ser Gly Leu Ile Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala805 810 815gaa gaa atg aaa agg ctt aat ttt gat ata cca ctt atg gtg ggt ggt2496
Glu Glu Met Lys Arg Leu Asn Phe Asp Ile Pro Leu Met Val Gly Gly820 825 830gct gct acc tca aaa act cac aca gct tta aaa cta gct acg aaa tat2544Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lys Leu Ala Thr Lys Tyr835 840 845aaa tat gta gta cac agt act gat gct tca gat gct gtt acc gta gcc2592Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala850 855 860aaa aat cta atg agt gaa aac aaa ttt act ttc tta gaa aaa tta aat2640Lys Asn Leu Met Ser Glu Asn Lys Phe Thr Phe Leu Glu Lys Leu Asn865 870 875 880gaa gag tat tct aaa ata aga gag acc ttc tct act aat aag att gaa2688Glu Glu Tyr Ser Lys Ile Arg Glu Thr Phe Ser Thr Asn Lys Ile Glu885 890 895ctt atc tcc att caa aac gca aga aaa aac aga ttt act att gac tgg2736Leu Ile Ser Ile Gln Asn Ala Arg Lys Asn Arg Phe Thr Ile Asp Trp900 905 910aat aaa act aaa ata act gaa cct aaa ttt gtc ggt ata aaa aaa tta2784Asn Lys Thr Lys Ile Thr Glu Pro Lys Phe Val Gly Ile Lys Lys Leu915 920 925caa gct gta cct ata aat gaa tta aga aag tat ata gat tgg act ttc2832Gln Ala Val Pro Ile Asn Glu Leu Arg Lys Tyr Ile Asp Trp Thr Phe930 935 940ttc ttt acg tct tgg gat atg gga atg aat tac ccc aaa ata atg aaa2880Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys Ile Met Lys945 950 955 960gat cct aaa tac gga gct gaa gct caa aaa ctc ttt aag gat gcc aat2928Asp Pro Lys Tyr Gly Ala Glu Ala Gln Lys Leu Phe Lys Asp Ala Asn965 970 975gaa atg ctt gat tta ttg caa aaa gaa aat tta atc act tgt aat gga2976Glu Met Leu Asp Leu Leu Gln Lys Glu Asn Leu Ile Thr Cys Asn Gly980 985 990gtt ttt gga ata ttc cca gct aat tct gtt aat gat gat ata gaa atc3024Val Phe Gly Ile Phe Pro Ala Asn Ser Val Asn Asp Asp Ile Glu Ile99510001005tac act gat aaa gga act gta acc ata aat act ctt cgt cag cag cag3072Tyr Thr Asp Lys Gly Thr Val Thr Ile Asn Thr Leu Arg Gln Gln Gln101010151020ata ctt aaa gac agc gat tat aaa gct cta tct gat tat atc gct cca3120Ile Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr Ile Ala Pro1025 103010351040aag ggt att ggc atc aaa gat tat ata ggt ggt ttt att gta act gct3168Lys Gly 
Ile Gly Ile Lys Asp Tyr Ile Gly Gly Phe Ile Val Thr Ala104510501055gga ata ggt gca aag gaa tat tcc gat aaa tta aag aaa aaa tgc gac3216Gly Ile Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lys Lys Cys Asp106010651070
gat tat gga gct act atg ctt aaa ctt ata tgc gat aga ctt gca gag3264Asp Tyr Gly Ala Thr Met Leu Lys Leu Ile Cys Asp Arg Leu Ala Glu107510801085gcc ttt tca gaa ctt ctt cac cta agg gta aga aaa gaa tac tgg gga3312Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lys Glu Tyr Trp Gly109010951100tac tct caa gat gaa aac tta tcc tta gaa aaa ctt ctt aaa gga agt3360Tyr Ser Gln Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser1105111011151120tac aga ggg ata aaa cca gct att gga tat cct tct att ccc gat cac3408Tyr Arg Gly Ile Lys Pro Ala Ile Gly Tyr Pro Ser Ile Pro Asp His112511301135tct gaa aaa gca aag tta ttt gat tta ctt tta ggt aaa act tca ata3456Ser Glu Lys Ala Lys Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser Ile114011451150gga gtg gaa ttg acg gaa agt tat atg atg aat cca act tca agt gta3504Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val115511601165tgc ggt ttg tat ttt gca aat gaa cga gca aaa tac ttt aat ata aat3552Cys Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn Ile Asn117011751180aaa ata gga aaa gat caa ctt gag gac tat gct gtt cga agt aat aaa3600Lys Ile Gly Lys Asp Gln Leu Glu Asp Tyr Ala Val Arg Ser Asn Lys1185119011951200gac att aat gaa ata aaa aaa tta tta gat act ctg tta taa3642Asp Ile Asn Glu Ile Lys Lys Leu Leu Asp Thr Leu Leu12051210210382111213212PRT213丙酮丁醇梭菌40038Leu Met Asn Ser Ser Leu Lys Asn Leu Leu Asn Asn Lys Ile Leu Val1 5 10 15Leu Asp Gly Ala Met Gly Thr Cys Ile Gln Ser Phe Asn Leu Asp Glu20 25 30Gly Asp Phe Lys Gly Ser Leu Ser Cys Thr Cys His Ser Asn Gln Lys35 40 45Gly Asn Asn Asp Val Leu Asn Leu Thr Lys Pro Glu Ile Ile Lys Glu50 55 60Ile His Lys Arg Tyr Leu Glu Ala Gly Ala Asp Ile Ile Glu Thr Asn65 70 75 80Thr Phe Asn Ala Thr Glu Ile Ser Gln Lys Asp Tyr Asn Met Gln Asp85 90 95
Lys Ile Tyr Asp Ile Asn Phe Lys Gly Ala Lys Leu Ala Lys Glu Ala100 105 110Cys Thr Tyr Tyr Thr Lys Leu Asn Pro Asn Lys Pro Arg Phe Ala Ala115 120 125Gly Ser Ile Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val130 135 140Glu Asn Pro Gly Phe Arg Asn Val Thr Phe Asp Glu Leu Cys Asn Ala145 150 155 160Tyr Lys His Gln Ile Glu Ala Leu Ile Asp Gly Gly Val Asp Leu Leu165 170 175Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Arg Ala Ala Ile Phe180 185 190Ala Ala Glu Thr Val Phe Glu Asn Lys Lys Ile Lys Leu Pro Ile Ile195 200 205Ile Ser Gly Thr Ile Ala Asp Lys Ser Gly Arg Ile Leu Ser Gly Gln210 215 220Thr Leu Asp Ala Phe Ala Glu Ser Leu Lys Asn Glu Asn Ile Ile Ala225 230 235 240Ile Gly Leu Asn Cys Ser Phe Gly Ala Glu Glu Leu Ile Pro Phe Ile245 250 255Lys Arg Leu Ser Glu Thr Gln Asn Arg Tyr Ile Ser Phe His Pro Asn260 265 270Ala Gly Leu Pro Asn Ser Leu Gly Glu Tyr Glu Glu Leu Pro Glu Glu275 280 285Thr Ala Ser Ile Val Lys Lys Leu Ala Leu Glu Gly His Leu Asn Ile290 295 300Val Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile Arg Ala Ile Ser305 310 315 320Ser Val Val Lys Gly Ile Ser Pro Arg Lys Val Pro Asn Leu Glu Pro325 330 335Lys Thr Ile Tyr Ser Gly Leu Glu Asn Ile Lys Ile Asp Lys Asn Ser340 345 350Asn Phe Ile Asn Ile Gly Glu Arg Thr Asn Val Ala Gly Ser Arg Lys355 360 365Phe Ala Arg Leu Ile Arg Glu Lys Asn Tyr Glu Glu Ala Leu Thr Ile370 375 380Ala Arg His Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile Asn Phe385 390 395 400Asp Asp Ala Leu Leu Asp Ala Arg Ser Glu Met Glu Thr Phe Leu Arg405 410 415Leu Ile Ala Ser Glu Pro Glu Ile Ser Lys Val Pro Val Met Ile Asp420 425 430
Ser Ser Asn Phe Glu Val Leu Lys Val Gly Leu Lys Ser Ile Gln Gly435 440 445Lys Ala Ile Val Asn Ser Ile Ser Leu Lys Val Gly Glu Glu Lys Phe450 455 460Ile Glu Glu Ala Lys Phe Ile Lys Asn Phe Gly Ala Gly Val Val Val465 470 475 480Met Ala Phe Asp Glu Glu Gly Gln Ala Ala Thr Tyr Glu Arg Lys Ile485 490 495Glu Ile Cys Lys Arg Ala Tyr Thr Ile Leu Thr Glu Lys Val Glu Phe500 505 510Pro Pro Glu Asn Ile Ile Phe Asp Pro Asn Ile Leu Ser Ile Ala Thr515 520 525Gly Ile Glu Glu His Asp Asn Tyr Ala Val Asn Tyr Ile Lys Ala Val530 535 540Lys Trp Ile Lys Glu Asn Leu Pro Tyr Ala Lys Val Ser Gly Gly Val545 550 555 560Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Asp Ala Ile Arg Arg Ala565 570 575Met His Ser Val Phe Leu Tyr His Ala Ile Asn Ala Gly Met Asp Met580 585 590Gly Ile Val Asn Pro Ala Met Ile Asp Leu Tyr Asp Asp Ile Asp Lys595 600 605Asp Leu Leu Glu Lys Val Glu Asn Val Val Leu Asn Lys Ser Ser Asn610 615 620Ala Ser Glu Ser Leu Leu Glu Phe Ala Gln Thr Tyr Lys Lys Thr Thr625 630 635 640Glu Thr Leu Glu Lys His Glu Asp Glu Trp Arg Gln Lys Ser Pro Ser645 650 655Glu Arg Leu Ser Tyr Ala Leu Val Lys Gly Asn Val Glu Phe Ile Glu660 665 670Glu Asp Ile Glu Glu Ala Arg Lys Glu Tyr Thr Asn Ala Leu Glu Ile675 680 685Ile Glu Val Pro Leu Met Asn Gly Met Lys Lys Val Gly Lys Leu Phe690 695 700Gly Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val705 710 715 720Met Lys Lys Ala Val Glu Cys Leu Leu Pro Tyr Ile Asn Glu Glu Lys725 730 735Ser Lys Asn His Asn Lys Ser Ala Gly Lys Val Val Phe Ala Thr Val740 745 750Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser Val Val Leu
755 760 765Ser Cys Asn Asn Phe Glu Val Ile Asp Leu Gly Val Met Val Pro Pro770 775 780Glu Thr Ile Leu Glu Thr Ala Lys Arg Glu Asn Ala Asp Ile Ile Ala785 790 795 800Leu Ser Gly Leu Ile Thr Pro Ser Leu Asn Glu Met Ala Tyr Val Ala805 810 815Glu Glu Met Lys Arg Leu Asn Phe Asp Ile Pro Leu Met Val Gly Gly820 825 830Ala Ala Thr Ser Lys Thr His Thr Ala Leu Lys Leu Ala Thr Lys Tyr835 840 845Lys Tyr Val Val His Ser Thr Asp Ala Ser Asp Ala Val Thr Val Ala850 855 860Lys Asn Leu Met Ser Glu Asn Lys Phe Thr Phe Leu Glu Lys Leu Asn865 870 875 880Glu Glu Tyr Ser Lys Ile Arg Glu Thr Phe Ser Thr Asn Lys Ile Glu885 890 895Leu Ile Ser Ile Gln Asn Ala Arg Lys Asn Arg Phe Thr Ile Asp Trp900 905 910Asn Lys Thr Lys Ile Thr Glu Pro Lys Phe Val Gly Ile Lys Lys Leu915 920 925Gln Ala Val Pro Ile Asn Glu Leu Arg Lys Tyr Ile Asp Trp Thr Phe930 935 940Phe Phe Thr Ser Trp Asp Met Gly Met Asn Tyr Pro Lys Ile Met Lys945 950 955 960Asp Pro Lys Tyr Gly Ala Glu Ala Gln Lys Leu Phe Lys Asp Ala Asn965 970 975Glu Met Leu Asp Leu Leu Gln Lys Glu Asn Leu Ile Thr Cys Asn Gly980 985 990Val Phe Gly Ile Phe Pro Ala Asn Ser Val Asn Asp Asp Ile Glu Ile99510001005Tyr Thr Asp Lys Gly Thr Val Thr Ile Asn Thr Leu Arg Gln Gln Gln101010151020Ile Leu Lys Asp Ser Asp Tyr Lys Ala Leu Ser Asp Tyr Ile Ala Pro1025 103010351040Lys Gly Ile Gly Ile Lys Asp Tyr Ile Gly Gly Phe Ile Val Thr Ala104510501055Gly Ile Gly Ala Lys Glu Tyr Ser Asp Lys Leu Lys Lys Lys Cys Asp106010651070Asp Tyr Gly Ala Thr Met Leu Lys Leu Ile Cys Asp Arg Leu Ala Glu107510801085
Ala Phe Ser Glu Leu Leu His Leu Arg Val Arg Lys Glu Tyr Trp Gly109010951100Tyr Ser Gln Asp Glu Asn Leu Ser Leu Glu Lys Leu Leu Lys Gly Ser1105 111011151120Tyr Arg Gly Ile Lys Pro Ala Ile Gly Tyr Pro Ser Ile Pro Asp His112511301135Ser Glu Lys Ala Lys Leu Phe Asp Leu Leu Leu Gly Lys Thr Ser Ile114011451150Gly Val Glu Leu Thr Glu Ser Tyr Met Met Asn Pro Thr Ser Ser Val115511601165Cys Gly Leu Tyr Phe Ala Asn Glu Arg Ala Lys Tyr Phe Asn Ile Asn117011751180Lys Ile Gly Lys Asp Gln Leu Glu Asp Tyr Ala Val Arg Ser Asn Lys1185 119011951200Asp Ile Asn Glu Ile Lys Lys Leu Leu Asp Thr Leu Leu12051210210392113954212DNA213新月柄桿菌(caulobacter crescentus)220
221CDS222(1)..(3951)223RCO0227140039atg acc gat ctc tcc atc cgc gcc aac cgc gtc gcc gcc ctg aag gcc48Met Thr Asp Leu Ser Ile Arg Ala Asn Arg Val Ala Ala Leu Lys Ala1 5 10 15gcc gcc aag gag cgt att ctc att ctc gac ggc tcc tgg ggc gtg atg96Ala Ala Lys Glu Arg Ile Leu Ile Leu Asp Gly Ser Trp Gly Val Met20 25 30ttc cag aag aag ggg ctg acc gag gcc gac tac cgc gcc gag cgc ttc144Phe Gln Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe35 40 45gcc gcc tac aac ggc cag atg aag ggc aat aac gac atc ctg tgc ctg192Ala Ala Tyr Asn Gly Gln Met Lys Gly Asn Asn Asp Ile Leu Cys Leu50 55 60acg cgg ccc gat ctc gtg gcc gag ctg cac gac gcc tat ttc agc gcc240Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala65 70 75 80ggc gcc gac atc tcc gag acc aac acc ttc tcg ggc acc acc atc gcc288Gly Ala Asp Ile Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr Ile Ala85 90 95cag gcc gac tat cat ctg ggt gaa cag gat gtc tgg gac atc aac ctg336Gln Ala Asp Tyr His Leu Gly Glu Gln Asp Val Trp Asp Ile Asn Leu
100 105 110gaa ggc gcc aag atc ggc cgc tcg gtg gcc gac cgc tgg aac gcg cag384Glu Gly Ala Lys Ile Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gln115 120 125aat ccc gac cgc ccg aag ttc atc gcc ggc tcg atg ggg ccg ctg aac432Asn Pro Asp Arg Pro Lys Phe Ile Ala Gly Ser Met Gly Pro Leu Asn130 135 140gtc atg ctg tcg atg tcg tcg gac gtg aac gat ccg ggc gcg cgc aag480Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys145 150 155 160gtg acc ttc gac cag gtc tac gag gcc tat cgc cag cag gtg gat gcg528Val Thr Phe Asp Gln Val Tyr Glu Ala Tyr Arg Gln Gln Val Asp Ala165 170 175ctt tac cag ggc ggg gtc gat ctc ttc ctg atc gag acc atc acc gac576Leu Tyr Gln Gly Gly Val Asp Leu Phe Leu Ile Glu Thr Ile Thr Asp180 185 190acc ctg aac tgc aag gcc gcg atc aag gcg atc ctg gac tgg cgc gac624Thr Leu Asn Cys Lys Ala Ala Ile Lys Ala Ile Leu Asp Trp Arg Asp195 200 205gag ggc cac gag gag ctg ccg atc tgg atc agc ggc acc atc acc gat672Glu Gly His Glu Glu Leu Pro Ile Trp Ile Ser Gly Thr Ile Thr Asp210 215 220cgc tcg ggc cgc acc ctg tcg ggc cag acg gcc gag gcg ttc tgg aac720Arg Ser Gly Arg Thr Leu Ser Gly Gln Thr Ala Glu Ala Phe Trp Asn225 230 235 240agc gtc aag cac gcc aag ccg ttc gca gtg ggc ttc aac tgc gcc ctg768Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cys Ala Leu245 250 255ggc gcg gat ttg atg cgt ccg cac atc gcc gag atg gcc cgt atc gcc816Gly Ala Asp Leu Met Arg Pro His Ile Ala Glu Met Ala Arg Ile Ala260 265 270gac acc ctg gtc gca gcc tat ccc aac gcc ggc ctg ccc aac gcc atg864Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met275 280 285ggc cag tac gac gag gag ccg cac gag acc ggc cac gcc ctg cac gag912Gly Gln Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu290 295 300tgg gcc aag gac ggc ctc gtc aac atc ctg ggc ggc tgc tgc ggc acg960Trp Ala Lys Asp Gly Leu Val Asn Ile Leu Gly Gly Cys Cys Gly Thr305 310 315 320aca ccg gac cac atc cgt cac gtc gcc gac gag gtg cgc ggc gtg acg1008Thr Pro Asp His Ile Arg His Val Ala Asp Glu Val Arg Gly Val Thr325 330 335ccg cgc cag atc 
ccc gag cgc ccc aag gcc atg cgc ctg gcg ggc ctc1056Pro Arg Gln Ile Pro Glu Arg Pro Lys Ala Met Arg Leu Ala Gly Leu340 345 350
gaa ccg ttc gag ttg gct tag tgg cta cgg ccg caa att ccc ttc tcc1104Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gln Ile Pro Phe Ser355 360 365cct tgc ggg aga agg tgt cgc cga agg cga cgg atg agg ggt ctc gcc1152Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Gly Leu Ala370 375 380ggc cct tca acc gct gtc tcg cgg cgg cga cgt tct tca acc cct cat1200Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His385 390 395 400ccg acc cgc tgc gcg ggc cac ctt ctc ccg caa ggg gag aag gga tga1248Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gln Gly Glu Lys Gly Xaa405 410 415ctg cta ttg gat cct gaa atg cgc ccc gtc ttc gtc aac atc ggt gag1296Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn Ile Gly Glu420 425 430cgc acc aac gtc acc ggc tcg gcc aag ttc aag aag ctg atc gtc gaa1344Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu Ile Val Glu435 440 445ggg aac tat ccc gag gcg ctg tcg gtc gcg cgc cag cag gtc gag gcc1392Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gln Gln Val Glu Ala450 455 460ggg gcc cag gtc atc gac gtg aac atg gac gag ggt ctg ctg gac agc1440Gly Ala Gln Val Ile Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser465 470 475 480cag cag gcc atg gtc acc ttc ctg aat ctg atg gcg gcc gag ccc gac1488Gln Gln Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp485 490 495atc gcg cgc gtg ccg gtg atg atc gac agc tcc aag tgg gag gtg atc1536Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys Trp Glu Val Ile500 505 510gag gcg ggc ctg aag tgc gta caa ggc aag gcg atc gtc aac tcg atc1584Glu Ala Gly Leu Lys Cys Val Gln Gly Lys Ala Ile Val Asn Ser Ile515 520 525agc ctg aag gaa ggc gag gaa aag ttc ctc gaa cag gcc acg ctc tgc1632Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gln Ala Thr Leu Cys530 535 540ctg cgc tat ggc gca gcc gtg gtg gtc atg gcc ttc gac gag gtt ggc1680Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Val Gly545 550 555 560cag gcc gac acc gaa aag cgc aag gtc gag atc tgt acg cgg gcc tac1728Gln Ala Asp Thr Glu Lys Arg Lys Val Glu Ile Cys Thr Arg Ala Tyr565 570 575aac acg ctc 
gtg gac aag gtc ggc ttc ccg ccc gag gac atc atc ttc1776Asn Thr Leu Val Asp Lys Val Gly Phe Pro Pro Glu Asp Ile Ile Phe580 585 590gac ccc aac atc ttc gcc gtg gcg acg ggg atc gag gag cac gac aac1824Asp Pro Asn Ile Phe Ala Val Ala Thr Gly Ile Glu Glu His Asp Asn
595 600 605tac gcc gtc gac ttc atc gag gcc acg cgg cgc atc aag cag atg ttg1872Tyr Ala Val Asp Phe Ile Glu Ala Thr Arg Arg Ile Lys Gln Met Leu610 615 620ccc tat gcg cgg gtg tcg ggc ggg gtg tcg aac gtc tcg ttc agc ttc1920Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe625 630 635 640cgg ggc aat gag ccg gtg cgc cgg gcg atc cac tcg gtg ttc ctg tac1968Arg Gly Asn Glu Pro Val Arg Arg Ala Ile His Ser Val Phe Leu Tyr645 650 655cac gcc atc aac gcc ggc atg gac atg ggc atc gtc aac gcc ggc gac2016His Ala Ile Asn Ala Gly Met Asp Met Gly Ile Val Asn Ala Gly Asp660 665 670ctg ccg gtc tat gac gac atc gat ccg gcc ctg cgc gag gcc gtc gag2064Leu Pro Val Tyr Asp Asp Ile Asp Pro Ala Leu Arg Glu Ala Val Glu675 680 685gac gtg atc ctc aac cgg ccg cag cgc gat ccg gtg atg acc aac acc2112Asp Val Ile Leu Asn Arg Pro Gln Arg Asp Pro Val Met Thr Asn Thr690 695 700gag cgc ctg gtc gag atg gcc ccg cgc tat aag ggc gag aag ggg cag2160Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gln705 710 715 720cag cag gtc gcc aac ctg gag tgg cga aag ggc acg gtg aac gag cgc2208Gln Gln Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asn Glu Arg725 730 735ctg acc cat gct ctc gtt cac ggc atc acc gag ttc atc gag cag gac2256Leu Thr His Ala Leu Val His Gly Ile Thr Glu Phe Ile Glu Gln Asp740 745 750acc gag gag gcg cgc ctg gcc gcc gag cgc ccc ttg cac gtg att gaa2304Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val Ile Glu755 760 765ggc ccg ctg atg gac ggc atg aac gtc gtc ggc gac ctg ttc ggc gcg2352Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala770 775 780ggc aag atg ttc ctg ccc cag gtg gtg aag tcg gcc cgc gtg atg aag2400Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys785 790 795 800cag gcc gtc gcc tgg ctg atg ccg ttc atg gag gcc gag aag gaa ggc2448Gln Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly805 810 815cag gag cgc aag gcc gcc ggc aag gtg ctg atg gcc acc gtc aag ggc2496Gln Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly820 825 
830gac gtc cac gac atc ggt aag aac atc gtc ggc gtc gtg ctg cag tgt2544Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln Cys835 840 845
aac aac tac gag gtc gtg gac ctg ggt gtc atg gtg ccc gcc gac cgc2592Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg850 855 860atc ctg gac gaa gcc aag aag cac aag gtc gac atg atc ggc ctg tcg2640Ile Leu Asp Glu Ala Lys Lys His Lys Val Asp Met Ile Gly Leu Ser865 870 875 880ggc ctg atc acc ccc tcg ctg gac gag atg gtg ttc gtg gcc gcc gag2688Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu885 890 895atg gag cgc cag ggc ttt gat atc ccg ctg ctg atc ggc ggc gcc acc2736Met Glu Arg Gln Gly Phe Asp Ile Pro Leu Leu Ile Gly Gly Ala Thr900 905 910acc agc cgc acc cac acc gcg gtg aag atc gag ccg gcc tat cgc cgg2784Thr Ser Arg Thr His Thr Ala Val Lys Ile Glu Pro Ala Tyr Arg Arg915 920 925ggt ccg acg acc tat gtc gtc gac gcc agc cgc gcc gtg ggc gtg gtc2832Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Gly Val Val930 935 940tcg ggc ctg ctg tcg gaa ggc gag cgt gac cgg atc atc gcc gag acc2880Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg Ile Ile Ala Glu Thr945 950 955 960cgc gcc gag tat gtg aag gtc cgc gag caa tac gcg cgc ggc cag acc2928Arg Ala Glu Tyr Val Lys Val Arg Glu Gln Tyr Ala Arg Gly Gln Thr965 970 975acc aag gcc cgc gcc tcg atc cag gag gcc cgc aag cgc gcc ttc gcc2976Thr Lys Ala Arg Ala Ser Ile Gln Glu Ala Arg Lys Arg Ala Phe Ala980 985 990att gac tgg aag ggc tat gcg ccg ccc aag ccc gcc ttc atc ggc acg3024Ile Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe Ile Gly Thr99510001005cgg gtg ttc gag ccg tcg ctg gcc gag ctg gtc ccg ttc atc gac tgg3072Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe Ile Asp Trp101010151020tcg ccg ttc ttc gcc agc tgg gag ctg atc ggc cgc ttc ccg cag atc3120Ser Pro Phe Phe Ala Ser Trp Glu Leu Ile Gly Arg Phe Pro Gln Ile1025103010351040ctg gag gac gac gtg gtc ggc cag gcc gcc acc gac ctc tac cgc gac3168Leu Glu Asp Asp Val Val Gly Gln Ala Ala Thr Asp Leu Tyr Arg Asp104510501055gcc cgc gcc atg ctg gac aag gtg gtc gag gaa aag tgg ttc ggg gcc3216Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala106010651070aag ggc 
gtg atc ggc ttc tgg ccg gcc cag gcc cag ggc gac gac atc3264Lys Gly Val Ile Gly Phe Trp Pro Ala Gln Ala Gln Gly Asp Asp Ile107510801085gtg ctc tat acc gac gag acc cgc gtg gcc gag ttc tcg cgc ctg cac3312Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His
109010951100acc ctt cgc cag cag atg gac aag ggc gcc gac aag agc ggc gag gcc3360Thr Leu Arg Gln Gln Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala1105111011151120aag gcc aat gtc gcc ctg tcg gac ttc gtc gcg ccg atc ggg cag ggg3408Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro Ile Gly Gln Gly112511301135gct gac tat gtc ggc ggc ttc gcc gtc acc gca ggc cat ggc gag gac3456Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp114011451150gag atc gtc gcc aag ttc aag gcg gcc ggc gac gac tac aac gcc atc3504Glu Ile Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala Ile115511601165atg gcc tcg gcc ctg gcc gac cgc ctg gcc gaa gcc ttc gcc gag tgg3552Met Ala Ser Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Trp117011751180ctg cac tac aaa gcc cgt gtc gag ctg tgg ggc tac gcc gcc gac gag3600Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu1185 11901195 1200gac gcc gac gtc gag cgc ctg atc gcc gaa aag tac cag ggc atc cgc3648Asp Ala Asp Val Glu Arg Leu Ile Ala Glu Lys Tyr Gln Gly Ile Arg120512101215ccc gcg ccc ggc tat ccg gcc cag ccc gac cac acc gag aaa ggt acg3696Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr Glu Lys Gly Thr122012251230ctg ttc aag ctg ctc gac gcc gag gcg gcc acc ggt ctg cag ctg acc3744Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gln Leu Thr123512401245gag agc tac gcc atg acc cct ggc gcg gcg gtc tcc ggc ctg ttc ttc3792Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu Phe Phe125012551260agc cac cgc cag gcg cac tat ttc ggg gtc ggc aag atc gac gcc gac3840Ser His Arg Gln Ala His Tyr Phe Gly Val Gly Lys Ile Asp Ala Asp1265 127012751280cag gtc gag gac tac gcc cgc cgc aag ggc tgg gat atg gag acg gcc3888Gln Val Glu Asp Tyr Ala Arg Arg Lys Gly Trp Asp Met Glu Thr Ala128512901295gag cgc tgg ctg tcg ccg atc ctg aac tac gat ccg cta gcg cgg gcg3936Glu Arg Trp Leu Ser Pro Ile Leu Asn Tyr Asp Pro Leu Ala Arg Ala130013051310cgc ggg gcg gcg gct tag3954Arg Gly Ala Ala Ala1315210402111317212PRT
<213> Caulobacter crescentus
<220>
<221> unsure
<222> 359..359
<223> All occurrences of Xaa denote any amino acid
<220>
<221> unsure
<222> 416..416
<223> All occurrences of Xaa denote any amino acid
<400> 40
Met Thr Asp Leu Ser Ile Arg Ala Asn Arg Val Ala Ala Leu Lys Ala1 5 10 15Ala Ala Lys Glu Arg Ile Leu Ile Leu Asp Gly Ser Trp Gly Val Met20 25 30Phe Gln Lys Lys Gly Leu Thr Glu Ala Asp Tyr Arg Ala Glu Arg Phe35 40 45Ala Ala Tyr Asn Gly Gln Met Lys Gly Asn Asn Asp Ile Leu Cys Leu50 55 60Thr Arg Pro Asp Leu Val Ala Glu Leu His Asp Ala Tyr Phe Ser Ala65 70 75 80Gly Ala Asp Ile Ser Glu Thr Asn Thr Phe Ser Gly Thr Thr Ile Ala85 90 95Gln Ala Asp Tyr His Leu Gly Glu Gln Asp Val Trp Asp Ile Asn Leu100 105 110Glu Gly Ala Lys Ile Gly Arg Ser Val Ala Asp Arg Trp Asn Ala Gln115 120 125Asn Pro Asp Arg Pro Lys Phe Ile Ala Gly Ser Met Gly Pro Leu Asn130 135 140Val Met Leu Ser Met Ser Ser Asp Val Asn Asp Pro Gly Ala Arg Lys145 150 155 160Val Thr Phe Asp Gln Val Tyr Glu Ala Tyr Arg Gln Gln Val Asp Ala165 170 175Leu Tyr Gln Gly Gly Val Asp Leu Phe Leu Ile Glu Thr Ile Thr Asp180 185 190Thr Leu Asn Cys Lys Ala Ala Ile Lys Ala Ile Leu Asp Trp Arg Asp195 200 205Glu Gly His Glu Glu Leu Pro Ile Trp Ile Ser Gly Thr Ile Thr Asp210 215 220Arg Ser Gly Arg Thr Leu Ser Gly Gln Thr Ala Glu Ala Phe Trp Asn225 230 235 240Ser Val Lys His Ala Lys Pro Phe Ala Val Gly Phe Asn Cys Ala Leu245 250 255
Gly Ala Asp Leu Met Arg Pro His Ile Ala Glu Met Ala Arg Ile Ala260 265 270Asp Thr Leu Val Ala Ala Tyr Pro Asn Ala Gly Leu Pro Asn Ala Met275 280 285Gly Gln Tyr Asp Glu Glu Pro His Glu Thr Gly His Ala Leu His Glu290 295 300Trp Ala Lys Asp Gly Leu Val Asn Ile Leu Gly Gly Cys Cys Gly Thr305 310 315 320Thr Pro Asp His Ile Arg His Val Ala Asp Glu Val Arg Gly Val Thr325 330 335Pro Arg Gln Ile Pro Glu Arg Pro Lys Ala Met Arg Leu Ala Gly Leu340 345 350Glu Pro Phe Glu Leu Ala Xaa Trp Leu Arg Pro Gln Ile Pro Phe Ser355 360 365Pro Cys Gly Arg Arg Cys Arg Arg Arg Arg Arg Met Arg Gly Leu Ala370 375 380Gly Pro Ser Thr Ala Val Ser Arg Arg Arg Arg Ser Ser Thr Pro His385 390 395 400Pro Thr Arg Cys Ala Gly His Leu Leu Pro Gln Gly Glu Lys Gly Xaa405 410 415Leu Leu Leu Asp Pro Glu Met Arg Pro Val Phe Val Asn Ile Gly Glu420 425 430Arg Thr Asn Val Thr Gly Ser Ala Lys Phe Lys Lys Leu Ile Val Glu435 440 445Gly Asn Tyr Pro Glu Ala Leu Ser Val Ala Arg Gln Gln Val Glu Ala450 455 460Gly Ala Gln Val Ile Asp Val Asn Met Asp Glu Gly Leu Leu Asp Ser465 470 475 480Gln Gln Ala Met Val Thr Phe Leu Asn Leu Met Ala Ala Glu Pro Asp485 490 495Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys Trp Glu Val Ile500 505 510Glu Ala Gly Leu Lys Cys Val Gln Gly Lys Ala Ile Val Asn Ser Ile515 520 525Ser Leu Lys Glu Gly Glu Glu Lys Phe Leu Glu Gln Ala Thr Leu Cys530 535 540Leu Arg Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu Val Gly545 550 555 560Gln Ala Asp Thr Glu Lys Arg Lys Val Glu Ile Cys Thr Arg Ala Tyr565 570 575Asn Thr Leu Val Asp Lys Val Gly Phe Pro Pro Glu Asp Ile Ile Phe580 585 590
Asp Pro Asn Ile Phe Ala Val Ala Thr Gly Ile Glu Glu His Asp Asn595 600 605Tyr Ala Val Asp Phe Ile Glu Ala Thr Arg Arg Ile Lys Gln Met Leu610 615 620Pro Tyr Ala Arg Val Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe625 630 635 640Arg Gly Asn Glu Pro Val Arg Arg Ala Ile His Ser Val Phe Leu Tyr645 650 655His Ala Ile Asn Ala Gly Met Asp Met Gly Ile Val Asn Ala Gly Asp660 665 670Leu Pro Val Tyr Asp Asp Ile Asp Pro Ala Leu Arg Glu Ala Val Glu675 680 685Asp Val Ile Leu Asn Arg Pro Gln Arg Asp Pro Val Met Thr Asn Thr690 695 700Glu Arg Leu Val Glu Met Ala Pro Arg Tyr Lys Gly Glu Lys Gly Gln705 710 715 720Gln Gln Val Ala Asn Leu Glu Trp Arg Lys Gly Thr Val Asn Glu Arg725 730 735Leu Thr His Ala Leu Val His Gly Ile Thr Glu Phe Ile Glu Gln Asp740 745 750Thr Glu Glu Ala Arg Leu Ala Ala Glu Arg Pro Leu His Val Ile Glu755 760 765Gly Pro Leu Met Asp Gly Met Asn Val Val Gly Asp Leu Phe Gly Ala770 775 780Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met Lys785 790 795 800Gln Ala Val Ala Trp Leu Met Pro Phe Met Glu Ala Glu Lys Glu Gly805 810 815Gln Glu Arg Lys Ala Ala Gly Lys Val Leu Met Ala Thr Val Lys Gly820 825 830Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gln Cys835 840 845Asn Asn Tyr Glu Val Val Asp Leu Gly Val Met Val Pro Ala Asp Arg850 855 860Ile Leu Asp Glu Ala Lys Lys His Lys Val Asp Met Ile Gly Leu Ser865 870 875 880Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val Phe Val Ala Ala Glu885 890 895Met Glu Arg Gln Gly Phe Asp Ile Pro Leu Leu Ile Gly Gly Ala Thr900 905 910Thr Ser Arg Thr His Thr Ala Val Lys Ile Glu Pro Ala Tyr Arg Arg
915 920 925Gly Pro Thr Thr Tyr Val Val Asp Ala Ser Arg Ala Val Gly Val Val930 935 940Ser Gly Leu Leu Ser Glu Gly Glu Arg Asp Arg Ile Ile Ala Glu Thr945 950 955 960Arg Ala Glu Tyr Val Lys Val Arg Glu Gln Tyr Ala Arg Gly Gln Thr965 970 975Thr Lys Ala Arg Ala Ser Ile Gln Glu Ala Arg Lys Arg Ala Phe Ala980 985 990Ile Asp Trp Lys Gly Tyr Ala Pro Pro Lys Pro Ala Phe Ile Gly Thr99510001005Arg Val Phe Glu Pro Ser Leu Ala Glu Leu Val Pro Phe Ile Asp Trp101010151020Ser Pro Phe Phe Ala Ser Trp Glu Leu Ile Gly Arg Phe Pro Gln Ile1025 103010351040Leu Glu Asp Asp Val Val Gly Gln Ala Ala Thr Asp Leu Tyr Arg Asp104510501055Ala Arg Ala Met Leu Asp Lys Val Val Glu Glu Lys Trp Phe Gly Ala10601065 1070Lys Gly Val Ile Gly Phe Trp Pro Ala Gln Ala Gln Gly Asp Asp Ile107510801085Val Leu Tyr Thr Asp Glu Thr Arg Val Ala Glu Phe Ser Arg Leu His109010951100Thr Leu Arg Gln Gln Met Asp Lys Gly Ala Asp Lys Ser Gly Glu Ala1105 111011151120Lys Ala Asn Val Ala Leu Ser Asp Phe Val Ala Pro Ile Gly Gln Gly112511301135Ala Asp Tyr Val Gly Gly Phe Ala Val Thr Ala Gly His Gly Glu Asp114011451150Glu Ile Val Ala Lys Phe Lys Ala Ala Gly Asp Asp Tyr Asn Ala Ile115511601165Met Ala Ser Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu Trp117011751180Leu His Tyr Lys Ala Arg Val Glu Leu Trp Gly Tyr Ala Ala Asp Glu1185 119011951200Asp Ala Asp Val Glu Arg Leu Ile Ala Glu Lys Tyr Gln Gly Ile Arg120512101215Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr Glu Lys Gly Thr122012251230Leu Phe Lys Leu Leu Asp Ala Glu Ala Ala Thr Gly Leu Gln Leu Thr123512401245
Glu Ser Tyr Ala Met Thr Pro Gly Ala Ala Val Ser Gly Leu Phe Phe125012551260Ser His Arg Gln Ala His Tyr Phe Gly Val Gly Lys Ile Asp Ala Asp1265 127012751280Gln Val Glu Asp Tyr Ala Arg Arg Lys Gly Trp Asp Met Glu Thr Ala128512901295Glu Arg Trp Leu Ser Pro Ile Leu Asn Tyr Asp Pro Leu Ala Arg Ala130013051310Arg Gly Ala Ala Ala1315
<210> 41
<211> 3759
<212> DNA
<213> Rhodobacter capsulatus
<220>
<221> CDS
<222> (1)..(3756)
<223> RRC01731
<400> 41
atg ctg acc cag acc ctg ccc cga tct gcg gcc ttt gcc gca att gag48Met Leu Thr Gln Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala Ile Glu1 5 10 15gcg ctt tcg cgc cag cgg atc ttg atc ctt gac ggg gcg atg ggc acg96Ala Leu Ser Arg Gln Arg Ile Leu Ile Leu Asp Gly Ala Met Gly Thr20 25 30cag atc cag cag ctt ggc ctg agc gag gac gat ttt ctg ggc cac ggc144Gln Ile Gln Gln Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly35 40 45tcg ggc tgc gcc tgc cgc cat gcc acc gat cat ccg caa aag ggc aac192Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gln Lys Gly Asn50 55 60aac gac ctg ctg gtg ctg acc cag ccg caa gcg atc gag gag atc cat240Asn Asp Leu Leu Val Leu Thr Gln Pro Gln Ala Ile Glu Glu Ile His65 70 75 80ttc cgc tat gcg atg gcg ggg gcg gat atc gtc gag acg aac acc ttt288Phe Arg Tyr Ala Met Ala Gly Ala Asp Ile Val Glu Thr Asn Thr Phe85 90 95tcg gcc acc acc atc gcg cag gcc gat tac ggg ctg gaa agc gcg gtg336Ser Ala Thr Thr Ile Ala Gln Ala Asp Tyr Gly Leu Glu Ser Ala Val100 105 110ttc gac ctg aac gcc gcg ggg gcg cgg gtg gcg cgg gcg gcg atg gac384Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp115 120 125cgc gcc gag gcc acc gac gga cgg cgc cgc ttc gtt gcg ggg gcg gtg432Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ala Gly Ala Val130 135 140
ggg ccg acg aac cgc acc gcc tcg ctc tcg ccc gat gtg aac gac ccg480Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro145 150 155 160ggc ttt cgc gcc gtc acc ttc gac gat ctg cgc acg gcc tat ggc cag528Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gln165 170 175cag gtg cgc ggt ctg atc gcg ggg ggc gcc gat atc ctg ctg atc gag576Gln Val Arg Gly Leu Ile Ala Gly Gly Ala Asp Ile Leu Leu Ile Glu180 185 190acg atc ttt gac acg ctg aac gcc aag gcg gcg att ttc gcc tgt ttc624Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Cys Phe195 200 205gaa gcc ttt gcc gaa cgg ggc gag cgg ctg ccg gtg atg att tcc ggc672Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met Ile Ser Gly210 215 220acg atc acc gat gcc tcg ggg cgc aca ttg tcg ggg cag acg ccg acc720Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Pro Thr225 230 235 240gcg ttc tgg cat tcg gtg gct cat gcc cgg ccc ttt acc gtg ggg ctg768Ala Phe Trp His Ser Val Ala His Ala Arg Pro Phe Thr Val Gly Leu245 250 255aac tgc gcg ctg ggc gcc agt gcg atg cgt ccg cat ctg gcg gaa ctg816Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu260 265 270gcg ggc gtc gcc ccc tgc gcg atc tgc gcc tat ccc aat gcc ggg ctg864Ala Gly Val Ala Pro Cys Ala Ile Cys Ala Tyr Pro Asn Ala Gly Leu275 280 285ccc aat gcc ttt ggc caa tat gac gaa acc ccc gac cgg acc gcc gcg912Pro Asn Ala Phe Gly Gln Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala290 295 300cag gtg gcc gaa ttt gcc cgc gaa ggg ctg gtc aat gtc gtg ggc ggt960Gln Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly305 310 315 320tgc tgc ggc acc acc ccc gat cac atc cgc gcc atc gcg gaa gcc gtg1008Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Ile Ala Glu Ala Val325 330 335aaa cct ttc ccg ccg agg gcc ctg cca agc cgt tat ctg cgc ctt tcg1056Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser340 345 350ggg ctt gag ccc ttt acc ctg acg ccc gac att ccc ttc gtg aac atc1104Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp Ile Pro Phe Val Asn Ile355 360 365ggc gag cgc acg aat gtc 
acc ggc tcg gcc cgg ttc cgc aag atg atc1152Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met Ile370 375 380gtc gcc cgc gac tat gcc gcc gcg ctg gat gtc gcc cgc gat cag gtg1200
Val Ala Arg Asp Tyr Ala Ala Ala Leu Asp Val Ala Arg Asp Gln Val385 390 395 400gaa aac ggc gcg cag atc ctt gac atc aac atg gac gag ggg ctg atc1248Glu Asn Gly Ala Gln Ile Leu Asp Ile Asn Met Asp Glu Gly Leu Ile405 410 415gac agt cag gcg gcg atg gtc gcc ttc ctc aac ctc ttg gcc gcc gag1296Asp Ser Gln Ala Ala Met Val Ala Phe Leu Asn Leu Leu Ala Ala Glu420 425 430ccc gac att gcc cgg gtg ccg gtg atg atc gac agc tcg aaa tgg gag1344Pro Asp Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys Trp Glu435 440 445gtg atc gag gcc ggg ctg aaa tgc gtg cag ggc aag ccc gtc gtc aat1392Val Ile Glu Ala Gly Leu Lys Cys Val Gln Gly Lys Pro Val Val Asn450 455 460tcg atc agc ctg aag gag ggc gag gag atc ttc cgc cat cac gcg gcg1440Ser Ile Ser Leu Lys Glu Gly Glu Glu Ile Phe Arg His His Ala Ala465 470 475 480ctg tgt ctg gcc tat ggc gcg gcg gtc gtc gtg atg gcc ttt gac gaa1488Leu Cys Leu Ala Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu485 490 495gag ggg cag gcc gac agt ttc gcc cga aag acc agc atc tgc gcc cgc1536Glu Gly Gln Ala Asp Ser Phe Ala Arg Lys Thr Ser Ile Cys Ala Arg500 505 510gcc tat cgc att ctg gtc gag gag atc ggc ttt ccg ccc gaa gac atc1584Ala Tyr Arg Ile Leu Val Glu Glu Ile Gly Phe Pro Pro Glu Asp Ile515 520 525atc ttt gac ccg aac gtc ttt gcc gtc gcc acg ggc atc gaa gaa cac1632Ile Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly Ile Glu Glu His530 535 540gac aat tac ggc gtt gat ttc atc gag gcc gct cgc tgg atc cgg gcc1680Asp Asn Tyr Gly Val Asp Phe Ile Glu Ala Ala Arg Trp Ile Arg Ala545 550 555 560aac ctg ccg cat gcc cat gtc tcg ggc ggg gtg tcg aac ctg tcc ttc1728Asn Leu Pro His Ala His Val Ser Gly Gly Val Ser Asn Leu Ser Phe565 570 575agc ttt cgc ggc aac gaa ccc gtg cgc gcg gcg atg cat gcg gtg ttt1776Ser Phe Arg Gly Asn Glu Pro Val Arg Ala Ala Met His Ala Val Phe580 585 590ctt tac cac gcc atc cgc gcc ggg atg gat atg ggg atc gtc aat gcc1824Leu Tyr His Ala Ile Arg Ala Gly Met Asp Met Gly Ile Val Asn Ala595 600 605ggg cag ctg gtg gtc tat gac cag atc gac ccc gag ctg cgc cag gcc1872Gly Gln Leu 
Val Val Tyr Asp Gln Ile Asp Pro Glu Leu Arg Gln Ala610 615 620tgc gag gat gtg gtg ctc aac cgc cag ccc aaa tcg ggc ggc acc gcg1920Cys Glu Asp Val Val Leu Asn Arg Gln Pro Lys Ser Gly Gly Thr Ala625 630 635 640
acc gag cgg atg ctg gag gtg gcc gag cgc ttc cgc ggc ggc gcg cgc1968Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg645 650 655gag gaa aag acc cgc gat ctg gcc tgg cgc gac tgg ccg gtg gaa aag2016Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys660 665 670cgg ctc gaa cat gcg ctg gtc aat ggc atc acc gaa ttc atc gag gcc2064Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Glu Phe Ile Glu Ala675 680 685gat acc gaa gcc gca agg ctt ctg gcc gaa cgc ccg ctg cat gtg atc2112Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val Ile690 695 700gaa ggg ccg ctg atg gcg ggg atg aat gtc gtc ggt gat ctg ttc ggc2160Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly705 710 715 720gcg ggc aag atg ttc ctg cca cag gtg gtg aaa tcg gcg cgc gtg atg2208Ala Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met725 730 735aaa cag gcc gtc gcc gtt ctg ctg ccc tac atg gat gcc gaa aag gcc2256Lys Gln Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala740 745 750gcg cgc ggc ggc gag ggg cgc gaa acc gcg ggc aag atc ctg atg gcc2304Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys Ile Leu Met Ala755 760 765acg gtc aag ggc gat gtg cat gac atc ggc aag aac atc gtc ggc gtc2352Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val770 775 780gtg ctg gcc tgc aac aat tac gac atc gtc gac ctg ggc gtg atg gtg2400Val Leu Ala Cys Asn Asn Tyr Asp Ile Val Asp Leu Gly Val Met Val785 790 795 800ccg ccg caa aag atc ctg gaa gtg gcg cgg gcc gaa aag gtc gat gcg2448Pro Pro Gln Lys Ile Leu Glu Val Ala Arg Ala Glu Lys Val Asp Ala805 810 815atc ggg ctt tcc ggg ctg atc acg cca agc ctg gac gag atg gtg cat2496Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val His820 825 830ctg gcc gcg gaa atg gag cgc gag ggc ttt gac att ccg ctg ctg atc2544Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp Ile Pro Leu Leu Ile835 840 845ggc ggg gcg acc acg tcg aaa gtg cat acg gcg gtg aag atc gcc ccc2592Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Ile Ala Pro850 855 860gcc tac agc cgc 
ggg cag gcg gtt tat gtg ctc gat gcc agc cgg gcc2640Ala Tyr Ser Arg Gly Gln Ala Val Tyr Val Leu Asp Ala Ser Arg Ala865 870 875 880gtg ggg gtg gtg ggg gcg ctt ttg agc ccg aac cag aag gtc gat tac2688
Val Gly Val Val Gly Ala Leu Leu Ser Pro Asn Gln Lys Val Asp Tyr885 890 895gcg gcg cag atc cgc gcg gac tat gcg cag atc gcc gcc cgt cat gcc2736Ala Ala Gln Ile Arg Ala Asp Tyr Ala Gln Ile Ala Ala Arg His Ala900 905 910cgc gac gag gcc gcc aag gtg cgg ctg cct ttg gcc gcg gcc cgg gcc2784Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala915 920 925aat gcg ctg cgg ctc gac tgg tcg ggc tat gcc gtg ccc gcg ccg caa2832Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gln930 935 940ttc ctt ggc ccg cgc gtg atc gac gac tgg gat ctg gcc gaa gtg gcg2880Phe Leu Gly Pro Arg Val Ile Asp Asp Trp Asp Leu Ala Glu Val Ala945 950 955 960cgg tat atc gac tgg acg ccc ttc ttc cat gcc tgg gaa ttg aag ggg2928Arg Tyr Ile Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly965 970 975gtc tat ccg cgg att ctc gat gac gcc gaa aag ggc gaa gcg gcg cgg2976Val Tyr Pro Arg Ile Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg980 985 990gca ctt ttc gcc gat gcc cag gcg atg ctg gcg cag atc att gcc gaa3024Ala Leu Phe Ala Asp Ala Gln Ala Met Leu Ala Gln Ile Ile Ala Glu99510001005cgc tgg ttc acc ccg cgc gcc gtg gtg ggg ttc tgg ccc gcg cag gcg3072Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gln Ala101010151020gtg ggc gac gat atc cgg ctt tac acc gac gag agc cgg acc gaa gac3120Val Gly Asp Asp Ile Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp1025103010351040ctc gcc act ttc ttc acc ctg cgc cag cag acc ggc aag cgc gaa ggc3168Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Thr Gly Lys Arg Glu Gly104510501055cgc ccg aat gtg gct ttg gcc gat ttc gtc gcg cct gcg ggc acg gtg3216Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val106010651070ccc gat tat ctg ggc ggc ttc gtg gtc acc gcg ggc ccc gag gaa gcc3264Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala107510801085gag atc gcc gcg cgg ttc gaa gct gcc aat gac cat tat tcc gcg atc3312Glu Ile Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala Ile109010951100ctg gtc aag gcg ctg gcc gac cgc ttt gcc gaa gcc ctg gcc gag gcc3360Leu Val Lys 
Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala1105111011151120ctg cat cag cgg gtg cgg cgc gac tat tgg ggc tat gcg ccc gaa gaa3408Leu His Gln Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu112511301135
agc ttc gcc ccc gat cag ctg gtg ggc gag ccc tat cgc ggc atc cgc3456Ser Phe Ala Pro Asp Gln Leu Val Gly Glu Pro Tyr Arg Gly Ile Arg114011451150ccg gcg ccc ggc tat ccg gcc cag ccc gac cac acg gaa aag ctg acg3504Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr Glu Lys Leu Thr115511601165ctg ttc cgg ctg ctt ggg gcc gag gcc gcg acc ggc gtg cat ctg acc3552Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr117011751180gac agc atg gcg atg tgg ccc ggc tct tcg gtc tcg ggg ctc tat atc3600Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr Ile1185119011951200ggc cat ccg gag gcc tat tat ttc ggt ctg gcc cgg atc gag cag gat3648Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg Ile Glu Gln Asp120512101215cag gcc gcc gat tac gcc gcc cgc aag ggc atg gcc ttg gcc gag gtg3696Gln Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val122012251230cag cgc tgg ctg gcc ccg gtg ctg ggg tcg gcc gcg ccc gcc gcc gct3744Gln Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala123512401245gcg gtg gcc gcg tga3759Ala Val Ala Ala1250
<210> 42
<211> 1252
<212> PRT
<213> Rhodobacter capsulatus
<400> 42
Met Leu Thr Gln Thr Leu Pro Arg Ser Ala Ala Phe Ala Ala Ile Glu1 5 10 15Ala Leu Ser Arg Gln Arg Ile Leu Ile Leu Asp Gly Ala Met Gly Thr20 25 30Gln Ile Gln Gln Leu Gly Leu Ser Glu Asp Asp Phe Leu Gly His Gly35 40 45Ser Gly Cys Ala Cys Arg His Ala Thr Asp His Pro Gln Lys Gly Asn50 55 60Asn Asp Leu Leu Val Leu Thr Gln Pro Gln Ala Ile Glu Glu Ile His65 70 75 80Phe Arg Tyr Ala Met Ala Gly Ala Asp Ile Val Glu Thr Asn Thr Phe85 90 95Ser Ala Thr Thr Ile Ala Gln Ala Asp Tyr Gly Leu Glu Ser Ala Val100 105 110Phe Asp Leu Asn Ala Ala Gly Ala Arg Val Ala Arg Ala Ala Met Asp
115 120 125Arg Ala Glu Ala Thr Asp Gly Arg Arg Arg Phe Val Ala Gly Ala Val130 135 140Gly Pro Thr Asn Arg Thr Ala Ser Leu Ser Pro Asp Val Asn Asp Pro145 150 155 160Gly Phe Arg Ala Val Thr Phe Asp Asp Leu Arg Thr Ala Tyr Gly Gln165 170 175Gln Val Arg Gly Leu Ile Ala Gly Gly Ala Asp Ile Leu Leu Ile Glu180 185 190Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile Phe Ala Cys Phe195 200 205Glu Ala Phe Ala Glu Arg Gly Glu Arg Leu Pro Val Met Ile Ser Gly210 215 220Thr Ile Thr Asp Ala Ser Gly Arg Thr Leu Ser Gly Gln Thr Pro Thr225 230 235 240Ala Phe Trp His Ser Val Ala His Ala Arg Pro Phe Thr Val Gly Leu245 250 255Asn Cys Ala Leu Gly Ala Ser Ala Met Arg Pro His Leu Ala Glu Leu260 265 270Ala Gly Val Ala Pro Cys Ala Ile Cys Ala Tyr Pro Asn Ala Gly Leu275 280 285Pro Asn Ala Phe Gly Gln Tyr Asp Glu Thr Pro Asp Arg Thr Ala Ala290 295 300Gln Val Ala Glu Phe Ala Arg Glu Gly Leu Val Asn Val Val Gly Gly305 310 315 320Cys Cys Gly Thr Thr Pro Asp His Ile Arg Ala Ile Ala Glu Ala Val325 330 335Lys Pro Phe Pro Pro Arg Ala Leu Pro Ser Arg Tyr Leu Arg Leu Ser340 345 350Gly Leu Glu Pro Phe Thr Leu Thr Pro Asp Ile Pro Phe Val Asn Ile355 360 365Gly Glu Arg Thr Asn Val Thr Gly Ser Ala Arg Phe Arg Lys Met Ile370 375 380Val Ala Arg Asp Tyr Ala Ala Ala Leu Asp Val Ala Arg Asp Gln Val385 390 395 400Glu Asn Gly Ala Gln Ile Leu Asp Ile Asn Met Asp Glu Gly Leu Ile405 410 415Asp Ser Gln Ala Ala Met Val Ala Phe Leu Asn Leu Leu Ala Ala Glu420 425 430Pro Asp Ile Ala Arg Val Pro Val Met Ile Asp Ser Ser Lys Trp Glu435 440 445
Val Ile Glu Ala Gly Leu Lys Cys Val Gln Gly Lys Pro Val Val Asn450 455 460Ser Ile Ser Leu Lys Glu Gly Glu Glu Ile Phe Arg His His Ala Ala465 470 475 480Leu Cys Leu Ala Tyr Gly Ala Ala Val Val Val Met Ala Phe Asp Glu485 490 495Glu Gly Gln Ala Asp Ser Phe Ala Arg Lys Thr Ser Ile Cys Ala Arg500 505 510Ala Tyr Arg Ile Leu Val Glu Glu Ile Gly Phe Pro Pro Glu Asp Ile515 520 525Ile Phe Asp Pro Asn Val Phe Ala Val Ala Thr Gly Ile Glu Glu His530 535 540Asp Asn Tyr Gly Val Asp Phe Ile Glu Ala Ala Arg Trp Ile Arg Ala545 550 555 560Asn Leu Pro His Ala His Val Ser Gly Gly Val Ser Asn Leu Ser Phe565 570 575Ser Phe Arg Gly Asn Glu Pro Val Arg Ala Ala Met His Ala Val Phe580 585 590Leu Tyr His Ala Ile Arg Ala Gly Met Asp Met Gly Ile Val Asn Ala595 600 605Gly Gln Leu Val Val Tyr Asp Gln Ile Asp Pro Glu Leu Arg Gln Ala610 615 620Cys Glu Asp Val Val Leu Asn Arg Gln Pro Lys Ser Gly Gly Thr Ala625 630 635 640Thr Glu Arg Met Leu Glu Val Ala Glu Arg Phe Arg Gly Gly Ala Arg645 650 655Glu Glu Lys Thr Arg Asp Leu Ala Trp Arg Asp Trp Pro Val Glu Lys660 665 670Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Glu Phe Ile Glu Ala675 680 685Asp Thr Glu Ala Ala Arg Leu Leu Ala Glu Arg Pro Leu His Val Ile690 695 700Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp Leu Phe Gly705 710 715 720Ala Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala Arg Val Met725 730 735Lys Gln Ala Val Ala Val Leu Leu Pro Tyr Met Asp Ala Glu Lys Ala740 745 750Ala Arg Gly Gly Glu Gly Arg Glu Thr Ala Gly Lys Ile Leu Met Ala755 760 765Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Gly Val770 775 780
Val Leu Ala Cys Asn Asn Tyr Asp Ile Val Asp Leu Gly Val Met Val785 790 795 800Pro Pro Gln Lys Ile Leu Glu Val Ala Arg Ala Glu Lys Val Asp Ala805 810 815Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met Val His820 825 830Leu Ala Ala Glu Met Glu Arg Glu Gly Phe Asp Ile Pro Leu Leu Ile835 840 845Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Ile Ala Pro850 855 860Ala Tyr Ser Arg Gly Gln Ala Val Tyr Val Leu Asp Ala Ser Arg Ala865 870 875 880Val Gly Val Val Gly Ala Leu Leu Ser Pro Asn Gln Lys Val Asp Tyr885 890 895Ala Ala Gln Ile Arg Ala Asp Tyr Ala Gln Ile Ala Ala Arg His Ala900 905 910Arg Asp Glu Ala Ala Lys Val Arg Leu Pro Leu Ala Ala Ala Arg Ala915 920 925Asn Ala Leu Arg Leu Asp Trp Ser Gly Tyr Ala Val Pro Ala Pro Gln930 935 940Phe Leu Gly Pro Arg Val Ile Asp Asp Trp Asp Leu Ala Glu Val Ala945 950 955 960Arg Tyr Ile Asp Trp Thr Pro Phe Phe His Ala Trp Glu Leu Lys Gly965 970 975Val Tyr Pro Arg Ile Leu Asp Asp Ala Glu Lys Gly Glu Ala Ala Arg980 985 990Ala Leu Phe Ala Asp Ala Gln Ala Met Leu Ala Gln Ile Ile Ala Glu99510001005Arg Trp Phe Thr Pro Arg Ala Val Val Gly Phe Trp Pro Ala Gln Ala101010151020Val Gly Asp Asp Ile Arg Leu Tyr Thr Asp Glu Ser Arg Thr Glu Asp1025 103010351040Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Thr Gly Lys Arg Glu Gly104510501055Arg Pro Asn Val Ala Leu Ala Asp Phe Val Ala Pro Ala Gly Thr Val106010651070Pro Asp Tyr Leu Gly Gly Phe Val Val Thr Ala Gly Pro Glu Glu Ala107510801085Glu Ile Ala Ala Arg Phe Glu Ala Ala Asn Asp His Tyr Ser Ala Ile109010951100Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Leu Ala Glu Ala
1105 111011151120Leu His Gln Arg Val Arg Arg Asp Tyr Trp Gly Tyr Ala Pro Glu Glu112511301135Ser Phe Ala Pro Asp Gln Leu Val Gly Glu Pro Tyr Arg Gly Ile Arg114011451150Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr Glu Lys Leu Thr115511601165Leu Phe Arg Leu Leu Gly Ala Glu Ala Ala Thr Gly Val His Leu Thr117011751180Asp Ser Met Ala Met Trp Pro Gly Ser Ser Val Ser Gly Leu Tyr Ile1185 119011951200Gly His Pro Glu Ala Tyr Tyr Phe Gly Leu Ala Arg Ile Glu Gln Asp120512101215Gln Ala Ala Asp Tyr Ala Ala Arg Lys Gly Met Ala Leu Ala Glu Val122012251230Gln Arg Trp Leu Ala Pro Val Leu Gly Ser Ala Ala Pro Ala Ala Ala123512401245Ala Val Ala Ala1250
<210> 43
<211> 3798
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(3795)
<223> RHS24705
<400> 43
atg tca ccc gcg ctc caa gac ctg tcg caa ccc gaa ggt ctg aag aaa48Met Ser Pro Ala Leu Gln Asp Leu Ser Gln Pro Glu Gly Leu Lys Lys1 5 10 15acc ctg cgg gat gag atc aat gcc att ctg cag aag agg att atg gtg96Thr Leu Arg Asp Glu Ile Asn Ala Ile Leu Gln Lys Arg Ile Met Val20 25 30ctg gat gga ggg atg ggg acc atg atc cag cgg gag aag cta aac gaa144Leu Asp Gly Gly Met Gly Thr Met Ile Gln Arg Glu Lys Leu Asn Glu35 40 45gaa cac ttc cga ggt cag gaa ttt aaa gat cat gcc agg ccg ctg aaa192Glu His Phe Arg Gly Gln Glu Phe Lys Asp His Ala Arg Pro Leu Lys50 55 60ggc aac aat gac att tta agt ata act cag cct gat gtc att tac caa240Gly Asn Asn Asp Ile Leu Ser Ile Thr Gln Pro Asp Val Ile Tyr Gln65 70 75 80atc cat aag gaa tac ttg ctg gct ggg gca gat atc att gaa aca aat288
Ile His Lys Glu Tyr Leu Leu Ala Gly Ala Asp Ile Ile Glu Thr Asn85 90 95act ttt agc agc act agt att gcc caa gct gac tat ggc ctt gaa cac336Thr Phe Ser Ser Thr Ser Ile Ala Gln Ala Asp Tyr Gly Leu Glu His100 105 110ttg gcc tac cgg atg aac atg tgc tct gca gga gtg gcc aga aaa gct384Leu Ala Tyr Arg Met Asn Met Cys Ser Ala Gly Val Ala Arg Lys Ala115 120 125gcc gag gag gta act ctc cag aca gga att aag agg ttt gtg gca ggg432Ala Glu Glu Val Thr Leu Gln Thr Gly Ile Lys Arg Phe Val Ala Gly130 135 140gct ctg ggt ccg act aat aag aca ctc tct gtg tcc cca tct gtg gaa480Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu145 150 155 160agg ccg gat tat agg aac atc aca ttt gat gag ctt gtt gaa gca tac528Arg Pro Asp Tyr Arg Asn Ile Thr Phe Asp Glu Leu Val Glu Ala Tyr165 170 175caa gag cag gcc aaa gga ctt ctg gat ggc ggg gtt gat atc tta ctc576Gln Glu Gln Ala Lys Gly Leu Leu Asp Gly Gly Val Asp Ile Leu Leu180 185 190att gaa act att ttt gat act gcc aat gcc aag gca gcc ttg ttt gca624Ile Glu Thr Ile Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala195 200 205ctc caa aat ctt ttt gag gag aaa tat gct ccc cgg cct atc ttt att672Leu Gln Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro Ile Phe Ile210 215 220tca ggg acg atc gtt gat aaa agt ggg cgg act ctt tcc gga cag aca720Ser Gly Thr Ile Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gln Thr225 230 235 240gga gag gga ttt gtc atc agc gtg tct cat gga gaa cca ctc tgc att768Gly Glu Gly Phe Val Ile Ser Val Ser His Gly Glu Pro Leu Cys Ile245 250 255gga tta aat tgt gct ttg ggt gca gct gaa atg aga cct ttt att gaa816Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe Ile Glu260 265 270ata att gga aaa tgt aca aca gcc tat gtc ctc tgt tat ccc aat gca864Ile Ile Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala275 280 285ggt ctt ccc aac acc ttt ggt gac tat gat gaa acg cct tct atg atg912Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met290 295 300gcc aag cac cta aag gat ttt gct atg gat ggc ttg gtc aat ata gtt960Ala Lys His Leu Lys Asp Phe Ala 
Met Asp Gly Leu Val Asn Ile Val305 310 315 320gga gga tgc tgt ggg tca aca cca gat cat atc agg gaa att gct gaa1008Gly Gly Cys Cys Gly Ser Thr Pro Asp His Ile Arg Glu Ile Ala Glu325 330 335
gct gtg aaa aat tgt aag cct aga gtt cca cct gcc act gct ttt gaa1056Ala Val Lys Asn Cys Lys Pro Arg Val Pro Pro Ala Thr Ala Phe Glu340 345 350gga cat atg tta ctg tct ggt cta gag ccc ttc agg att gga ccg tac1104Gly His Met Leu Leu Ser Gly Leu Glu Pro Phe Arg Ile Gly Pro Tyr355 360 365acc aac ttt gtt aac att gga gag cgc tgt aat gtt gca gga tca agg1152Thr Asn Phe Val Asn Ile Gly Glu Arg Cys Asn Val Ala Gly Ser Arg370 375 380aag ttt gct aaa ctc atc atg gca gga aac tat gaa gaa gcc ttg tgt1200Lys Phe Ala Lys Leu Ile Met Ala Gly Asn Tyr Glu Glu Ala Leu Cys385 390 395 400gtt gcc aaa gtg cag gtg gaa atg gga gcc cag gtg ttg gat gtc aac1248Val Ala Lys Val Gln Val Glu Met Gly Ala Gln Val Leu Asp Val Asn405 410 415atg gat gat ggc atg cta gat ggt cca agt gca atg acc aga ttt tgc1296Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cys420 425 430aac tta att gct tcc gag cca gac atc gca aag gta cct ttg tgc atc1344Asn Leu Ile Ala Ser Glu Pro Asp Ile Ala Lys Val Pro Leu Cys Ile435 440 445gac tcc tcc aat ttt gct gtg att gaa gct ggg tta aag tgc tgc caa1392Asp Ser Ser Asn Phe Ala Val Ile Glu Ala Gly Leu Lys Cys Cys Gln450 455 460ggg aag tgc att gtc aat agc att agt ctg aag gaa gga gag gac gac1440Gly Lys Cys Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Asp Asp465 470 475 480ttc ttg gag aag gcc agg aag att aaa aag tat gga gct gct atg gtg1488Phe Leu Glu Lys Ala Arg Lys Ile Lys Lys Tyr Gly Ala Ala Met Val485 490 495gtc atg gct ttt gat gaa gaa gga cag gca aca gaa aca gac aca aaa1536Val Met Ala Phe Asp Glu Glu Gly Gln Ala Thr Glu Thr Asp Thr Lys500 505 510atc aga gtg tgc acc cgg gcc tac cat ctg ctt gtg aaa aaa ctg ggc1584Ile Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys Lys Leu Gly515 520 525ttt aat cca aat gac att att ttt gac cct aat atc cta acc att ggg1632Phe Asn Pro Asn Asp Ile Ile Phe Asp Pro Asn Ile Leu Thr Ile Gly530 535 540act gga atg gag gaa cac aac ttg tat gcc att aat ttt atc cat gca1680Thr Gly Met Glu Glu His Asn Leu Tyr Ala Ile Asn Phe Ile His Ala545 550 555 560aca aaa gtc 
att aaa gaa aca tta cct gga gcc aga ata agt gga ggt1728Thr Lys Val Ile Lys Glu Thr Leu Pro Gly Ala Arg Ile Ser Gly Gly565 570 575ctt tcc aac ttg tcc ttc tcc ttc cga gga atg gaa gcc att cga gaa1776
Leu Ser Asn Leu Ser Phe Ser Phe Arg Gly Met Glu Ala Ile Arg Glu580 585 590gca atg cat ggg gtt ttc ctt tac cat gca atc aag tct ggc atg gac1824Ala Met His Gly Val Phe Leu Tyr His Ala Ile Lys Ser Gly Met Asp595 600 605atg ggg ata gtg aat gct gga aac ctc cct gtg tat gat gat atc cat1872Met Gly Ile Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp Ile His610 615 620aag gaa ctt ctg cag ctc tgt gaa gat ctc atc tgg aat aaa gac cct1920Lys Glu Leu Leu Gln Leu Cys Glu Asp Leu Ile Trp Asn Lys Asp Pro625 630 635 640gag gcc act gag aag ctc tta cgt tat gcc cag act caa ggc aca gga1968Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gln Thr Gln Gly Thr Gly645 650 655ggg aag aaa gtc att cag act gat gag tgg aga aat ggc cct gtc gaa2016Gly Lys Lys Val Ile Gln Thr Asp Glu Trp Arg Asn Gly Pro Val Glu660 665 670gaa cgc ctt gag tat gcc ctt gtg aag ggc att gaa aaa cat att att2064Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly Ile Glu Lys His Ile Ile675 680 685gag gat act gag gaa gcc agg tta aac caa aaa aaa tat ccc cga cct2112Glu Asp Thr Glu Glu Ala Arg Leu Asn Gln Lys Lys Tyr Pro Arg Pro690 695 700ctc aat ata att gaa gga ccc ctg atg aat gga atg aaa att gtt ggt2160Leu Asn Ile Ile Glu Gly Pro Leu Met Asn Gly Met Lys Ile Val Gly705 710 715 720gat ctt ttt gga gct gga aaa atg ttt cta cct cag gtt ata aag tca2208Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln Val Ile Lys Ser725 730 735gcc cgg gtt atg aag aag gct gtt ggc cac ctt atc cct ttc atg gaa2256Ala Arg Val Met Lys Lys Ala Val Gly His Leu Ile Pro Phe Met Glu740 745 750aaa gaa aga gaa gaa acc aga gtg ctt aac ggc aca gta gaa gaa gag2304Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu755 760 765gac cct tac cag ggc acc atc gtg ctg gcc act gtt aaa ggc gac gtg2352Asp Pro Tyr Gln Gly Thr Ile Val Leu Ala Thr Val Lys Gly Asp Val770 775 780cac gac ata ggc aag aac ata gtt gga gta gtc ctt ggc tgc aat aat2400His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gly Cys Asn Asn785 790 795 800ttc cga gtt att gat tta gga gtc atg act cca tgt gat aag ata ctg2448Phe Arg Val 
Ile Asp Leu Gly Val Met Thr Pro Cys Asp Lys Ile Leu805 810 815aaa gct gct ctt gac cac aaa gca gat ata att ggc ctg tca gga ctc2496Lys Ala Ala Leu Asp His Lys Ala Asp Ile Ile Gly Leu Ser Gly Leu820 825 830
atc act cct tcc ctg gat gaa atg att ttt gtt gcc aag gaa atg gag2544Ile Thr Pro Ser Leu Asp Glu Met Ile Phe Val Ala Lys Glu Met Glu835 840 845aga tta gct ata agg att cca ttg ttg att gga gga gca acc act tca2592Arg Leu Ala Ile Arg Ile Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser850 855 860aaa acc cac aca gca gtt aaa ata gct ccg aga tac agt gca cct gta2640Lys Thr His Thr Ala Val Lys Ile Ala Pro Arg Tyr Ser Ala Pro Val865 870 875 880atc cat gtc ctg gac gcg tcc aag agt gtg gtg gtg tgt tcc cag ctg2688Ile His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gln Leu885 890 895tta gat gaa aat cta aag gat gaa tac ttt gag gaa atc atg gaa gaa2736Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu Ile Met Glu Glu900 905 910tat gaa gat att aga cag gac cat tat gag tct ctc aag gag agg aga2784Tyr Glu Asp Ile Arg Gln Asp His Tyr Glu Ser Leu Lys Glu Arg Arg915 920 925tac tta ccc tta agt caa gcc aga aaa agt ggt ttc caa atg gat tgg2832Tyr Leu Pro Leu Ser Gln Ala Arg Lys Ser Gly Phe Gln Met Asp Trp930 935 940ctg tct gaa cct cac cca gtg aag ccc acg ttt att ggg acc cag gtc2880Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe Ile Gly Thr Gln Val945 950 955 960ttt gaa gac tat gac ctg cag aag ctg gtg gac tac att gac tgg aag2928Phe Glu Asp Tyr Asp Leu Gln Lys Leu Val Asp Tyr Ile Asp Trp Lys965 970 975cct ttc ttt gat gtc tgg cag ctc cgg ggc aag tac ccg aat cga ggc2976Pro Phe Phe Asp Val Trp Gln Leu Arg Gly Lys Tyr Pro Asn Arg Gly980 985 990ttt ccc aag ata ttt aac gac aaa aca gta ggt gga gag gcc agg aag3024Phe Pro Lys Ile Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys99510001005gtc tac gat gat gcc cac aat atg ctg aac aca ctg att agt caa aag3072Val Tyr Asp Asp Ala His Asn Met Leu Asn Thr Leu Ile Ser Gln Lys101010151020aaa ctc cgg gcc cgg ggt gtg gtt ggg ttc tgg cca gca cag agt atc3120Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gln Ser Ile1025103010351040caa gac gac att cac ctg tac gcg gag gct gct gtg ccc cag gct gca3168Gln Asp Asp Ile His Leu Tyr Ala Glu Ala Ala Val Pro Gln Ala Ala104510501055gag ccc 
ata gcc acc ttc tat ggg tta agg caa cag gct gag aag gac3216Glu Pro Ile Ala Thr Phe Tyr Gly Leu Arg Gln Gln Ala Glu Lys Asp106010651070tct gcc agc acg gag cca tac tac tgc ctc tca gac ttc atc gct ccc3264
Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe Ile Ala Pro107510801085ttg cat tct ggc atc cgt gac tac ctg ggc ctg ttt gcc gtt gcc tgc3312Leu His Ser Gly Ile Arg Asp Tyr Leu Gly Leu Phe Ala Val Ala Cys109010951100ttt ggg gta gaa gag ctg agc aag gcc tat gag gat gat ggt gac gac3360Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Asp1105111011151120tac agc agc atc atg gtc aag gcg ctg ggg gac cgg ctg gca gag gcc3408Tyr Ser Ser Ile Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala112511301135ttt gca gaa gag ctc cat gaa aga gtt cgc cga gaa ctg tgg gcc tac3456Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr114011451150tgt ggc agt gag cag ctg gac gtc gca gac ctg cgc agg ctg cgg tac3504Cys Gly Ser Glu Gln Leu Asp Val Ala Asp Leu Arg Arg Leu Arg Tyr115511601165aag ggc atc cgc ccg gct cct ggc tac ccc agc cag ccc gac cac acc3552Lys Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ser Gln Pro Asp His Thr117011751180gag aag ctc acc atg tgg aga ctt gca gac atc gag cag tct aca ggc3600Glu Lys Leu Thr Met Trp Arg Leu Ala Asp Ile Glu Gln Ser Thr Gly1185119011951200att agg tta aca gaa tca tta gca atg gca cct gct tca gca gtc tca3648Ile Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser120512101215ggc ctc tac ttc tcc aat ttg aag tcc aaa tat ttt gct gtg ggg aag3696Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys122012251230att tcc aag gat cag gtt gag gat tat gca ttg agg aag aac ata tct3744Ile Ser Lys Asp Gln Val Glu Asp Tyr Ala Leu Arg Lys Asn Ile Ser123512401245gtg gct gag gtt gag aaa tgg ctt gga ccc att ttg gga tat gat aca3792Val Ala Glu Val Glu Lys Trp Leu Gly Pro Ile Leu Gly Tyr Asp Thr125012551260gac taa3798Asp1265
<210> 44
<211> 1265
<212> PRT
<213> Homo sapiens
<400> 44
Met Ser Pro Ala Leu Gln Asp Leu Ser Gln Pro Glu Gly Leu Lys Lys1 5 10 15Thr Leu Arg Asp Glu Ile Asn Ala Ile Leu Gln Lys Arg Ile Met Val
20 25 30Leu Asp Gly Gly Met Gly Thr Met Ile Gln Arg Glu Lys Leu Asn Glu35 40 45Glu His Phe Arg Gly Gln Glu Phe Lys Asp His Ala Arg Pro Leu Lys50 55 60Gly Asn Asn Asp Ile Leu Ser Ile Thr Gln Pro Asp Val Ile Tyr Gln65 70 75 80Ile His Lys Glu Tyr Leu Leu Ala Gly Ala Asp Ile Ile Glu Thr Asn85 90 95Thr Phe Ser Ser Thr Ser Ile Ala Gln Ala Asp Tyr Gly Leu Glu His100 105 110Leu Ala Tyr Arg Met Asn Met Cys Ser Ala Gly Val Ala Arg Lys Ala115 120 125Ala Glu Glu Val Thr Leu Gln Thr Gly Ile Lys Arg Phe Val Ala Gly130 135 140Ala Leu Gly Pro Thr Asn Lys Thr Leu Ser Val Ser Pro Ser Val Glu145 150 155 160Arg Pro Asp Tyr Arg Asn Ile Thr Phe Asp Glu Leu Val Glu Ala Tyr165 170 175Gln Glu Gln Ala Lys Gly Leu Leu Asp Gly Gly Val Asp Ile Leu Leu180 185 190Ile Glu Thr Ile Phe Asp Thr Ala Asn Ala Lys Ala Ala Leu Phe Ala195 200 205Leu Gln Asn Leu Phe Glu Glu Lys Tyr Ala Pro Arg Pro Ile Phe Ile210 215 220Ser Gly Thr Ile Val Asp Lys Ser Gly Arg Thr Leu Ser Gly Gln Thr225 230 235 240Gly Glu Gly Phe Val Ile Ser Val Ser His Gly Glu Pro Leu Cys Ile245 250 255Gly Leu Asn Cys Ala Leu Gly Ala Ala Glu Met Arg Pro Phe Ile Glu260 265 270Ile Ile Gly Lys Cys Thr Thr Ala Tyr Val Leu Cys Tyr Pro Asn Ala275 280 285Gly Leu Pro Asn Thr Phe Gly Asp Tyr Asp Glu Thr Pro Ser Met Met290 295 300Ala Lys His Leu Lys Asp Phe Ala Met Asp Gly Leu Val Asn Ile Val305 310 315 320Gly Gly Cys Cys Gly Ser Thr Pro Asp His Ile Arg Glu Ile Ala Glu325 330 335Ala Val Lys Asn Cys Lys Pro Arg Val Pro Pro Ala Thr Ala Phe Glu340 345 350
Gly His Met Leu Leu Ser Gly Leu Glu Pro Phe Arg Ile Gly Pro Tyr355 360 365Thr Asn Phe Val Asn Ile Gly Glu Arg Cys Asn Val Ala Gly Ser Arg370 375 380Lys Phe Ala Lys Leu Ile Met Ala Gly Asn Tyr Glu Glu Ala Leu Cys385 390 395 400Val Ala Lys Val Gln Val Glu Met Gly Ala Gln Val Leu Asp Val Asn405 410 415Met Asp Asp Gly Met Leu Asp Gly Pro Ser Ala Met Thr Arg Phe Cys420 425 430Asn Leu Ile Ala Ser Glu Pro Asp Ile Ala Lys Val Pro Leu Cys Ile435 440 445Asp Ser Ser Asn Phe Ala Val Ile Glu Ala Gly Leu Lys Cys Cys Gln450 455 460Gly Lys Cys Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Asp Asp465 470 475 480Phe Leu Glu Lys Ala Arg Lys Ile Lys Lys Tyr Gly Ala Ala Met Val485 490 495Val Met Ala Phe Asp Glu Glu Gly Gln Ala Thr Glu Thr Asp Thr Lys500 505 510Ile Arg Val Cys Thr Arg Ala Tyr His Leu Leu Val Lys Lys Leu Gly515 520 525Phe Asn Pro Asn Asp Ile Ile Phe Asp Pro Asn Ile Leu Thr Ile Gly530 535 540Thr Gly Met Glu Glu His Asn Leu Tyr Ala Ile Asn Phe Ile His Ala545 550 555 560Thr Lys Val Ile Lys Glu Thr Leu Pro Gly Ala Arg Ile Ser Gly Gly565 570 575Leu Ser Asn Leu Ser Phe Ser Phe Arg Gly Met Glu Ala Ile Arg Glu580 585 590Ala Met His Gly Val Phe Leu Tyr His Ala Ile Lys Ser Gly Met Asp595 600 605Met Gly Ile Val Asn Ala Gly Asn Leu Pro Val Tyr Asp Asp Ile His610 615 620Lys Glu Leu Leu Gln Leu Cys Glu Asp Leu Ile Trp Asn Lys Asp Pro625 630 635 640Glu Ala Thr Glu Lys Leu Leu Arg Tyr Ala Gln Thr Gln Gly Thr Gly645 650 655Gly Lys Lys Val Ile Gln Thr Asp Glu Trp Arg Asn Gly Pro Val Glu660 665 670Glu Arg Leu Glu Tyr Ala Leu Val Lys Gly Ile Glu Lys His Ile Ile675 680 685
Glu Asp Thr Glu Glu Ala Arg Leu Asn Gln Lys Lys Tyr Pro Arg Pro690 695 700Leu Asn Ile Ile Glu Gly Pro Leu Met Asn Gly Met Lys Ile Val Gly705 710 715 720Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln Val Ile Lys Ser725 730 735Ala Arg Val Met Lys Lys Ala Val Gly His Leu Ile Pro Phe Met Glu740 745 750Lys Glu Arg Glu Glu Thr Arg Val Leu Asn Gly Thr Val Glu Glu Glu755 760 765Asp Pro Tyr Gln Gly Thr Ile Val Leu Ala Thr Val Lys Gly Asp Val770 775 780His Asp Ile Gly Lys Asn Ile Val Gly Val Val Leu Gly Cys Asn Asn785 790 795 800Phe Arg Val Ile Asp Leu Gly Val Met Thr Pro Cys Asp Lys Ile Leu805 810 815Lys Ala Ala Leu Asp His Lys Ala Asp Ile Ile Gly Leu Ser Gly Leu820 825 830Ile Thr Pro Ser Leu Asp Glu Met Ile Phe Val Ala Lys Glu Met Glu835 840 845Arg Leu Ala Ile Arg Ile Pro Leu Leu Ile Gly Gly Ala Thr Thr Ser850 855 860Lys Thr His Thr Ala Val Lys Ile Ala Pro Arg Tyr Ser Ala Pro Val865 870 875 880Ile His Val Leu Asp Ala Ser Lys Ser Val Val Val Cys Ser Gln Leu885 890 895Leu Asp Glu Asn Leu Lys Asp Glu Tyr Phe Glu Glu Ile Met Glu Glu900 905 910Tyr Glu Asp Ile Arg Gln Asp His Tyr Glu Ser Leu Lys Glu Arg Arg915 920 925Tyr Leu Pro Leu Ser Gln Ala Arg Lys Ser Gly Phe Gln Met Asp Trp930 935 940Leu Ser Glu Pro His Pro Val Lys Pro Thr Phe Ile Gly Thr Gln Val945 950 955 960Phe Glu Asp Tyr Asp Leu Gln Lys Leu Val Asp Tyr Ile Asp Trp Lys965 970 975Pro Phe Phe Asp Val Trp Gln Leu Arg Gly Lys Tyr Pro Asn Arg Gly980 985 990Phe Pro Lys Ile Phe Asn Asp Lys Thr Val Gly Gly Glu Ala Arg Lys99510001005Val Tyr Asp Asp Ala His Asn Met Leu Asn Thr Leu Ile Ser Gln Lys
1010 10151020Lys Leu Arg Ala Arg Gly Val Val Gly Phe Trp Pro Ala Gln Ser Ile1025 103010351040Gln Asp Asp Ile His Leu Tyr Ala Glu Ala Ala Val Pro Gln Ala Ala104510501055Glu Pro Ile Ala Thr Phe Tyr Gly Leu Arg Gln Gln Ala Glu Lys Asp106010651070Ser Ala Ser Thr Glu Pro Tyr Tyr Cys Leu Ser Asp Phe Ile Ala Pro107510801085Leu His Ser Gly Ile Arg Asp Tyr Leu Gly Leu Phe Ala Val Ala Cys109010951100Phe Gly Val Glu Glu Leu Ser Lys Ala Tyr Glu Asp Asp Gly Asp Asp1105 111011151120Tyr Ser Ser Ile Met Val Lys Ala Leu Gly Asp Arg Leu Ala Glu Ala112511301135Phe Ala Glu Glu Leu His Glu Arg Val Arg Arg Glu Leu Trp Ala Tyr114011451150Cys Gly Ser Glu Gln Leu Asp Val Ala Asp Leu Arg Arg Leu Arg Tyr115511601165Lys Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ser Gln Pro Asp His Thr117011751180Glu Lys Leu Thr Met Trp Arg Leu Ala Asp Ile Glu Gln Ser Thr Gly1185 119011951200Ile Arg Leu Thr Glu Ser Leu Ala Met Ala Pro Ala Ser Ala Val Ser120512101215Gly Leu Tyr Phe Ser Asn Leu Lys Ser Lys Tyr Phe Ala Val Gly Lys122012251230Ile Ser Lys Asp Gln Val Glu Asp Tyr Ala Leu Arg Lys Asn Ile Ser123512401245Val Ala Glu Val Glu Lys Trp Leu Gly Pro Ile Leu Gly Tyr Asp Thr125012551260Asp1265210452113681212DNA213費氏弧菌(Vibrio fisheri)220
<221> CDS <222> (1)..(3678) <223> AB039955 <400> 45
gtg gca gga agc aat ata aaa gta caa ata gaa aag caa ctt tca gag48Val Ala Gly Ser Asn Ile Lys Val Gln Ile Glu Lys Gln Leu Ser Glu1 5 10 15cga att tta ttg att gat ggt ggt atg ggc acc atg att caa ggt tat96Arg Ile Leu Leu Ile Asp Gly Gly Met Gly Thr Met Ile Gln Gly Tyr20 25 30aag ttt gaa gag aaa gat tat aga ggg gga cgc ttt aat caa tgg cat144Lys Phe Glu Glu Lys Asp Tyr Arg Gly Gly Arg Phe Asn Gln Trp His35 40 45tgt gat ctt aaa ggt aac aat gat tta tta gtt ctt tca caa cca caa192Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gln Pro Gln50 55 60att ata aga gat ata cac gaa gcc tat tta gaa gct ggt gct gat atc240Ile Ile Arg Asp Ile His Glu Ala Tyr Leu Glu Ala Gly Ala Asp Ile65 70 75 80ctt gaa act aat acc ttt aat gca aca act att gct atg gct gat tat288Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr Ile Ala Met Ala Asp Tyr85 90 95gat atg gaa agc ctt agt gaa gag att aac ttt gaa gca gca aag ctt336Asp Met Glu Ser Leu Ser Glu Glu Ile Asn Phe Glu Ala Ala Lys Leu100 105 110gct cgt gaa gtt gca gat aaa tgg aca gaa aaa aca cca aac aaa cct384Ala Arg Glu Val Ala Asp Lys Trp Thr Glu Lys Thr Pro Asn Lys Pro115 120 125cgc tat gta gca gga gtg ctt gga cca aca aat cga act tgt tct att432Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser Ile130 135 140tct cca gac gta aat gac cct ggc ttt cgt aat gta tcg ttt gat gaa480Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu145 150 155 160tta gtc gaa gct tat tca gag tca act cga gca ctt att aga ggt ggt528Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu Ile Arg Gly Gly165 170 175tca gat ctt atc ctc atc gaa act ata ttt gat aca tta aat gct aaa576Ser Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys180 185 190gcg tgt tct ttt gct gtt gaa tct gtt ttt gaa gag ctt ggt att act624Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly Ile Thr195 200 205ttg cct gtt atg att tca ggg acc att acc gat gca tca gga aga aca672Leu Pro Val Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr210 215 220tta tcg ggg caa aca aca gaa gct ttt tat aat gca tta 
aga cat gta720Leu Ser Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val225 230 235 240aaa cct att tct ttt ggt ctt aac tgt gca ctt ggt cct gat gaa tta768Lys Pro Ile Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu
245 250 255cgt gaa tat gta agc gag ctt tca cgt att tct gaa tgt tat gtt tct816Arg Glu Tyr Val Ser Glu Leu Ser Arg Ile Ser Glu Cys Tyr Val Ser260 265 270gcg cac cca aac gct ggt ttg cct aat gca ttt ggt gag tat gat tta864Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu275 280 285tct ccc gaa gat atg gct gag cat gtt gcg gaa tgg gca agc agc gga912Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly290 295 300ttt tta aat ctt att ggt ggg tgt tgt ggc acc act cct gaa cat att960Phe Leu Asn Leu Ile Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile305 310 315 320cgt caa atg gct tta gtt gtt gaa ggt gtg aaa cct cga caa tta cct1008Arg Gln Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gln Leu Pro325 330 335gaa tta ccc gta gct tgt cgt ctt tcc gga tta gag cct tta aca ata1056Glu Leu Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr Ile340 345 350gaa aaa gat tct ttg ttt att aat gtt ggt gaa cgt aca aat gtt act1104Glu Lys Asp Ser Leu Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr355 360 365gga tct gca cgt ttt aaa cgc tta att aaa gaa gag ctt tat gac gaa1152Gly Ser Ala Arg Phe Lys Arg Leu Ile Lys Glu Glu Leu Tyr Asp Glu370 375 380gca cta agt gtt gct caa gag caa gtt gaa aac ggt gct caa att atc1200Ala Leu Ser Val Ala Gln Glu Gln Val Glu Asn Gly Ala Gln Ile Ile385 390 395 400gat atc aac atg gat gaa ggc atg ctt gat gct gaa gca tgt atg gtt1248Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val405 410 415cgt ttt tta aat ctt tgt gca tca gaa cct gaa ata tct aaa gta cca1296Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu Ile Ser Lys Val Pro420 425 430gtg atg gtt gat tct tct aaa tgg gaa gta att gaa gct gga tta aag1344Val Met Val Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys435 440 445tgt att caa ggt aag ggg ata gtt aat tca atc tct tta aag gaa ggc1392Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Leu Lys Glu Gly450 455 460aaa gaa aag ttt gta cat caa gcc aag tta ata cgt cgt tat ggt gct1440Lys Glu Lys Phe Val His Gln Ala Lys Leu Ile Arg Arg Tyr Gly Ala465 470 475 480gca 
gtg atc gtt atg gct ttt gat gaa gtt ggc caa gcg gac act cgg1488Ala Val Ile Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Arg485 490 495
gag cgt aaa att gaa att tgt acc aat gcc tac aat att tta gtt gat1536Glu Arg Lys Ile Glu Ile Cys Thr Asn Ala Tyr Asn Ile Leu Val Asp500 505 510gaa gtt ggc ttc cca cct gaa gat att att ttt gac cct aat att ttt1584Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe515 520 525gcg gtt gct aca ggt atc gat gaa cat aat aac tat gca gta gac ttt1632Ala Val Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Val Asp Phe530 535 540att gaa gcc gtt ggt gat ata aag cga acg ctt cct cat gca atg att1680Ile Glu Ala Val Gly Asp Ile Lys Arg Thr Leu Pro His Ala Met Ile545 550 555 560tca ggt ggt gtt tct aac gtc tct ttt tct ttc cgt gga aat aac tac1728Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr565 570 575gtt cgt gaa gct atc cat gcc gta ttt tta tat cac tgt ttt aaa aat1776Val Arg Glu Ala Ile His Ala Val Phe Leu Tyr His Cys Phe Lys Asn580 585 590ggt atg gat atg ggc atc gta aat gcg ggg cag ctg gaa ata tat gat1824Gly Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr Asp595 600 605aac gta cca gaa gat ctg cgt gaa gcg gtt gaa gat gtg gta ttg aat1872Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn610 615 620cgt cga gat gat tct acg gag cgt tta ctt gat att gca act gag tat1920Arg Arg Asp Asp Ser Thr Glu Arg Leu Leu Asp Ile Ala Thr Glu Tyr625 630 635 640tta gaa cga gct gtt ggt aaa gtt gaa gat aaa tct gct tta gag tgg1968Leu Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp645 650 655cgt gac tgg cct gtt gaa aaa cgt ctt gag cat tct cta gtg aag ggg 2016Arg Asp Trp Pro Val Glu Lys Arg Leu Glu His Ser Leu Val Lys Gly660 665 670ata aca gag ttt att gtc gaa gat aca gaa gaa gca cga atc aat gca2064Ile Thr Glu Phe Ile Val Glu Asp Thr Glu Glu Ala Arg Ile Asn Ala675 680 685gaa aga cca ata gag gta att gaa ggg cca ttg atg gac gga atg aac2112Glu Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn690 695 700gtc gtt ggt gat ctt ttt ggg gaa gga aaa atg ttc ctt ccc caa gta2160Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val705 710 715 720gta aag tct 
gct cgt gta atg aaa caa gct gtt gct cat tta gaa ccg2208Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Glu Pro725 730 735ttt att aat gcg tct aaa gaa gtt gga gca aca aac ggt aaa ata ctt2256Phe Ile Asn Ala Ser Lys Glu Val Gly Ala Thr Asn Gly Lys Ile Leu
740 745 750tta gca aca gta aaa ggt gat gtt cat gat att ggt aag aat atc gtt2304Leu Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val755 760 765ggc gtg gtt tta cag tgt aat aac tat gaa ata att gat ctt ggt gtc2352Gly Val Val Leu Gln Cys Asn Asn Tyr Glu Ile Ile Asp Leu Gly Val770 775 780atg gtc tct tgt gaa act atc tta aaa gta gcc aaa gaa gaa aat gta2400Met Val Ser Cys Glu Thr Ile Leu Lys Val Ala Lys Glu Glu Asn Val785 790 795 800gac atc att ggt tta tct gga tta ata aca cca tca tta gat gaa atg2448Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met805 810 815gtc cat gtt gct aaa gag atg gaa cga caa ggg ttt gat tta cca ttg2496Val His Val Ala Lys Glu Met Glu Arg Gln Gly Phe Asp Leu Pro Leu820 825 830ttg att ggt gga gca aca act tca aaa gca cat aca gcg gta aaa att2544Leu Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile835 840 845gaa caa aac tat tct caa cct gtt gtg tac gtt aat aat gct tct cga2592Glu Gln Asn Tyr Ser Gln Pro Val Val Tyr Val Asn Asn Ala Ser Arg850 855 860gct gta ggt gta tgt act tca tta ctt tca aat gaa cta aaa cct tct2640Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lys Pro Ser865 870 875 880ttt gtt gag aag cta gat att gat tac gaa cgt gtt aga gag cag cat2688Phe Val Glu Lys Leu Asp Ile Asp Tyr Glu Arg Val Arg Glu Gln His885 890 895agt cgt aaa caa ccg cga act aag cct gtg act tta gag gtt gct cga2736Ser Arg Lys Gln Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg900 905 910gcg aat aaa gtc gct att gac tgg gct tct tat aca cct cct gtc cca2784Ala Asn Lys Val Ala Ile Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro915 920 925cta aag cct ggt gta cat ata ttt gat aac ttt gat gtt tca aca ttg2832Leu Lys Pro Gly Val His Ile Phe Asp Asn Phe Asp Val Ser Thr Leu930 935 940cgt aat tat att gat tgg acc cca ttt ttt atg acg tgg tct ctt gtt2880Arg Asn Tyr Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val945 950 955 960gga aaa tac ccg aag atc tta gag cat gaa gaa gtt ggt gaa gaa gcc2928Gly Lys Tyr Pro Lys Ile Leu Glu His Glu Glu Val Gly Glu Glu Ala965 970 
975aaa cga tta ttt aaa gat gca aat gat cta tta gat cga gtt gaa aaa2976Lys Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lys980 985 990
gaa ggg tta ctt aaa gcc cgt gga atg tgt gcg cta ttt cca gct tcc3024Glu Gly Leu Leu Lys Ala Arg Gly Met Cys Ala Leu Phe Pro Ala Ser99510001005agt gtt ggt gat gat att gaa gta tat act gat gaa tca cgc act aca3072Ser Val Gly Asp Asp Ile Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr101010151020gtt gca aaa gta ctt cat aat ttg cga caa caa acg gag aag ccg aaa3120Val Ala Lys Val Leu His Asn Leu Arg Gln Gln Thr Glu Lys Pro Lys1025 103010351040ggt ttt aat tat tgt tta tct gat tat ata gca ccc aaa gag tcg ggt3168Gly Phe Asn Tyr Cys Leu Ser Asp Tyr Ile Ala Pro Lys Glu Ser Gly104510501055aaa aat gat tgg atc ggt ggt ttt gct gta act ggt ggt att ggt gag3216Lys Asn Asp Trp Ile Gly Gly Phe Ala Val Thr Gly Gly Ile Gly Glu106010651070cgt gaa cta gct gat gaa tat aaa gca aat ggt gat gat tat aac gct3264Arg Glu Leu Ala Asp Glu Tyr Lys Ala Asn Gly Asp Asp Tyr Asn Ala107510801085atc atg att caa gcg gtg gct gat cgt cta gct gaa gct ttt gct gaa3312Ile Met Ile Gln Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu109010951100tat tta cat gaa aaa gta cgt aag gaa att tgg ggt tac tct cct aat3360Tyr Leu His Glu Lys Val Arg Lys Glu Ile Trp Gly Tyr Ser Pro Asn1105 111011151120gag acg ctt tca aat gat gat tta atc cgt gaa aaa tac caa ggc att3408Glu Thr Leu Ser Asn Asp Asp Leu Ile Arg Glu Lys Tyr Gln Gly Ile112511301135cgt cct gct cct ggt tac cca gct tgt cct gaa cat aca gaa aaa ggg3456Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Gly114011451150gct tta tgg gag tta atg aat gtt gaa gaa tct att gga atg tct tta3504Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser Ile Gly Met Ser Leu115511601165aca tca agc tat gca atg tgg ccc ggt gca tct gtg tca gga atg tat3552Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr117011751180ttt tca cac cca gat tct cgt tat ttt gcg att gct cag att cag caa3600Phe Ser His Pro Asp Ser Arg Tyr Phe Ala Ile Ala Gln Ile Gln Gln1185119011951200gat caa gcc gaa agc tat gcc gat cgt aaa ggt tgg aat atg ctt gaa3648Asp Gln Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu 
Glu120512101215gct gag aag tgg tta ggt cca aat ttg aat taa3681Ala Glu Lys Trp Leu Gly Pro Asn Leu Asn1220122521046
2111226212PRT213費氏弧菌40046Val Ala Gly Ser Asn Ile Lys Val Gln Ile Glu Lys Gln Leu Ser Glu1 5 10 15Arg Ile Leu Leu Ile Asp Gly Gly Met Gly Thr Met Ile Gln Gly Tyr20 25 30Lys Phe Glu Glu Lys Asp Tyr Arg Gly Gly Arg Phe Asn Gln Trp His35 40 45Cys Asp Leu Lys Gly Asn Asn Asp Leu Leu Val Leu Ser Gln Pro Gln50 55 60Ile Ile Arg Asp Ile His Glu Ala Tyr Leu Glu Ala Gly Ala Asp Ile65 70 75 80Leu Glu Thr Asn Thr Phe Asn Ala Thr Thr Ile Ala Met Ala Asp Tyr85 90 95Asp Met Glu Ser Leu Ser Glu Glu Ile Asn Phe Glu Ala Ala Lys Leu100 105 110Ala Arg Glu Val Ala Asp Lys Trp Thr Glu Lys Thr Pro Asn Lys Pro115 120 125Arg Tyr Val Ala Gly Val Leu Gly Pro Thr Asn Arg Thr Cys Ser Ile130 135 140Ser Pro Asp Val Asn Asp Pro Gly Phe Arg Asn Val Ser Phe Asp Glu145 150 155 160Leu Val Glu Ala Tyr Ser Glu Ser Thr Arg Ala Leu Ile Arg Gly Gly165 170 175Ser Asp Leu Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys180 185 190Ala Cys Ser Phe Ala Val Glu Ser Val Phe Glu Glu Leu Gly Ile Thr195 200 205Leu Pro Val Met Ile Ser Gly Thr Ile Thr Asp Ala Ser Gly Arg Thr210 215 220Leu Ser Gly Gln Thr Thr Glu Ala Phe Tyr Asn Ala Leu Arg His Val225 230 235 240Lys Pro Ile Ser Phe Gly Leu Asn Cys Ala Leu Gly Pro Asp Glu Leu245 250 255Arg Glu Tyr Val Ser Glu Leu Ser Arg Ile Ser Glu Cys Tyr Val Ser260 265 270Ala His Pro Asn Ala Gly Leu Pro Asn Ala Phe Gly Glu Tyr Asp Leu275 280 285Ser Pro Glu Asp Met Ala Glu His Val Ala Glu Trp Ala Ser Ser Gly290 295 300
Phe Leu Asn Leu Ile Gly Gly Cys Cys Gly Thr Thr Pro Glu His Ile305 310 315 320Arg Gln Met Ala Leu Val Val Glu Gly Val Lys Pro Arg Gln Leu Pro325 330 335Glu Leu Pro Val Ala Cys Arg Leu Ser Gly Leu Glu Pro Leu Thr Ile340 345 350Glu Lys Asp Ser Leu Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr355 360 365Gly Ser Ala Arg Phe Lys Arg Leu Ile Lys Glu Glu Leu Tyr Asp Glu370 375 380Ala Leu Ser Val Ala Gln Glu Gln Val Glu Asn Gly Ala Gln Ile Ile385 390 395 400Asp Ile Asn Met Asp Glu Gly Met Leu Asp Ala Glu Ala Cys Met Val405 410 415Arg Phe Leu Asn Leu Cys Ala Ser Glu Pro Glu Ile Ser Lys Val Pro420 425 430Val Met Val Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys435 440 445Cys Ile Gln Gly Lys Gly Ile Val Asn Ser Ile Ser Leu Lys Glu Gly450 455 460Lys Glu Lys Phe Val His Gln Ala Lys Leu Ile Arg Arg Tyr Gly Ala465 470 475 480Ala Val Ile Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Arg485 490 495Glu Arg Lys Ile Glu Ile Cys Thr Asn Ala Tyr Asn Ile Leu Val Asp500 505 510Glu Val Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe515 520 525Ala Val Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Val Asp Phe530 535 540Ile Glu Ala Val Gly Asp Ile Lys Arg Thr Leu Pro His Ala Met Ile545 550 555 560Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asn Tyr565 570 575Val Arg Glu Ala Ile His Ala Val Phe Leu Tyr His Cys Phe Lys Asn580 585 590Gly Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Glu Ile Tyr Asp595 600 605Asn Val Pro Glu Asp Leu Arg Glu Ala Val Glu Asp Val Val Leu Asn610 615 620Arg Arg Asp Asp Ser Thr Glu Arg Leu Leu Asp Ile Ala Thr Glu Tyr625 630 635 640
Leu Glu Arg Ala Val Gly Lys Val Glu Asp Lys Ser Ala Leu Glu Trp645 650 655Arg Asp Trp Pro Val Glu Lys Arg Leu Glu His Ser Leu Val Lys Gly660 665 670Ile Thr Glu Phe Ile Val Glu Asp Thr Glu Glu Ala Arg Ile Asn Ala675 680 685Glu Arg Pro Ile Glu Val Ile Glu Gly Pro Leu Met Asp Gly Met Asn690 695700Val Val Gly Asp Leu Phe Gly Glu Gly Lys Met Phe Leu Pro Gln Val705 710 715 720Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Glu Pro725 730 735Phe Ile Asn Ala Ser Lys Glu Val Gly Ala Thr Asn Gly Lys Ile Leu740 745 750Leu Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val755 760 765Gly Val Val Leu Gln Cys Asn Asn Tyr Glu Ile Ile Asp Leu Gly Val770 775 780Met Val Ser Cys Glu Thr Ile Leu Lys Val Ala Lys Glu Glu Asn Val785 790 795 800Asp Ile Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu Met805 810 815Val His Val Ala Lys Glu Met Glu Arg Gln Gly Phe Asp Leu Pro Leu820 825 830Leu Ile Gly Gly Ala Thr Thr Ser Lys Ala His Thr Ala Val Lys Ile835 840 845Glu Gln Asn Tyr Ser Gln Pro Val Val Tyr Val Asn Asn Ala Ser Arg850 855 860Ala Val Gly Val Cys Thr Ser Leu Leu Ser Asn Glu Leu Lys Pro Ser865 870 875 880Phe Val Glu Lys Leu Asp Ile Asp Tyr Glu Arg Val Arg Glu Gln His885 890 895Ser Arg Lys Gln Pro Arg Thr Lys Pro Val Thr Leu Glu Val Ala Arg900 905 910Ala Asn Lys Val Ala Ile Asp Trp Ala Ser Tyr Thr Pro Pro Val Pro915 920 925Leu Lys Pro Gly Val His Ile Phe Asp Asn Phe Asp Val Ser Thr Leu930 935 940Arg Asn Tyr Ile Asp Trp Thr Pro Phe Phe Met Thr Trp Ser Leu Val945 950 955 960Gly Lys Tyr Pro Lys Ile Leu Glu His Glu Glu Val Gly Glu Glu Ala
965 970 975Lys Arg Leu Phe Lys Asp Ala Asn Asp Leu Leu Asp Arg Val Glu Lys980 985 990Glu Gly Leu Leu Lys Ala Arg Gly Met Cys Ala Leu Phe Pro Ala Ser99510001005Ser Val Gly Asp Asp Ile Glu Val Tyr Thr Asp Glu Ser Arg Thr Thr101010151020Val Ala Lys Val Leu His Asn Leu Arg Gln Gln Thr Glu Lys Pro Lys1025 103010351040Gly Phe Asn Tyr Cys Leu Ser Asp Tyr Ile Ala Pro Lys Glu Ser Gly104510501055Lys Asn Asp Trp Ile Gly Gly Phe Ala Val Thr Gly Gly Ile Gly Glu106010651070Arg Glu Leu Ala Asp Glu Tyr Lys Ala Asn Gly Asp Asp Tyr Asn Ala107510801085Ile Met Ile Gln Ala Val Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu109010951100Tyr Leu His Glu Lys Val Arg Lys Glu Ile Trp Gly Tyr Ser Pro Asn1105 111011151120Glu Thr Leu Ser Asn Asp Asp Leu Ile Arg Glu Lys Tyr Gln Gly Ile112511301135Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Glu Lys Gly114011451150Ala Leu Trp Glu Leu Met Asn Val Glu Glu Ser Ile Gly Met Ser Leu115511601165Thr Ser Ser Tyr Ala Met Trp Pro Gly Ala Ser Val Ser Gly Met Tyr117011751180Phe Ser His Pro Asp Ser Arg Tyr Phe Ala Ile Ala Gln Ile Gln Gln1185 119011951200Asp Gln Ala Glu Ser Tyr Ala Asp Arg Lys Gly Trp Asn Met Leu Glu120512101215Ala Glu Lys Trp Leu Gly Pro Asn Leu Asn12201225210472113780212DNA213根癌農桿菌(Agrobacterium tumefaciens)220
<221> CDS <222> (1)..(3777) <223> 15887359 <400> 47
gtg ccc gtg ttt gac gac ctg ttt ggc cct gaa ggg gca aag cgc gac48Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp1 5 10 15ggc gcg gaa att ttc aag gcg ttg cgc gat gcc gcc agc gaa cgc atc96Gly Ala Glu Ile Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg Ile20 25 30ctc att ctc gat ggt gcc atg ggc acg cag atc cag ggt ctc ggt ttt144Leu Ile Leu Asp Gly Ala Met Gly Thr Gln Ile Gln Gly Leu Gly Phe35 40 45gac gag gat cat ttt cgt ggc gac cgt ttt atc ggc tgc gcc tgt cac192Asp Glu Asp His Phe Arg Gly Asp Arg Phe Ile Gly Cys Ala Cys His50 55 60cag aag ggc aat aac gac ctt ctg atc ctg aca cag ccc gat gcc atc240Gln Lys Gly Asn Asn Asp Leu Leu Ile Leu Thr Gln Pro Asp Ala Ile65 70 75 80gag gaa atc cac tat cgc tac gcc atg gcg ggc gcg gat att ctc gaa288Glu Glu Ile His Tyr Arg Tyr Ala Met Ala Gly Ala Asp Ile Leu Glu85 90 95acc aac acg ttt tcc tcc acc cgc atc gcg cag gcc gat tac gag atg336Thr Asn Thr Phe Ser Ser Thr Arg Ile Ala Gln Ala Asp Tyr Glu Met100 105 110gag aat gcc gtc tac gat ctc aac cgc gag ggc gcg gcg atc gtg cgc384Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala Ile Val Arg115 120 125cgg gcg gct cag cgc gcc gag cgc gag gat ggc cgc cgc cgt ttc gtg432Arg Ala Ala Gln Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val130 135 140gcc ggt gcc atc ggt ccg acc aac cgc acg gcc tcg atc tcg cct gac480Ala Gly Ala Ile Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp145 150 155 160gtc aac aat ccc ggt tac cgc gcc gtc agt ttc gac gat ctg cgc att528Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Ile165 170 175gcc tat ggc gag cag atc gat ggc ctg atc gac ggt ggt gcc gat atc576Ala Tyr Gly Glu Gln Ile Asp Gly Leu Ile Asp Gly Gly Ala Asp Ile180 185 190atc ctc atc gag acg atc ttc gat acg ctg aac gcc aag gcg gcg atc624Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile195 200 205ttc gcc tgc gag gaa cgt ttc gag gct aag ggc atc cgc ctg ccg gtc672Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly Ile Arg Leu Pro Val210 215 220atg atc tca ggc acg atc acc gac ctt tcc ggt cgc acg 
ttg tcc ggc720Met Ile Ser Gly Thr Ile Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly225 230 235 240cag acg cct tcg gcg ttc tgg aac tcg gtg cgc cac gcc aac ccc ttc768Gln Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe
245 250 255acc atc ggc ctc aac tgc gcg ctc ggt gcg gat gcc atg cgc ccg cat816Thr Ile Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His260 265 270ctg cag gaa ctg tcc gat gtg gcc gac acc ttt gtc tgc gcc tat ccg864Leu Gln Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro275 280 285aat gcc ggc ctg ccg aac gag ttc ggc caa tat gac gaa acg ccc gag912Asn Ala Gly Leu Pro Asn Glu Phe Gly Gln Tyr Asp Glu Thr Pro Glu290 295 300atg atg gcg cgc cag gtt gag ggc ttc gtt cgt gac ggt ctc gtc aac960Met Met Ala Arg Gln Val Glu Gly Phe Val Arg Asp Gly Leu Val Asn305 310 315 320atc gtc ggc ggt tgc tgc ggt tcg acg ccg gaa cat atc cgg gcg att1008Ile Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His Ile Arg Ala Ile325 330 335gcc gaa gcc gtc aag gat tac aag ccc cgc gaa att cct gaa cac aag1056Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu Ile Pro Glu His Lys340 345 350ccg ttc atg tcg ctt tcc ggc ctt gaa ccc ttc gtg ctg acc aag gac1104Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp355 360 365att ccc ttc gtc aac gtg ggc gag cgc acc aac gtc acc ggt tcg gcc1152Ile Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala370 375 380cgc ttc cgc aag ctc atc act gcc ggc gac tat acg gcg gcg ctg gct1200Arg Phe Arg Lys Leu Ile Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala385 390 395 400gtt gcc cgc gac cag gtg gaa aac ggc gcg cag atc atc gac atc aac1248Val Ala Arg Asp Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile Asn405 410 415atg gat gag ggc ctg atc gat tcg gaa aag gcg atg gtc gag ttc ctg1296Met Asp Glu Gly Leu Ile Asp Ser Glu Lys Ala Met Val Glu Phe Leu420 425 430aac ctc atc gcc gcc gag cct gac att gcc cgt gtg ccc gtc atg atc1344Asn Leu Ile Ala Ala Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile435 440 445gac tca tcc aag ttc gag atc atc gag gcc ggc ctg aaa tgc gtg cag1392Asp Ser Ser Lys Phe Glu Ile Ile Glu Ala Gly Leu Lys Cys Val Gln450 455 460ggc aaa tcg atc gtc aat tcc att tcg ctg aag gaa ggc gag gag aag1440Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu Lys465 470 475 480ttt 
ctc cag cag gct cgg ctc gtc cac aat tac ggt gcg gcg gtt gtc1488Phe Leu Gln Gln Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val485 490 495
gtc atg gcc ttt gat gag gtc ggg cag gcg gat acc tat cag cgc aag1536Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Tyr Gln Arg Lys500 505 510gtg gaa atc tgc gcg cgc gcc tac aag ctt ctg acc gaa aag gcc ggt1584Val Glu Ile Cys Ala Arg Ala Tyr Lys Leu Leu Thr Glu Lys Ala Gly515 520 525ctg tct ccg gaa gac atc atc ttc gac ccg aat gtg ttt gcg gta gct1632Leu Ser Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Phe Ala Val Ala530 535 540acg ggc atc gag gag cac aat aat tac ggc gtg gac ttc atc gag gcc1680Thr Gly Ile Glu Glu His Asn Asn Tyr Gly Val Asp Phe Ile Glu Ala545 550 555 560acc aag acc atc cgc gaa acc atg ccg ctc acg cat att tcc ggg ggc1728Thr Lys Thr Ile Arg Glu Thr Met Pro Leu Thr His Ile Ser Gly Gly565 570 575gtt tcc aac ctg tcc ttc tcc ttc cgc ggc aat gag ccg gtg cgt gag1776Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu580 585 590gcg atg cat gcc gtg ttc ctc tat cac gcc att cag gtc ggc atg gat1824Ala Met His Ala Val Phe Leu Tyr His Ala Ile Gln Val Gly Met Asp595 600 605atg ggc atc gtc aac gcc ggg cag ctt gcg gtt tac gac aat atc gat1872Met Gly Ile Val Asn Ala Gly Gln Leu Ala Val Tyr Asp Asn Ile Asp610 615 620gcg gaa ctg cgc gag gcc tgc gaa gac gtg gtg ctg aac cgc cgc gac1920Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp625 630 635 640gat gcc acg gag cgt ctg ctc gag gtg gcg gag cgt ttc cgt ggt acg1968Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr645 650 655ggt gaa aaa cag gcc aag gtg cag gat ctt tcc tgg cgc gag tat ccc2016Gly Glu Lys Gln Ala Lys Val Gln Asp Leu Ser Trp Arg Glu Tyr Pro660 665 670gtt gaa aag cgg ctg gaa cat gct ctg gtc aac ggc att acc gac tat2064Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Asp Tyr675 680 685atc gag gcc gat acg gaa gag gca cgc cag cag gcc gcc cgc ccg ctg2112Ile Glu Ala Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Leu690 695 700cat gtc atc gaa ggg ccg ctg atg gcc ggt atg aat gtg gtg ggt gac2160His Val Ile Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp705 710 715 720ctg ttc ggt 
tcc ggc aag atg ttc ctg cca cag gtg gtg aaa tcc gcc2208Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala725 730 735cgt gtg atg aag cag gcg gtt gcc gtt ctg ctg cct tac atg gaa gag2256Arg Val Met Lys Gln Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu
740 745 750gaa aag cgc ctg aat ggc ggt tcc gag cgc agt gcc gcc ggc aag gtg2304Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val755 760 765cta atg gcg acc gtg aag ggc gac gtg cac gat atc ggc aag aac atc2352Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile770 775 780gtc ggc gtt gtg cta gcc tgc aac aat tac gag atc att gat ctc ggc2400Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu Ile Ile Asp Leu Gly785 790 795 800gtg atg gtg ccg acg acg aaa atc ctc gaa acg gcg atc gcc gaa aag 2448Val Met Val Pro Thr Thr Lys Ile Leu Glu Thr Ala Ile Ala Glu Lys805 810 815gtg gat gtg atc ggc ctc tcc ggc ctc atc acc ccg tcg ctg gat gag2496Val Asp Val Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu820 825 830atg gtg cat gtg gcg gcc gaa atg gag cga cag ggt ttc gac att ccg2544Met Val His Val Ala Ala Glu Met Glu Arg Gln Gly Phe Asp Ile Pro835 840 845ctg ctg atc ggc ggt gcg acg acc agc cgt gtg cat acg gcg gta aaa2592Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys850 855 860atc cat ccg cgt tac gag cag ggg cag gcg atc tat gtc acc gac gcc2640Ile His Pro Arg Tyr Glu Gln Gly Gln Ala Ile Tyr Val Thr Asp Ala865 870 875 880tcg cgc gcg gtg ggc gtc gtt tca gcg ctc ctc tcc gaa gag cag aag2688Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gln Lys885 890 895ccc gct tat atc gac ggc atc cga gcc gaa tat gcc aag gtg gcg gaa2736Pro Ala Tyr Ile Asp Gly Ile Arg Ala Glu Tyr Ala Lys Val Ala Glu900 905 910gcc cat gcc cgc aat gag cgc gaa aag cag cgc ctg ccg ctt tcc cgc2784Ala His Ala Arg Asn Glu Arg Glu Lys Gln Arg Leu Pro Leu Ser Arg915 920 925gcc cgg gag aat gcg cac aag atc gac tgg tcg agc tac agc gtt gtc2832Ala Arg Glu Asn Ala His Lys Ile Asp Trp Ser Ser Tyr Ser Val Val930 935 940aag ccg cag ttc ttc ggc acc aag gtt ttt gag acc tat gat ctg gaa2880Lys Pro Gln Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu945 950 955 960gag ctt tcc cgt tac atc gac tgg acg ccg ttc ttc cag acc tgg gaa2928Glu Leu Ser Arg Tyr Ile Asp Trp Thr Pro Phe Phe Gln Thr Trp Glu965 970 
975ttg aag ggc cgt ttc ccg gcg atc ctt gaa gac gaa aag cag ggc gag2976Leu Lys Gly Arg Phe Pro Ala Ile Leu Glu Asp Glu Lys Gln Gly Glu980 985 990
gcg gcg cgg cag ctt tat gcc gat gcg cag gcc atg ctt gcg aag atc3024Ala Ala Arg Gln Leu Tyr Ala Asp Ala Gln Ala Met Leu Ala Lys Ile99510001005atc gag gaa aag tgg ttc cga cca cgc gcg gtg atc ggc ttc tgg ccg3072Ile Glu Glu Lys Trp Phe Arg Pro Arg Ala Val Ile Gly Phe Trp Pro101010151020gcc aat gcc gtg ggt gac gat atc agg ctc ttt acg gat gaa ggt cgg3120Ala Asn Ala Val Gly Asp Asp Ile Arg Leu Phe Thr Asp Glu Gly Arg1025103010351040aag gaa gag ttg gcg acg ttc ttc acg ctg cgc cag cag ctt tcc aag3168Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Leu Ser Lys104510501055cgc gat ggc cgt ccg aac gtg gcg ctg tcc gat ttc gtc gcg ccc gtc3216Arg Asp Gly Arg Pro Asn Val Ala Leu Ser Asp Phe Val Ala Pro Val106010651070gat agc ggc gtt gcc gat tat gtc ggc ggt ttc gtg gta acg gcg ggt3264Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly107510801085atc gag gaa gtg gcg att gcc gag cgc ttc gag cgg gcc aat gac gat3312Ile Glu Glu Val Ala Ile Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp109010951100tat tcg tcc atc ctc gtc aag gcg ttg gct gac cgt ttt gcc gaa gcc3360Tyr Ser Ser Ile Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala1105111011151120ttt gcc gag cgt atg cat gag cgc gtg cgc aag gag ttc tgg ggt tat3408Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr112511301135gcg ccg gac gag gct ctt gcc ggt gac gat ctg ata ggc gaa gcc tat3456Ala Pro Asp Glu Ala Leu Ala Gly Asp Asp Leu Ile Gly Glu Ala Tyr114011451150gcc ggt atc cgc ccg gca ccg ggt tat ccg gcc cag ccg gac cac acc3504Ala Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr115511601165gaa aag aag acg ctg ttt gct ctg ctg gac gcc acc aat gcg gcg ggt3552Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly117011751180gtg gaa ttg acg gaa agc tat gcg atg tgg ccc ggc tcg tcg gtt tcg3600Val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser1185119011951200ggc ctc tat atc ggc cat ccc gaa agc tat tat ttc ggc gtt gcc aag3648Gly Leu Tyr Ile Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala 
Lys120512101215gtg gag cgg gat cag gtt ctc gac tat gcg cgc cgc aag gat atg ccg3696Val Glu Arg Asp Gln Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro122012251230gtc aca gag gtg gag cgc tgg ctc ggg ccg gtg ctc aac tac gtg ccg3744Val Thr Glu Val Glu Arg Trp Leu Gly Pro Val Leu Asn Tyr Val Pro
1235 1240 1245
acc aac ggc gag gag aaa atc gac agc gct gcg tga 3780
Thr Asn Gly Glu Glu Lys Ile Asp Ser Ala Ala
1250 1255
<210> 48
<211> 1259
<212> PRT
<213> Agrobacterium tumefaciens
<400> 48
Val Pro Val Phe Asp Asp Leu Phe Gly Pro Glu Gly Ala Lys Arg Asp
1 5 10 15
Gly Ala Glu Ile Phe Lys Ala Leu Arg Asp Ala Ala Ser Glu Arg Ile
20 25 30
Leu Ile Leu Asp Gly Ala Met Gly Thr Gln Ile Gln Gly Leu Gly Phe
35 40 45
Asp Glu Asp His Phe Arg Gly Asp Arg Phe Ile Gly Cys Ala Cys His
50 55 60
Gln Lys Gly Asn Asn Asp Leu Leu Ile Leu Thr Gln Pro Asp Ala Ile
65 70 75 80
Glu Glu Ile His Tyr Arg Tyr Ala Met Ala Gly Ala Asp Ile Leu Glu
85 90 95
Thr Asn Thr Phe Ser Ser Thr Arg Ile Ala Gln Ala Asp Tyr Glu Met
100 105 110
Glu Asn Ala Val Tyr Asp Leu Asn Arg Glu Gly Ala Ala Ile Val Arg
115 120 125
Arg Ala Ala Gln Arg Ala Glu Arg Glu Asp Gly Arg Arg Arg Phe Val
130 135 140
Ala Gly Ala Ile Gly Pro Thr Asn Arg Thr Ala Ser Ile Ser Pro Asp
145 150 155 160
Val Asn Asn Pro Gly Tyr Arg Ala Val Ser Phe Asp Asp Leu Arg Ile
165 170 175
Ala Tyr Gly Glu Gln Ile Asp Gly Leu Ile Asp Gly Gly Ala Asp Ile
180 185 190
Ile Leu Ile Glu Thr Ile Phe Asp Thr Leu Asn Ala Lys Ala Ala Ile
195 200 205
Phe Ala Cys Glu Glu Arg Phe Glu Ala Lys Gly Ile Arg Leu Pro Val
210 215 220
Met Ile Ser Gly Thr Ile Thr Asp Leu Ser Gly Arg Thr Leu Ser Gly
225 230 235 240
Gln Thr Pro Ser Ala Phe Trp Asn Ser Val Arg His Ala Asn Pro Phe
245 250 255
Thr Ile Gly Leu Asn Cys Ala Leu Gly Ala Asp Ala Met Arg Pro His
260 265 270Leu Gln Glu Leu Ser Asp Val Ala Asp Thr Phe Val Cys Ala Tyr Pro275 280 285Asn Ala Gly Leu Pro Asn Glu Phe Gly Gln Tyr Asp Glu Thr Pro Glu290 295 300Met Met Ala Arg Gln Val Glu Gly Phe Val Arg Asp Gly Leu Val Asn305 310 315 320Ile Val Gly Gly Cys Cys Gly Ser Thr Pro Glu His Ile Arg Ala Ile325 330 335Ala Glu Ala Val Lys Asp Tyr Lys Pro Arg Glu Ile Pro Glu His Lys340 345 350Pro Phe Met Ser Leu Ser Gly Leu Glu Pro Phe Val Leu Thr Lys Asp355 360 365Ile Pro Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser Ala370 375 380Arg Phe Arg Lys Leu Ile Thr Ala Gly Asp Tyr Thr Ala Ala Leu Ala385 390 395 400Val Ala Arg Asp Gln Val Glu Asn Gly Ala Gln Ile Ile Asp Ile Asn405 410 415Met Asp Glu Gly Leu Ile Asp Ser Glu Lys Ala Met Val Glu Phe Leu420 425 430Asn Leu Ile Ala Ala Glu Pro Asp Ile Ala Arg Val Pro Val Met Ile435 440 445Asp Ser Ser Lys Phe Glu Ile Ile Glu Ala Gly Leu Lys Cys Val Gln450 455 460Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu Lys465 470 475 480Phe Leu Gln Gln Ala Arg Leu Val His Asn Tyr Gly Ala Ala Val Val485 490 495Val Met Ala Phe Asp Glu Val Gly Gln Ala Asp Thr Tyr Gln Arg Lys500 505 510Val Glu Ile Cys Ala Arg Ala Tyr Lys Leu Leu Thr Glu Lys Ala Gly515 520 525Leu Ser Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Phe Ala Val Ala530 535 540Thr Gly Ile Glu Glu His Asn Asn Tyr Gly Val Asp Phe Ile Glu Ala545 550 555 560Thr Lys Thr Ile Arg Glu Thr Met Pro Leu Thr His Ile Ser Gly Gly565 570 575Val Ser Asn Leu Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg Glu580 585 590
Ala Met His Ala Val Phe Leu Tyr His Ala Ile Gln Val Gly Met Asp595 600 605Met Gly Ile Val Asn Ala Gly Gln Leu Ala Val Tyr Asp Asn Ile Asp610 615 620Ala Glu Leu Arg Glu Ala Cys Glu Asp Val Val Leu Asn Arg Arg Asp625 630 635 640Asp Ala Thr Glu Arg Leu Leu Glu Val Ala Glu Arg Phe Arg Gly Thr645 650 655Gly Glu Lys Gln Ala Lys Val Gln Asp Leu Ser Trp Arg Glu Tyr Pro660 665 670Val Glu Lys Arg Leu Glu His Ala Leu Val Asn Gly Ile Thr Asp Tyr675 680 685Ile Glu Ala Asp Thr Glu Glu Ala Arg Gln Gln Ala Ala Arg Pro Leu690 695 700His Val Ile Glu Gly Pro Leu Met Ala Gly Met Asn Val Val Gly Asp705 710 715 720Leu Phe Gly Ser Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala725 730 735Arg Val Met Lys Gln Ala Val Ala Val Leu Leu Pro Tyr Met Glu Glu740 745 750Glu Lys Arg Leu Asn Gly Gly Ser Glu Arg Ser Ala Ala Gly Lys Val755 760 765Leu Met Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile770 775 780Val Gly Val Val Leu Ala Cys Asn Asn Tyr Glu Ile Ile Asp Leu Gly785 790 795 800Val Met Val Pro Thr Thr Lys Ile Leu Glu Thr Ala Ile Ala Glu Lys805 810 815Val Asp Val Ile Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Asp Glu820 825 830Met Val His Val Ala Ala Glu Met Glu Arg Gln Gly Phe Asp Ile Pro835 840 845Leu Leu Ile Gly Gly Ala Thr Thr Ser Arg Val His Thr Ala Val Lys850 855 860Ile His Pro Arg Tyr Glu Gln Gly Gln Ala Ile Tyr Val Thr Asp Ala865 870 875 880Ser Arg Ala Val Gly Val Val Ser Ala Leu Leu Ser Glu Glu Gln Lys885 890 895Pro Ala Tyr Ile Asp Gly Ile Arg Ala Glu Tyr Ala Lys Val Ala Glu900 905 910Ala His Ala Arg Asn Glu Arg Glu Lys Gln Arg Leu Pro Leu Ser Arg915 920 925
Ala Arg Glu Asn Ala His Lys Ile Asp Trp Ser Ser Tyr Ser Val Val930 935 940Lys Pro Gln Phe Phe Gly Thr Lys Val Phe Glu Thr Tyr Asp Leu Glu945 950 955 960Glu Leu Ser Arg Tyr Ile Asp Trp Thr Pro Phe Phe Gln Thr Trp Glu965 970 975Leu Lys Gly Arg Phe Pro Ala Ile Leu Glu Asp Glu Lys Gln Gly Glu980 985 990Ala Ala Arg Gln Leu Tyr Ala Asp Ala Gln Ala Met Leu Ala Lys Ile99510001005Ile Glu Glu Lys Trp Phe Arg Pro Arg Ala Val Ile Gly Phe Trp Pro101010151020Ala Asn Ala Val Gly Asp Asp Ile Arg Leu Phe Thr Asp Glu Gly Arg1025 103010351040Lys Glu Glu Leu Ala Thr Phe Phe Thr Leu Arg Gln Gln Leu Ser Lys104510501055Arg Asp Gly Arg Pro Asn Val Ala Leu Ser Asp Phe Val Ala Pro Val106010651070Asp Ser Gly Val Ala Asp Tyr Val Gly Gly Phe Val Val Thr Ala Gly107510801085Ile Glu Glu Val Ala Ile Ala Glu Arg Phe Glu Arg Ala Asn Asp Asp109010951100Tyr Ser Ser Ile Leu Val Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala1105 111011151120Phe Ala Glu Arg Met His Glu Arg Val Arg Lys Glu Phe Trp Gly Tyr112511301135Ala Pro Asp Glu Ala Leu Ala Gly Asp Asp Leu Ile Gly Glu Ala Tyr114011451150Ala Gly Ile Arg Pro Ala Pro Gly Tyr Pro Ala Gln Pro Asp His Thr115511601165Glu Lys Lys Thr Leu Phe Ala Leu Leu Asp Ala Thr Asn Ala Ala Gly117011751180Val Glu Leu Thr Glu Ser Tyr Ala Met Trp Pro Gly Ser Ser Val Ser1185 119011951200Gly Leu Tyr Ile Gly His Pro Glu Ser Tyr Tyr Phe Gly Val Ala Lys120512101215Val Glu Arg Asp Gln Val Leu Asp Tyr Ala Arg Arg Lys Asp Met Pro122012251230Val Thr Glu Val Glu Arg Trp Leu Gly Pro Val Leu Asn Tyr Val Pro123512401245Thr Asn Gly Glu Glu Lys Ile Asp Ser Ala Ala
1250 1255
<210> 49
<211> 2718
<212> DNA
<213> Ralstonia solanacearum
<220>
221CDS222(1)..(2715)223RSOL_GMI100040049atg acc gac cac ctc atg cgc ctc tcc ggc ctc gaa ccg ttc aac atc48Met Thr Asp His Leu Met Arg Leu Ser Gly Leu Glu Pro Phe Asn Ile1 5 10 15ggc gag gac acg ctg ttc gtc aac gtc ggc gaa cgc acc aac gtc acc96Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr20 25 30gga tcc aag gcg ttc gcg cgc atg atc ctc aac agc cag ttc gac gag144Gly Ser Lys Ala Phe Ala Arg Met Ile Leu Asn Ser Gln Phe Asp Glu35 40 45gcg ctc gcc gtg gca cgc cag cag gtc gag aac ggc gcg cag gtc atc192Ala Leu Ala Val Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile50 55 60gac atc aac atg gac gag gcc atg ctc gac tcc aag gcg gcg atg gtg240Asp Ile Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val65 70 75 80cgc ttc ctg aac ctg atc gcc tcg gag ccg gac atc gcg cgc gtg ccg288Arg Phe Leu Asn Leu Ile Ala Ser Glu Pro Asp Ile Ala Arg Val Pro85 90 95atc atg atc gac tcg tcc aag tgg gag gtg atc gag gcc ggc ctg aag336Ile Met Ile Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys100 105 110tgc gtg cag ggc aag gcc atc gtc aac tcg atc tcg ctc aag gaa ggc384Cys Val Gln Gly Lys Ala Ile Val Asn Ser Ile Ser Leu Lys Glu Gly115 120 125gag gaa cag ttc gcc cac cac gcc aag ctg atc aag cgc tac ggc gcc432Glu Glu Gln Phe Ala His His Ala Lys Leu Ile Lys Arg Tyr Gly Ala130 135 140gcc gcc gtg gtg atg gcc ttc gac gag cag ggc cag gcc gac acg ttc480Ala Ala Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Phe145 150 155 160gcg cgc aag acc gag atc tgc aag cgc agc tat gac ttc ctc gtg aac528Ala Arg Lys Thr Glu Ile Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn165 170 175cag gtc ggc ttt gcg ccg gaa gac atc atc ttc gat ccg aac atc ttc576Gln Val Gly Phe Ala Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe180 185 190
gcg gtc gcc acc ggc atc gag gag cac aac aac tac gcc gtc gac ttc624Ala Val Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Val Asp Phe195 200 205atc gag gcc acg cgc tgg atc aag cag aaa ttg ccg cac gcc aag gtg672Ile Glu Ala Thr Arg Trp Ile Lys Gln Lys Leu Pro His Ala Lys Val210 215 220agc ggc ggc gtg tcg aac gtc tcg ttc tcg ttc cgc ggc aac gac gtg720Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Val225 230 235 240gtg cgc gag gcc atc cac acc gtg ttc ctg tac cac gcc atc ggt gcg768Val Arg Glu Ala Ile His Thr Val Phe Leu Tyr His Ala Ile Gly Ala245 250 255ggc atg gac atg ggc atc gtc aac gcg ggc cag ttg ggc gtg tac gag816Gly Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Gly Val Tyr Glu260 265 270aac ctc gcc ccc gaa ctg cgc gag cgc gtg gaa gac gtg gtg ctc aac864Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Asp Val Val Leu Asn275 280 285cgc cgc ccg gat gcg acc gac cgc ctg ctg gaa att gcc gac cgc tac912Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu Ile Ala Asp Arg Tyr290 295 300aag ggc ggc ggc gcc aag cgc gag gag aac ctc gcc tgg cgc cag gag960Lys Gly Gly Gly Ala Lys Arg Glu Glu Asn Leu Ala Trp Arg Gln Glu305 310 315 320ccg gtg gaa aag cgc ctg gcc cac gcg ctc gtg cac ggc atc acc gac1008Pro Val Glu Lys Arg Leu Ala His Ala Leu Val His Gly Ile Thr Asp325 330 335tac gtg gtc gaa gac acc gag gaa gtt cgc cag aag atc ttt gcc gcc1056Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gln Lys Ile Phe Ala Ala340 345 350ggc ggc cgc ccg atc cag gtg atc gag ggc ccg ctg atg gac ggc atg1104Gly Gly Arg Pro Ile Gln Val Ile Glu Gly Pro Leu Met Asp Gly Met355 360 365aac atc gtc ggc gat ctg ttc ggc gcg ggc aag atg ttc ctg ccg cag1152Asn Ile Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln370 375 380gtg gtg aaa tcc gcc cgc gtg atg aag cag gcg gtg gcc cac ctg atc1200Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Ile385 390 395 400ccg ttc atc gag gaa gag aag cgg cag atc gcg gcc gcc ggc ggc gac1248Pro Phe Ile Glu Glu Glu Lys Arg Gln Ile Ala Ala Ala Gly Gly Asp405 410 415gtg cgc tcg cgc ggc 
aag atc gtc atc gcc acc gtg aag ggc gac gtg1296Val Arg Ser Arg Gly Lys Ile Val Ile Ala Thr Val Lys Gly Asp Val420 425 430cac gac atc ggc aag aac atc gtc acc gtc gtg ctc cag tgc aac aac1344His Asp Ile Gly Lys Asn Ile Val Thr Val Val Leu Gln Cys Asn Asn
435 440 445ttc gaa gtc gtg aac atg ggc gtg atg gtc ccg tgc aac gag atc ctg1392Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu Ile Leu450 455 460gcc aag gcg aag gtc gag ggc gcg gac atc atc ggc ctg tcg ggc ctg1440Ala Lys Ala Lys Val Glu Gly Ala Asp Ile Ile Gly Leu Ser Gly Leu465 470 475 480atc aca ccg tcg ctg gaa gag atg gcc tac gtg gcc tcc gag atg cag1488Ile Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gln485 490 495cgc gac gag tac ttc cgc gtg aag aag atc ccg ctg ctg atc ggt ggc1536Arg Asp Glu Tyr Phe Arg Val Lys Lys Ile Pro Leu Leu Ile Gly Gly500 505 510gcg acc acg agc cgc gtg cac acc gcc gtg aag atc gcg ccc aat tac1584Ala Thr Thr Ser Arg Val His Thr Ala Val Lys Ile Ala Pro Asn Tyr515 520 525gaa ggc ccg gtc gtg tac gtg ccc gac gcc tcg cgc tcg gtg agc gtg1632Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val530 535 540gcc tcc agc ctg ctg tcc gac gag gcc gcc gcg cgc tac atc gaa gag1680Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr Ile Glu Glu545 550 555 560ctg cac gcc gac tac gac cgc atc cgc acc cag cac gcc agc aag aaa1728Leu His Ala Asp Tyr Asp Arg Ile Arg Thr Gln His Ala Ser Lys Lys565 570 575gcc atg ccg atg gtg tcg ctg gcc gcc gcg cgc gcc aac aag acc cgg1776Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn Lys Thr Arg580 585 590atc gac tgg tcg aac tac acg ccg ccc aag ccc aag ttc gtc ggc cgc1824Ile Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg595 600 605cgc gtg ttc cgc aac tac gac ctg aac gag ctc gcg cag tac atc gac1872Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gln Tyr Ile Asp610 615 620tgg ggc ccg ttc ttc cag acg tgg gac ctg gcc ggc aaa ttc ccc gac1920Trp Gly Pro Phe Phe Gln Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp625 630 635 640atc ctc aac gac gcg atc gtc ggc gaa tcg gcc cgc cgc gtg ttc tcc1968Ile Leu Asn Asp Ala Ile Val Gly Glu Ser Ala Arg Arg Val Phe Ser645 650 655gac ggc aag agc atg ctc gcg cgc ctg atc gcc gga cgc tgg ctg acg2016Asp Gly Lys Ser Met Leu Ala Arg Leu Ile Ala Gly Arg Trp Leu Thr660 665 
670gcc aac ggc gtg atc gcg ctg ctg ccg gcc aac acc gtc aac gac gac 2064Ala Asn Gly Val Ile Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asp675 680 685
gac atc gag atc tac acc gac gag acc cgc tcg gaa gtc gcc ctc acc2112Asp Ile Glu Ile Tyr Thr Asp Glu Thr Arg Ser Glu Val Ala Leu Thr690 695 700tgg cgc aac atc cgc cag cag agc gag cgc ccg atc atc gac ggc gtg2160Trp Arg Asn Ile Arg Gln Gln Ser Glu Arg Pro Ile Ile Asp Gly Val705 710 715 720atg cgc ccg aac cgc tgc ctg gcg gac ttc atc gcc ccc aag gac acc2208Met Arg Pro Asn Arg Cys Leu Ala Asp Phe Ile Ala Pro Lys Asp Thr725 730 735ggc atc gcc gat tac atc ggc ctc ttc gcg gtg acg ggc ggc atc ggg2256Gly Ile Ala Asp Tyr Ile Gly Leu Phe Ala Val Thr Gly Gly Ile Gly740 745 750atc gac aag cgc gaa gcc gcc ttc gaa gcc gac cac gac gac tac agc2304Ile Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser755 760 765gcg atc atg ctc aag gcc ctg gcc gac cgc ttc gcc gaa gcc ttc gcc2352Ala Ile Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala770 775 780gag tgc ctg cac gcc cgt gtg cgc cgc gac ctg tgg ggc tac gcg cag2400Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gln785 790 795 800gac gaa acg ctc gac aac gac gcg ctg atc cgc gag gaa tac cgc ggc2448Asp Glu Thr Leu Asp Asn Asp Ala Leu Ile Arg Glu Glu Tyr Arg Gly805 810 815atc cgc ccg gcg ccc ggc tac ccg gcc tgc ccg gag cac acc gtc aag2496Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lys820 825 830cgc gac ctg ttc cgc gtg ctc gac gcg cag gag atc ggc atg aac ctg2544Arg Asp Leu Phe Arg Val Leu Asp Ala Gln Glu Ile Gly Met Asn Leu835 840 845acc gag gcg ctg gcg atg aca ccg gcc gcg tcg gtc tcg ggc ttc cag2592Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gln850 855 860ctg tcg cac ccg gac agc acg tac ttc acg atc ggc aag atc ggc cag2640Leu Ser His Pro Asp Ser Thr Tyr Phe Thr Ile Gly Lys Ile Gly Gln865 870 875 880gac cag gtg gac gac atg gcc gcg cgc agc ggg gaa gac cgc cgc aat2688Asp Gln Val Asp Asp Met Ala Ala Arg Ser Gly Glu Asp Arg Arg Asn885 890 895gtg gag cgc gcc ctg gca ccc aac ctg taa2718Val Glu Arg Ala Leu Ala Pro Asn Leu900 90521050211905212PRT213Ralstonia solanacearum
40050Met Thr Asp His Leu Met Arg Leu Ser Gly Leu Glu Pro Phe Asn Ile1 5 10 15Gly Glu Asp Thr Leu Phe Val Asn Val Gly Glu Arg Thr Asn Val Thr20 25 30Gly Ser Lys Ala Phe Ala Arg Met Ile Leu Asn Ser Gln Phe Asp Glu35 40 45Ala Leu Ala Val Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile50 55 60Asp Ile Asn Met Asp Glu Ala Met Leu Asp Ser Lys Ala Ala Met Val65 70 75 80Arg Phe Leu Asn Leu Ile Ala Ser Glu Pro Asp Ile Ala Arg Val Pro85 90 95Ile Met Ile Asp Ser Ser Lys Trp Glu Val Ile Glu Ala Gly Leu Lys100 105 110Cys Val Gln Gly Lys Ala Ile Val Asn Ser Ile Ser Leu Lys Glu Gly115 120 125Glu Glu Gln Phe Ala His His Ala Lys Leu Ile Lys Arg Tyr Gly Ala130 135 140Ala Ala Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Thr Phe145 150 155 160Ala Arg Lys Thr Glu Ile Cys Lys Arg Ser Tyr Asp Phe Leu Val Asn165 170 175Gln Val Gly Phe Ala Pro Glu Asp Ile Ile Phe Asp Pro Asn Ile Phe180 185 190Ala Val Ala Thr Gly Ile Glu Glu His Asn Asn Tyr Ala Val Asp Phe195 200 205Ile Glu Ala Thr Arg Trp Ile Lys Gln Lys Leu Pro His Ala Lys Val210 215 220Ser Gly Gly Val Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Asp Val225 230 235 240Val Arg Glu Ala Ile His Thr Val Phe Leu Tyr His Ala Ile Gly Ala245 250 255Gly Met Asp Met Gly Ile Val Asn Ala Gly Gln Leu Gly Val Tyr Glu260 265 270Asn Leu Ala Pro Glu Leu Arg Glu Arg Val Glu Asp Val Val Leu Asn275 280 285Arg Arg Pro Asp Ala Thr Asp Arg Leu Leu Glu Ile Ala Asp Arg Tyr290 295 300Lys Gly Gly Gly Ala Lys Arg Glu Glu Asn Leu Ala Trp Arg Gln Glu305 310 315 320Pro Val Glu Lys Arg Leu Ala His Ala Leu Val His Gly Ile Thr Asp
325 330 335Tyr Val Val Glu Asp Thr Glu Glu Val Arg Gln Lys Ile Phe Ala Ala340 345 350Gly Gly Arg Pro Ile Gln Val Ile Glu Gly Pro Leu Met Asp Gly Met355 360 365Asn Ile Val Gly Asp Leu Phe Gly Ala Gly Lys Met Phe Leu Pro Gln370 375 380Val Val Lys Ser Ala Arg Val Met Lys Gln Ala Val Ala His Leu Ile385 390 395 400Pro Phe Ile Glu Glu Glu Lys Arg Gln Ile Ala Ala Ala Gly Gly Asp405 410 415Val Arg Ser Arg Gly Lys Ile Val Ile Ala Thr Val Lys Gly Asp Val420 425 430His Asp Ile Gly Lys Asn Ile Val Thr Val Val Leu Gln Cys Asn Asn435 440 445Phe Glu Val Val Asn Met Gly Val Met Val Pro Cys Asn Glu Ile Leu450 455 460Ala Lys Ala Lys Val Glu Gly Ala Asp Ile Ile Gly Leu Ser Gly Leu465 470 475 480Ile Thr Pro Ser Leu Glu Glu Met Ala Tyr Val Ala Ser Glu Met Gln485 490 495Arg Asp Glu Tyr Phe Arg Val Lys Lys Ile Pro Leu Leu Ile Gly Gly500 505 510Ala Thr Thr Ser Arg Val His Thr Ala Val Lys Ile Ala Pro Asn Tyr515 520 525Glu Gly Pro Val Val Tyr Val Pro Asp Ala Ser Arg Ser Val Ser Val530 535 540Ala Ser Ser Leu Leu Ser Asp Glu Ala Ala Ala Arg Tyr Ile Glu Glu545 550 555 560Leu His Ala Asp Tyr Asp Arg Ile Arg Thr Gln His Ala Ser Lys Lys565 570 575Ala Met Pro Met Val Ser Leu Ala Ala Ala Arg Ala Asn Lys Thr Arg580 585 590Ile Asp Trp Ser Asn Tyr Thr Pro Pro Lys Pro Lys Phe Val Gly Arg595 600 605Arg Val Phe Arg Asn Tyr Asp Leu Asn Glu Leu Ala Gln Tyr Ile Asp610 615 620Trp Gly Pro Phe Phe Gln Thr Trp Asp Leu Ala Gly Lys Phe Pro Asp625 630 635 640Ile Leu Asn Asp Ala Ile Val Gly Glu Ser Ala Arg Arg Val Phe Ser645 650 655
Asp Gly Lys Ser Met Leu Ala Arg Leu Ile Ala Gly Arg Trp Leu Thr660 665 670Ala Asn Gly Val Ile Ala Leu Leu Pro Ala Asn Thr Val Asn Asp Asp675 680 685Asp Ile Glu Ile Tyr Thr Asp Glu Thr Arg Ser Glu Val Ala Leu Thr690 695 700Trp Arg Asn Ile Arg Gln Gln Ser Glu Arg Pro Ile Ile Asp Gly Val705 710 715 720Met Arg Pro Asn Arg Cys Leu Ala Asp Phe Ile Ala Pro Lys Asp Thr725 730 735Gly Ile Ala Asp Tyr Ile Gly Leu Phe Ala Val Thr Gly Gly Ile Gly740 745 750Ile Asp Lys Arg Glu Ala Ala Phe Glu Ala Asp His Asp Asp Tyr Ser755 760 765Ala Ile Met Leu Lys Ala Leu Ala Asp Arg Phe Ala Glu Ala Phe Ala770 775 780Glu Cys Leu His Ala Arg Val Arg Arg Asp Leu Trp Gly Tyr Ala Gln785 790 795 800Asp Glu Thr Leu Asp Asn Asp Ala Leu Ile Arg Glu Glu Tyr Arg Gly805 810 815Ile Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Glu His Thr Val Lys820 825 830Arg Asp Leu Phe Arg Val Leu Asp Ala Gln Glu Ile Gly Met Asn Leu835 840 845Thr Glu Ala Leu Ala Met Thr Pro Ala Ala Ser Val Ser Gly Phe Gln850 855 860Leu Ser His Pro Asp Ser Thr Tyr Phe Thr Ile Gly Lys Ile Gly Gln865 870 875 880Asp Gln Val Asp Asp Met Ala Ala Arg Ser Gly Glu Asp Arg Arg Asn885 890 895Val Glu Arg Ala Leu Ala Pro Asn Leu900 905210512113645212DNA213Chlorobium tepidum220
<221> CDS
<222> (1)..(3642)
<223> RCL00420
<400> 51
gtg ctc gac ggg gcc atg ggc acc atg atc cag agg cat ggc ctc gac 48
Val Leu Asp Gly Ala Met Gly Thr Met Ile Gln Arg His Gly Leu Asp
1 5 10 15gaa cag gac tac cgg ggc gag cgt ttc gct tcg cat gac cat ccg ctg96Glu Gln Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu20 25 30aag ggc aac aac gac ctt ctt gtc atc acc cgg ccc gac atc atc cgt144Lys Gly Asn Asn Asp Leu Leu Val Ile Thr Arg Pro Asp Ile Ile Arg35 40 45tcg atc cac tgc gac ttc ctc gac gcg ggt gcg gac atc atc gag acc192Ser Ile His Cys Asp Phe Leu Asp Ala Gly Ala Asp Ile Ile Glu Thr50 55 60tgc acc ttc aac gcc aac ccg atc tcg cag tcg gac tac cag ttg cag240Cys Thr Phe Asn Ala Asn Pro Ile Ser Gln Ser Asp Tyr Gln Leu Gln65 70 75 80gac ttg acc cgc gag ctg aac gtg gcg gcg gca aag ata gcc cgc tcg288Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys Ile Ala Arg Ser85 90 95gca gcg gac gag ttc acc gca aag act ccc gac aag ccg cgt ttc gtg336Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val100 105 110gcc ggt tcc atc gga ccg acc aac aag acg ctc tcg ctc tcg ccg gac384Ala Gly Ser Ile Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp115 120 125gtg aac aac ccc ggc ttc cgc gcc gtc acc ttc cag gag atg gtc gat432Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gln Glu Met Val Asp130 135 140aac tac act gcc cag ctc gaa ggc ttg cac gag ggc ggt gtc gat ctc480Asn Tyr Thr Ala Gln Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu145 150 155 160ttg ctc gtc gag acg gtg ttc gac aca ctg aac tgc aag gcg gcg ctc528Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu165 170 175tac gct atc gag gag tac gcg gtg aaa acc ggc tgg cag gtg ccc gtg576Tyr Ala Ile Glu Glu Tyr Ala Val Lys Thr Gly Trp Gln Val Pro Val180 185 190atg gtc tcc ggc acg gtg gtg gac gcg agc ggc cgc acc ctc tcc ggc624Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly195 200 205caa acc acc gag gcg ttc tgg att tcg att tcg cac atg ccg agt ctg672Gln Thr Thr Glu Ala Phe Trp Ile Ser Ile Ser His Met Pro Ser Leu210 215 220ctc tcg gtc ggc ctg aac tgc gca ctc ggc tcc aag cag atg cgc ccc720Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gln Met Arg Pro225 230 235 240ttc atc gag gcg ctc tcg aac atc gcc 
gaa agc tac gtc agc gtc tat768Phe Ile Glu Ala Leu Ser Asn Ile Ala Glu Ser Tyr Val Ser Val Tyr245 250 255
ccc aac gcg ggc ctg ccg aat gag ttc ggc gag tac gac gac tcc ccc816Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro260 265 270gag tac atg gcc gcg cag atc gcg ggc ttc gcc gaa tca ggc ttc gtg864Glu Tyr Met Ala Ala Gln Ile Ala Gly Phe Ala Glu Ser Gly Phe Val275 280 285aac atc gtc ggc ggc tgc tgc ggc acc acg ccg acg cac atc cgc gcc912Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His Ile Arg Ala290 295 300att gcc gaa gcg gtc aag act ctc ccg ccg aga aag cgc ccc gcc aac960Ile Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn305 310 315 320aag cac gtg ctg agg ctc tcc ggc ctc gaa ccg ctc gtg gtt gac gaa1008Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu325 330 335acc acc ggc ttc atc aac gtc ggc gag cgc acc aac gtc acc ggt tcg1056Thr Thr Gly Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser340 345 350cgc aag ttc gcc cgc ctc atc aag gag gcc aat tac gac gaa gcg ctc1104Arg Lys Phe Ala Arg Leu Ile Lys Glu Ala Asn Tyr Asp Glu Ala Leu355 360 365tcc att gcc cgc cag cag gtc gag aac ggc gcg cag gtg atc gac gtg1152Ser Ile Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile Asp Val370 375 380aac ctc gac gaa gga atg ctc gac tcc gaa aag gtg atc gtc gaa ttc1200Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val Ile Val Glu Phe385 390 395 400ctg aac ctc atc gcc tcc gag cct gag atc gcc aag gtg ccg gtg atg1248Leu Asn Leu Ile Ala Ser Glu Pro Glu Ile Ala Lys Val Pro Val Met405 410 415atc gac tcg tcg aaa tgg tcg gtc atc gaa aac ggc ctg cgc tgc acc1296Ile Asp Ser Ser Lys Trp Ser Val Ile Glu Asn Gly Leu Arg Cys Thr420 425 430cag ggc aag agc atc gtc aac tcg atc agc ctc aag gag ggc gag gag1344Gln Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu435 440 445ctg ttc aag gag cgc gct cgc aag atc atg caa tac ggc gcg gcg gcg1392Leu Phe Lys Glu Arg Ala Arg Lys Ile Met Gln Tyr Gly Ala Ala Ala450 455 460gtg gtc atg gcc ttc gac gag cag ggc cag gcc gac agc ctg cac cgc1440Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Ser Leu His Arg465 470 475 480cgc atc gag att 
tgc agc cgc gcc tac aaa att ctc acc gaa gag gtg1488Arg Ile Glu Ile Cys Ser Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val485 490 495ggc ttc ccg ccg gag gac atc atc ttt gac ccg aac gtg ctg acc gtg1536Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val
500 505 510gcc acc ggc atc gac gag cac aac aac tac gcg ctc gac ttc atc gaa1584Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile Glu515 520 525agc gtg cgc tgg atc aag cag aac ctg ccg cac gcg aag gtc tcc ggc1632Ser Val Arg Trp Ile Lys Gln Asn Leu Pro His Ala Lys Val Ser Gly530 535 540ggc atc agc aac gtt tcg ttc tcc ttc cgc ggc aac gag ccg gtg cgc1680Gly Ile Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg545 550 555 560gag gcg atg cac acc gcg ttc ctc tac cac gcc atc cac gcc ggt ctc1728Glu Ala Met His Thr Ala Phe Leu Tyr His Ala Ile His Ala Gly Leu565 570 575gac atg ggc atc gtc aac gcc gcc cag ctt ggc atc tac gaa gag atc1776Asp Met Gly Ile Val Asn Ala Ala Gln Leu Gly Ile Tyr Glu Glu Ile580 585 590gac ccg gag ctt ctt gtc tat gtc gag gac gtg ctg ctg aac cgc cgc1824Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg595 600 605gac gac gcc acc gag cgg ctc gtg gcg ttc gct gaa acg atc cgc gac1872Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr Ile Arg Asp610 615 620ggc ggc gaa aag gcc gag gcc aag aac gcc gaa tgg cgc aac gcc ccg1920Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro625 630 635 640gtc gag gag cgg ctg aaa cac gcg ctc gtc aag ggc atc gtt gac tac1968Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly Ile Val Asp Tyr645 650 655atc gac gag gac acc gaa gag gcc cgc cag ctc tac ccg agt ccg ctg2016Ile Asp Glu Asp Thr Glu Glu Ala Arg Gln Leu Tyr Pro Ser Pro Leu660 665 670gag gtg atc gag ggg ccg ctc atg aac ggc atg aac cac gtc ggc gac2064Glu Val Ile Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp675 680 685ctc ttc gcc gaa ggc aag atg ttc ctg cca cag gtg gtc aaa agc gcc2112Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala690 695 700cgc gtc atg aag cgc tcg gta gct gcg ctg att ccc tat atc gag gag2160Arg Val Met Lys Arg Ser Val Ala Ala Leu Ile Pro Tyr Ile Glu Glu705 710 715 720gag aag tcg aaa aac tgc gac acg agc gcc aaa gcc aag gtg ctg ctc2208Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu725 730 
735gcc acg gtg aag ggc gac gtg cac gac atc ggc aag aac atc gtg tcg2256Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser740 745 750
gtg gtg ctt gcc tgc aac aac ttc gac gtg atc gac atc ggc gtc atg2304Val Val Leu Ala Cys Asn Asn Phe Asp Val Ile Asp Ile Gly Val Met755 760 765atg cca tgc gac aag att ctc gaa gcg ctg gca gaa cac aag ccc gac2352Met Pro Cys Asp Lys Ile Leu Glu Ala Leu Ala Glu His Lys Pro Asp770 775 780gtg ctc ggc ctc tcc ggc ctc atc acc ccg tcg ctc gaa gag atg gcg2400Val Leu Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala785 790 795 800cac gtg gcc aaa gag atg gag cgg ctc ggc atg aac att ccg ctc atc2448His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn Ile Pro Leu Ile805 810 815atc ggc ggc gcg acc acc tcg aag gtg cac acg gcg gtg aaa ctc gcg2496Ile Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala820 825 830ccc tgc tac ccc agc ggc gcg gta gta cac gtg ctc gac gcc tcg cgc2544Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg835 840 845agc gtg ccg gtg gtc agc aac ctc tgc aac ccc gcc cag cgc gac agc2592Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gln Arg Asp Ser850 855 860tat atc gcg gcg ctg aag gat gag cag gag gcg atg cgc aag agc cac2640Tyr Ile Ala Ala Leu Lys Asp Glu Gln Glu Ala Met Arg Lys Ser His865 870 875 880gcc gag cgc atg gcg gca aaa aag tac gtc tcg ctc gac gcc gcc cgc2688Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg885 890 895gac aac cgc ctc acc att gac tgg gag gcc gaa acc atc gac aag ccc2736Asp Asn Arg Leu Thr Ile Asp Trp Glu Ala Glu Thr Ile Asp Lys Pro900 905 910gcc cag act ggc gtc acc gtg ctg gag gat gtc acc gtc ggc gcg ctc2784Ala Gln Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu915 920 925cgc ccg tat atc gac tgg gca mcc ttc ttc tgg agc tgg gag ctg cac2832Arg Pro Tyr Ile Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His930 935 940ggc gtc tat ccg cag att ctg gag gat gaa aag gtc ggc gag gag gca2880Gly Val Tyr Pro Gln Ile Leu Glu Asp Glu Lys Val Gly Glu Glu Ala945 950 955 960acc aaa ctc ttc aac gac gcc acc gct ctg ctc gac cgg atc gac agc2928Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg Ile Asp Ser965 970 975gaa aag ctg 
ctc ggc atc aaa ggc gtg gcg ggc atc ttc ccg gcc aac2976Glu Lys Leu Leu Gly Ile Lys Gly Val Ala Gly Ile Phe Pro Ala Asn980 985 990agc atc ggc gac gac atc ttc gtc tat gcg gat gac gag cgc tcg ata3024Ser Ile Gly Asp Asp Ile Phe Val Tyr Ala Asp Asp Glu Arg Ser Ile
99510001005atc cgc acc gtg ctg cac acc ctg cgc cag caa ggc gaa aag cac ggc3072Ile Arg Thr Val Leu His Thr Leu Arg Gln Gln Gly Glu Lys His Gly101010151020gaa gcg aac ctc gcg ctg gcg gac ttc gtg gcc ccg cgc gaa agc ggc3120Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly1025103010351040gtc aac gac tgg atc ggc tgc ttc acc gta acc gcc gga ctc ggc atc3168Val Asn Asp Trp Ile Gly Cys Phe Thr Val Thr Ala Gly Leu Gly Ile104510501055cag aat ttg ctc gac gag ttc aca gca gag aac gac gac tac cac cgc3216Gln Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg106010651070atc atg aca cag gcg ctc gcc gac cga ctg gcc gaa gcg ttc gca gag3264Ile Met Thr Gln Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu107510801085atg ctg cac gaa aag gtg cgc cgc gaa ctc tgg ggc tac gcg ccc ggc3312Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly109010951100gaa atc ctc ggc aac gaa gag ctg atc gcc gaa aag tac cga ggc atc3360Glu Ile Leu Gly Asn Glu Glu Leu Ile Ala Glu Lys Tyr Arg Gly Ile1105111011151120cgc ccc gcc ccc ggc tac ccc gcc tgc ccg gat cac acc gaa aag gca3408Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala112511301135atc atc ttc gac ctg ctc aac gct gaa gcg gcc acc ggc gtc acg ctg3456Ile Ile Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu114011451150acg gaa act ttc gcg atg aac ccc gca gcc tca gtc tgc ggc ctc tac3504Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr115511601165ttc gcc aac ccg gcc tcg aaa tac ttc gta ctc ggc aag att ggt aag3552Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys Ile Gly Lys117011751180gat cag gtc gaa gac tac gcc aac cgc aaa ggg ctg gaa gta gca gaa3600Asp Gln Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu1185119011951200gcc gag aag tgg ctc gcg ccc tcg ctg aac tac gat cca gcg3642Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala12051210taa3645210522111214212PRT213Chlorobium tepidum
<220>
<221> unsure
<222> 936..936
<223> All Xaa denote any amino acid
<400> 52
Val Leu Asp Gly Ala Met Gly Thr Met Ile Gln Arg His Gly Leu Asp
1 5 10 15
Glu Gln Asp Tyr Arg Gly Glu Arg Phe Ala Ser His Asp His Pro Leu
20 25 30
Lys Gly Asn Asn Asp Leu Leu Val Ile Thr Arg Pro Asp Ile Ile Arg
35 40 45
Ser Ile His Cys Asp Phe Leu Asp Ala Gly Ala Asp Ile Ile Glu Thr
50 55 60
Cys Thr Phe Asn Ala Asn Pro Ile Ser Gln Ser Asp Tyr Gln Leu Gln
65 70 75 80
Asp Leu Thr Arg Glu Leu Asn Val Ala Ala Ala Lys Ile Ala Arg Ser
85 90 95
Ala Ala Asp Glu Phe Thr Ala Lys Thr Pro Asp Lys Pro Arg Phe Val
100 105 110
Ala Gly Ser Ile Gly Pro Thr Asn Lys Thr Leu Ser Leu Ser Pro Asp
115 120 125
Val Asn Asn Pro Gly Phe Arg Ala Val Thr Phe Gln Glu Met Val Asp
130 135 140
Asn Tyr Thr Ala Gln Leu Glu Gly Leu His Glu Gly Gly Val Asp Leu
145 150 155 160
Leu Leu Val Glu Thr Val Phe Asp Thr Leu Asn Cys Lys Ala Ala Leu
165 170 175
Tyr Ala Ile Glu Glu Tyr Ala Val Lys Thr Gly Trp Gln Val Pro Val
180 185 190
Met Val Ser Gly Thr Val Val Asp Ala Ser Gly Arg Thr Leu Ser Gly
195 200 205
Gln Thr Thr Glu Ala Phe Trp Ile Ser Ile Ser His Met Pro Ser Leu
210 215 220
Leu Ser Val Gly Leu Asn Cys Ala Leu Gly Ser Lys Gln Met Arg Pro
225 230 235 240
Phe Ile Glu Ala Leu Ser Asn Ile Ala Glu Ser Tyr Val Ser Val Tyr
245 250 255
Pro Asn Ala Gly Leu Pro Asn Glu Phe Gly Glu Tyr Asp Asp Ser Pro
260 265 270
Glu Tyr Met Ala Ala Gln Ile Ala Gly Phe Ala Glu Ser Gly Phe Val
275 280 285
Asn Ile Val Gly Gly Cys Cys Gly Thr Thr Pro Thr His Ile Arg Ala
290 295 300
Ile Ala Glu Ala Val Lys Thr Leu Pro Pro Arg Lys Arg Pro Ala Asn305 310 315 320Lys His Val Leu Arg Leu Ser Gly Leu Glu Pro Leu Val Val Asp Glu325 330 335Thr Thr Gly Phe Ile Asn Val Gly Glu Arg Thr Asn Val Thr Gly Ser340 345 350Arg Lys Phe Ala Arg Leu Ile Lys Glu Ala Asn Tyr Asp Glu Ala Leu355 360 365Ser Ile Ala Arg Gln Gln Val Glu Asn Gly Ala Gln Val Ile Asp Val370 375 380Asn Leu Asp Glu Gly Met Leu Asp Ser Glu Lys Val Ile Val Glu Phe385 390 395 400Leu Asn Leu Ile Ala Ser Glu Pro Glu Ile Ala Lys Val Pro Val Met405 410 415Ile Asp Ser Ser Lys Trp Ser Val Ile Glu Asn Gly Leu Arg Cys Thr420 425 430Gln Gly Lys Ser Ile Val Asn Ser Ile Ser Leu Lys Glu Gly Glu Glu435 440 445Leu Phe Lys Glu Arg Ala Arg Lys Ile Met Gln Tyr Gly Ala Ala Ala450 455 460Val Val Met Ala Phe Asp Glu Gln Gly Gln Ala Asp Ser Leu His Arg465 470 475 480Arg Ile Glu Ile Cys Ser Arg Ala Tyr Lys Ile Leu Thr Glu Glu Val485 490 495Gly Phe Pro Pro Glu Asp Ile Ile Phe Asp Pro Asn Val Leu Thr Val500 505 510Ala Thr Gly Ile Asp Glu His Asn Asn Tyr Ala Leu Asp Phe Ile Glu515 520 525Ser Val Arg Trp Ile Lys Gln Asn Leu Pro His Ala Lys Val Ser Gly530 535 540Gly Ile Ser Asn Val Ser Phe Ser Phe Arg Gly Asn Glu Pro Val Arg545 550 555 560Glu Ala Met His Thr Ala Phe Leu Tyr His Ala Ile His Ala Gly Leu565 570 575Asp Met Gly Ile Val Asn Ala Ala Gln Leu Gly Ile Tyr Glu Glu Ile580 585 590Asp Pro Glu Leu Leu Val Tyr Val Glu Asp Val Leu Leu Asn Arg Arg595 600 605Asp Asp Ala Thr Glu Arg Leu Val Ala Phe Ala Glu Thr Ile Arg Asp610 615 620Gly Gly Glu Lys Ala Glu Ala Lys Asn Ala Glu Trp Arg Asn Ala Pro
625 630 635 640Val Glu Glu Arg Leu Lys His Ala Leu Val Lys Gly Ile Val Asp Tyr645 650 655Ile Asp Glu Asp Thr Glu Glu Ala Arg Gln Leu Tyr Pro Ser Pro Leu660 665 670Glu Val Ile Glu Gly Pro Leu Met Asn Gly Met Asn His Val Gly Asp675 680 685Leu Phe Ala Glu Gly Lys Met Phe Leu Pro Gln Val Val Lys Ser Ala690 695 700Arg Val Met Lys Arg Ser Val Ala Ala Leu Ile Pro Tyr Ile Glu Glu705 710 715 720Glu Lys Ser Lys Asn Cys Asp Thr Ser Ala Lys Ala Lys Val Leu Leu725 730 735Ala Thr Val Lys Gly Asp Val His Asp Ile Gly Lys Asn Ile Val Ser740 745 750Val Val Leu Ala Cys Asn Asn Phe Asp Val Ile Asp Ile Gly Val Met755 760 765Met Pro Cys Asp Lys Ile Leu Glu Ala Leu Ala Glu His Lys Pro Asp770 775 780Val Leu Gly Leu Ser Gly Leu Ile Thr Pro Ser Leu Glu Glu Met Ala785 790 795 800His Val Ala Lys Glu Met Glu Arg Leu Gly Met Asn Ile Pro Leu Ile805 810 815Ile Gly Gly Ala Thr Thr Ser Lys Val His Thr Ala Val Lys Leu Ala820 825 830Pro Cys Tyr Pro Ser Gly Ala Val Val His Val Leu Asp Ala Ser Arg835 840 845Ser Val Pro Val Val Ser Asn Leu Cys Asn Pro Ala Gln Arg Asp Ser850 855 860Tyr Ile Ala Ala Leu Lys Asp Glu Gln Glu Ala Met Arg Lys Ser His865 870 875 880Ala Glu Arg Met Ala Ala Lys Lys Tyr Val Ser Leu Asp Ala Ala Arg885 890 895Asp Asn Arg Leu Thr Ile Asp Trp Glu Ala Glu Thr Ile Asp Lys Pro900 905 910Ala Gln Thr Gly Val Thr Val Leu Glu Asp Val Thr Val Gly Ala Leu915 920 925Arg Pro Tyr Ile Asp Trp Ala Xaa Phe Phe Trp Ser Trp Glu Leu His930 935 940Gly Val Tyr Pro Gln Ile Leu Glu Asp Glu Lys Val Gly Glu Glu Ala945 950 955 960
Thr Lys Leu Phe Asn Asp Ala Thr Ala Leu Leu Asp Arg Ile Asp Ser965 970 975Glu Lys Leu Leu Gly Ile Lys Gly Val Ala Gly Ile Phe Pro Ala Asn980 985 990Ser Ile Gly Asp Asp Ile Phe Val Tyr Ala Asp Asp Glu Arg Ser Ile99510001005Ile Arg Thr Val Leu His Thr Leu Arg Gln Gln Gly Glu Lys His Gly101010151020Glu Ala Asn Leu Ala Leu Ala Asp Phe Val Ala Pro Arg Glu Ser Gly1025 103010351040Val Asn Asp Trp Ile Gly Cys Phe Thr Val Thr Ala Gly Leu Gly Ile104510501055Gln Asn Leu Leu Asp Glu Phe Thr Ala Glu Asn Asp Asp Tyr His Arg106010651070Ile Met Thr Gln Ala Leu Ala Asp Arg Leu Ala Glu Ala Phe Ala Glu107510801085Met Leu His Glu Lys Val Arg Arg Glu Leu Trp Gly Tyr Ala Pro Gly109010951100Glu Ile Leu Gly Asn Glu Glu Leu Ile Ala Glu Lys Tyr Arg Gly Ile1105 111011151120Arg Pro Ala Pro Gly Tyr Pro Ala Cys Pro Asp His Thr Glu Lys Ala112511301135Ile Ile Phe Asp Leu Leu Asn Ala Glu Ala Ala Thr Gly Val Thr Leu114011451150Thr Glu Thr Phe Ala Met Asn Pro Ala Ala Ser Val Cys Gly Leu Tyr115511601165Phe Ala Asn Pro Ala Ser Lys Tyr Phe Val Leu Gly Lys Ile Gly Lys117011751180Asp Gln Val Glu Asp Tyr Ala Asn Arg Lys Gly Leu Glu Val Ala Glu1185 119011951200Ala Glu Lys Trp Leu Ala Pro Ser Leu Asn Tyr Asp Pro Ala120512102105321152212DNA213人工序列220
<223> Description of artificial sequence: PCR primer
<400> 53
cccgggatcc gctagcggcg cgccggccgg cccggtgtga aataccgcac ag 52
<210> 54
<211> 53
<212> DNA
<213> Artificial sequence
<220>
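<223> Description of artificial sequence: PCR primer
<400> 54
tctagactcg agcggccgcg gccggccttt aaattgaaga cgaaagggcc tcg 53
<210> 55
<211> 47
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: PCR primer
<400> 55
gagatctaga cccggggatc cgctagcggg ctgctaaagg aagcgga 47
<210> 56
<211> 38
<212> DNA
<213> Artificial sequence
<220>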
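<223> Description of artificial sequence: PCR primer
<400> 56
gagaggcgcg ccgctagcgt gggcgaagaa ctccagca 38
<210> 57
<211> 34
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: PCR primer
<400> 57
gagagggcgg ccgcgcaaag tcccgcttcg tgaa 34
<210> 58
<211> 34
<212> DNA
<213> Artificial sequence
<220>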
<223> Description of artificial sequence: PCR primer
<400> 58
gagagggcgg ccgctcaagt cggtcaagcc acgc 34
<210> 59
<211> 140
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: PCR primer
<400> 59
tcgaatttaa atctcgagag gcctgacgtc gggcccggta ccacgcgtca tatgactagt 60
tcggacctag ggatatcgtc gacatcgatg ctcttctgcg ttaattaaca attgggatcc 120
tctagacccg ggatttaaat 140
<210> 60
<211> 140
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: PCR primer
<400> 60
gatcatttaa atcccgggtc tagaggatcc caattgttaa ttaacgcaga agagcatcga 60
tgtcgacgat atccctaggt ccgaactagt catatgacgc gtggtaccgg gcccgacgtc 120
aggcctctcg agatttaaat 140
<210> 61
<211> 33
<212> DNA
<213> Artificial sequence
<220>
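<223> Description of artificial sequence: PCR primer
<400> 61
gagagcggcc gccgatcctt tttaacccat cac 33
<210> 62
<211> 32
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: PCR primer
<400> 62
aggagcggcc gccatcggca ttttcttttg cg 32
<210> 63
<211> 5091
<212> DNA
<213> Artificial sequence
<220>
<223> Description of artificial sequence: plasmid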
40063gccgcgactg ccttcgcgaa gccttgcccc gcggaaattt cctccaccga gttcgtgcac 60acccctatgc caagcttctt tcaccctaaa ttcgagagat tggattctta ccgtggaaat 120tcttcgcaaa aatcgtcccc tgatcgccct tgcgacgttg gcgtcggtgc cgctggttgc 180gcttggcttg accgacttga tcagcggccg ctcgatttaa atctcgagag gcctgacgtc 240gggcccggta ccacgcgtca tatgactagt tcggacctag ggatatcgtc gacatcgatg 300ctcttctgcg ttaattaaca attgggatcc tctagacccg ggatttaaat cgctagcggg 360ctgctaaagg aagcggaaca cgtagaaagc cagtccgcag aaacggtgct gaccccggat 420gaatgtcagc tactgggcta tctggacaag ggaaaacgca agcgcaaaga gaaagcaggt 480agcttgcagt gggcttacat ggcgatagct agactgggcg gttttatgga cagcaagcga 540accggaattg ccagctgggg cgccctctgg taaggttggg aagccctgca aagtaaactg 600gatggctttc ttgccgccaa ggatctgatg gcgcagggga tcaagatctg atcaagagac 660aggatgagga tcgtttcgca tgattgaaca agatggattg cacgcaggtt ctccggccgc 720ttgggtggag aggctattcg gctatgactg ggcacaacag acaatcggct gctctgatgc 780cgccgtgttc cggctgtcag cgcaggggcg cccggttctt tttgtcaaga ccgacctgtc 840cggtgccctg aatgaactgc aggacgaggc agcgcggcta tcgtggctgg ccacgacggg 900cgttccttgc gcagctgtgc tcgacgttgt cactgaagcg ggaagggact ggctgctatt 960gggcgaagtg ccggggcagg atctcctgtc atctcacctt gctcctgccg agaaagtatc 1020catcatggct gatgcaatgc ggcggctgca tacgcttgat ccggctacct gcccattcga 1080ccaccaagcg aaacatcgca tcgagcgagc acgtactcgg atggaagccg gtcttgtcga 1140tcaggatgat ctggacgaag agcatcaggg gctcgcgcca gccgaactgt tcgccaggct 1200caaggcgcgc atgcccgacg gcgaggatct cgtcgtgacc catggcgatg cctgcttgcc 1260gaatatcatg gtggaaaatg gccgcttttc tggattcatc gactgtggcc ggctgggtgt 1320ggcggaccgc tatcaggaca tagcgttggc tacccgtgat attgctgaag agcttggcgg 1380cgaatgggct gaccgcttcc tcgtgcttta cggtatcgcc gctcccgatt cgcagcgcat 1440cgccttctat cgccttcttg acgagttctt ctgagcggga ctctggggtt cgaaatgacc 1500gaccaagcga cgcccaacct gccatcacga gatttcgatt ccaccgccgc cttctatgaa 1560aggttgggct tcggaatcgt tttccgggac gccggctgga tgatcctcca gcgcggggat 1620ctcatgctgg agttcttcgc ccacgctagc ggcgcgccgg ccggcccggt gtgaaatacc 1680gcacagatgc gtaaggagaa aataccgcat caggcgctct 
tccgcttcct cgctcactga 1740ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 1800acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 1860aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 1920tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 1980aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 2040gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 2100acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 2160accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 2220ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 2280gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 2340gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 2400ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 2460gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 2520cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 2580cttcacctag atccttttaa aggccggccg cggccgcgca aagtcccgct tcgtgaaaat 2640tttcgtgccg cgtgattttc cgccaaaaac tttaacgaac gttcgttata atggtgtcat 2700gaccttcacg acgaagtact aaaattggcc cgaatcatca gctatggatc tctctgatgt 2760cgcgctggag tccgacgcgc tcgatgctgc cgtcgattta aaaacggtga tcggattttt 2820ccgagctctc gatacgacgg acgcgccagc atcacgagac tgggccagtg ccgcgagcga 2880cctagaaact ctcgtggcgg atcttgagga gctggctgac gagctgcgtg ctcggccagc 2940gccaggagga cgcacagtag tggaggatgc aatcagttgc gcctactgcg gtggcctgat 3000tcctccccgg cctgacccgc gaggacggcg cgcaaaatat tgctcagatg cgtgtcgtgc 3060cgcagccagc cgcgagcgcg ccaacaaacg ccacgccgag gagctggagg cggctaggtc 3120gcaaatggcg ctggaagtgc gtcccccgag cgaaattttg gccatggtcg tcacagagct 3180ggaagcggca gcgagaatta tcgcgatcgt ggcggtgccc gcaggcatga caaacatcgt 3240aaatgccgcg tttcgtgtgc cgtggccgcc caggacgtgt cagcgccgcc accacctgca 3300ccgaatcggc agcagcgtcg cgcgtcgaaa aagcgcacag gcggcaagaa gcgataagct 3360gcacgaatac ctgaaaaatg ttgaacgccc cgtgagcggt aactcacagg gcgtcggcta 3420acccccagtc 
caaacctggg agaaagcgct caaaaatgac tctagcggat tcacgagaca 3480ttgacacacc ggcctggaaa ttttccgctg atctgttcga cacccatccc gagctcgcgc 3540tgcgatcacg tggctggacg agcgaagacc gccgcgaatt cctcgctcac ctgggcagag 3600aaaatttcca gggcagcaag acccgcgact tcgccagcgc ttggatcaaa gacccggaca 3660
cggagaaaca cagccgaagt tataccgagt tggttcaaaa tcgcttgccc ggtgccagta 3720tgttgctctg acgcacgcgc agcacgcagc cgtgcttgtc ctggacattg atgtgccgag 3780ccaccaggcc ggcgggaaaa tcgagcacgt aaaccccgag gtctacgcga ttttggagcg 3840ctgggcacgc ctggaaaaag cgccagcttg gatcggcgtg aatccactga gcgggaaatg 3900ccagctcatc tggctcattg atccggtgta tgccgcagca ggcatgagca gcccgaatat 3960gcgcctgctg gctgcaacga ccgaggaaat gacccgcgtt ttcggcgctg accaggcttt 4020ttcacatagg ctgagccgtg gccactgcac tctccgacga tcccagccgt accgctggca 4080tgcccagcac aatcgcgtgg atcgcctagc tgatcttatg gaggttgctc gcatgatctc 4140aggcacagaa aaacctaaaa aacgctatga gcaggagttt tctagcggac gggcacgtat 4200cgaagcggca agaaaagcca ctgcggaagc aaaagcactt gccacgcttg aagcaagcct 4260gccgagcgcc gctgaagcgt ctggagagct gatcgacggc gtccgtgtcc tctggactgc 4320tccagggcgt gccgcccgtg atgagacggc ttttcgccac gctttgactg tgggatacca 4380gttaaaagcg gctggtgagc gcctaaaaga caccaagggt catcgagcct acgagcgtgc 4440ctacaccgtc gctcaggcgg tcggaggagg ccgtgagcct gatctgccgc cggactgtga 4500ccgccagacg gattggccgc gacgtgtgcg cggctacgtc gctaaaggcc agccagtcgt 4560ccctgctcgt cagacagaga cgcagagcca gccgaggcga aaagctctgg ccactatggg 4620aagacgtggc ggtaaaaagg ccgcagaacg ctggaaagac ccaaacagtg agtacgcccg 4680agcacagcga gaaaaactag ctaagtccag tcaacgacaa gctaggaaag ctaaaggaaa 4740tcgcttgacc attgcaggtt ggtttatgac tgttgaggga gagactggct cgtggccgac 4800aatcaatgaa gctatgtctg aatttagcgt gtcacgtcag accgtgaata gagcacttaa 4860ggtctgcggg cattgaactt ccacgaggac gccgaaagct tcccagtaaa tgtgccatct 4920cgtaggcaga aaacggttcc cccgtagggt ctctctcttg gcctcctttc taggtcgggc 4980tgattgctct tgaagctctc taggggggct cacaccatag gcagataacg ttccccaccg 5040gctcgcctcg taagcgcaca aggactgctc ccaaagatct tcaaagccac t 5091210642114323212DNA213人工序列220
223人工序列的描繪質粒40064tctctcagcg tatggttgtc gcctgagctg tagttgcctt catcgatgaa ctgctgtaca 60ttttgatacg tttttccgtc accgtcaaag attgatttat aatcctctac accgttgatg 120ttcaaagagc tgtctgatgc tgatacgtta acttgtgcag ttgtcagtgt ttgtttgccg 180taatgtttac cggagaaatc agtgtagaat aaacggattt ttccgtcaga tgtaaatgtg 240gctgaacctg accattcttg tgtttggtct tttaggatag aatcatttgc atcgaatttg 300tcgctgtctt taaagacgcg gccagcgttt ttccagctgt caatagaagt ttcgccgact 360ttttgataga acatgtaaat cgatgtgtca tccgcatttt taggatctcc ggctaatgca 420aagacgatgt ggtagccgtg atagtttgcg acagtgccgt cagcgttttg taatggccag 480ctgtcccaaa cgtccaggcc ttttgcagaa gagatatttt taattgtgga cgaatcaaat 540tcagaaactt gatatttttc atttttttgc tgttcaggga tttgcagcat atcatggcgt 600gtaatatggg aaatgccgta tgtttcctta tatggctttt ggttcgtttc tttcgcaaac 660gcttgagttg cgcctcctgc cagcagtgcg gtagtaaagg ttaatactgt tgcttgtttt 720gcaaactttt tgatgttcat cgttcatgtc tcctttttta tgtactgtgt tagcggtctg 780cttcttccag ccctcctgtt tgaagatggc aagttagtta cgcacaataa aaaaagacct 840aaaatatgta aggggtgacg ccaaagtata cactttgccc tttacacatt ttaggtcttg 900cctgctttat cagtaacaaa cccgcgcgat ttacttttcg acctcattct attagactct 960cgtttggatt gcaactggtc tattttcctc ttttgtttga tagaaaatca taaaaggatt 1020tgcagactac gggcctaaag aactaaaaaa tctatctgtt tcttttcatt ctctgtattt 1080tttatagttt ctgttgcatg ggcataaagt tgccttttta atcacaattc agaaaatatc 1140ataatatctc atttcactaa ataatagtga acggcaggta tatgtgatgg gttaaaaagg 1200atcggcggcc gctcgattta aatctcgaga ggcctgacgt cgggcccggt accacgcgtc 1260atatgactag ttcggaccta gggatatcgt cgacatcgat gctcttctgc gttaattaac 1320aattgggatc ctctagaccc gggatttaaa tcgctagcgg gctgctaaag gaagcggaac 1380acgtagaaag ccagtccgca gaaacggtgc tgaccccgga tgaatgtcag ctactgggct 1440atctggacaa gggaaaacgc aagcgcaaag agaaagcagg tagcttgcag tgggcttaca 1500
tggcgatagc tagactgggc ggttttatgg acagcaagcg aaccggaatt gccagctggg 1560gcgccctctg gtaaggttgg gaagccctgc aaagtaaact ggatggcttt cttgccgcca 1620aggatctgat ggcgcagggg atcaagatct gatcaagaga caggatgagg atcgtttcgc 1680atgattgaac aagatggatt gcacgcaggt tctccggccg cttgggtgga gaggctattc 1740ggctatgact gggcacaaca gacaatcggc tgctctgatg ccgccgtgtt ccggctgtca 1800gcgcaggggc gcccggttct ttttgtcaag accgacctgt ccggtgccct gaatgaactg 1860caggacgagg cagcgcggct atcgtggctg gccacgacgg gcgttccttg cgcagctgtg 1920ctcgacgttg tcactgaagc gggaagggac tggctgctat tgggcgaagt gccggggcag 1980gatctcctgt catctcacct tgctcctgcc gagaaagtat ccatcatggc tgatgcaatg 2040cggcggctgc atacgcttga tccggctacc tgcccattcg accaccaagc gaaacatcgc 2100atcgagcgag cacgtactcg gatggaagcc ggtcttgtcg atcaggatga tctggacgaa 2160gagcatcagg ggctcgcgcc agccgaactg ttcgccaggc tcaaggcgcg catgcccgac 2220ggcgaggatc tcgtcgtgac ccatggcgat gcctgcttgc cgaatatcat ggtggaaaat 2280ggccgctttt ctggattcat cgactgtggc cggctgggtg tggcggaccg ctatcaggac 2340atagcgttgg ctacccgtga tattgctgaa gagcttggcg gcgaatgggc tgaccgcttc 2400ctcgtgcttt acggtatcgc cgctcccgat tcgcagcgca tcgccttcta tcgccttctt 2460gacgagttct tctgagcggg actctggggt tcgaaatgac cgaccaagcg acgcccaacc 2520tgccatcacg agatttcgat tccaccgccg ccttctatga aaggttgggc ttcggaatcg 2580ttttccggga cgccggctgg atgatcctcc agcgcgggga tctcatgctg gagttcttcg 2640cccacgctag cggcgcgccg gccggcccgg tgtgaaatac cgcacagatg cgtaaggaga 2700aaataccgca tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt 2760cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca 2820ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa 2880aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat 2940cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 3000cctggaagct ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc 3060gcctttctcc cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt 3120tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac 3180cgctgcgcct tatccggtaa ctatcgtctt 
gagtccaacc cggtaagaca cgacttatcg 3240ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca 3300gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc 3360gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa 3420accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa 3480ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac 3540tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta 3600aaggccggcc gcggccgcca tcggcatttt cttttgcgtt tttatttgtt aactgttaat 3660tgtccttgtt caaggatgct gtctttgaca acagatgttt tcttgccttt gatgttcagc 3720aggaagctcg gcgcaaacgt tgattgtttg tctgcgtaga atcctctgtt tgtcatatag 3780cttgtaatca cgacattgtt tcctttcgct tgaggtacag cgaagtgtga gtaagtaaag 3840gttacatcgt taggatcaag atccattttt aacacaaggc cagttttgtt cagcggcttg 3900tatgggccag ttaaagaatt agaaacataa ccaagcatgt aaatatcgtt agacgtaatg 3960ccgtcaatcg tcatttttga tccgcgggag tcagtgaaca ggtaccattt gccgttcatt 4020ttaaagacgt tcgcgcgttc aatttcatct gttactgtgt tagatgcaat cagcggtttc 4080atcacttttt tcagtgtgta atcatcgttt agctcaatca taccgagagc gccgtttgct 4140aactcagccg tgcgtttttt atcgctttgc agaagttttt gactttcttg acggaagaat 4200gatgtgcttt tgccatagta tgctttgtta aataaagatt cttcgccttg gtagccatct 4260tcagttccag tgtttgcttc aaatactaag tatttgtggc ctttatcttc tacgtagtga 4320gga 43232106521135212DNA213PCR引物40065gagagagaga cgcgtcccag tggctgagac gcatc 352106621134212DNA
<213> PCR primer
<400> 66
ctctctctgt cgacgaattc aatcttacgg cctg 34
<210> 67
<211> 38
<212> DNA
<213> PCR primer
<400> 67
cggcaccacc gacatcatct tcacctgccc tcgttccg 38
<210> 68
<211> 38
<212> DNA
<213> PCR primer
<400> 68
cggaacgagg gcaggtgaag atgatgtcgg tggtgccg 38
<210> 69
<211> 31
<212> DNA
<213> PCR primer
<400> 69
gagactcgag ggaaggtgaa tcgaatttcg g 31
<210> 70
<211> 38
<212> DNA
<213> PCR primer
<400> 70
gtcccgggga gaacgcacga ttctccaaaa ataatcgc 38
<210> 71
<211> 23
<212> DNA
<213> PCR primer
<400> 71
gaatcgtgcg ttctccccgg gac 23
<210> 72
<211> 22
<212> DNA
<213> PCR primer
<400> 72
gtagttgacc gagttgatca cc 22
<210> 73
<211> 18
<212> DNA
<213> PCR primer
<400> 73
ccggcctgga gaagctcg 18
<210> 74
<211> 28
<212> DNA
<213> PCR primer
<400> 74
gagagatatc cctcagcggg cgttgaag 28
<210> 75
<211> 1266
<212> DNA
<213> LysC mutant
<220>
<221> CDS
<222> (1)..(1266)
<223>
40075gtg gcc ctg gtc gta cag aaa tat ggc ggt tcc tcg ctt gag agt gcg 48Val Ala Leu Val Val Gln Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala1 5 10 15gaa cgc att aga aac gtc gct gaa cgg atc gtt gcc acc aag aag gct 96Glu Arg Ile Arg Asn Val Ala Glu Arg Ile Val Ala Thr Lys Lys Ala20 25 30gga aat gat gtc gtg gtt gtc tgc tcc gca atg gga gac acc acg gat144Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp35 40 45gaa ctt cta gaa ctt gca gcg gca gtg aat ccc gtt ccg cca gct cgt192Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg50 55 60gaa atg gat atg ctc ctg act gct ggt gag cgt att tct aac gct ctc240Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg Ile Ser Asn Ala Leu65 70 75 80gtc gcc atg gct att gag tcc ctt ggc gca gaa gcc caa tct ttc acg288Val Ala Met Ala Ile Glu Ser Leu Gly Ala Glu Ala Gln Ser Phe Thr85 90 95ggc tct cag gct ggt gtg ctc acc acc gag cgc cac gga aac gca cgc336Gly Ser Gln Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg100 105 110att gtt gat gtc act cca ggt cgt gtg cgt gaa gca ctc gat gag ggc384Ile Val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly115 120 125aag atc tgc att gtt gct ggt ttc cag ggt gtt aat aaa gaa acc cgc432Lys Ile Cys Ile Val Ala Gly Phe Gln Gly Val Asn Lys Glu Thr Arg130 135 140gat gtc acc acg ttg ggt cgt ggt ggt tct gac acc act gca gtt gcg480Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala145 150 155 160ttg gca gct gct ttg aac gct gat gtg tgt gag att tac tcg gac gtt528Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu Ile Tyr Ser Asp Val
165 170 175gac ggt gtg tat acc gct gac ccg cgc atc gtt cct aat gca cag aag576Asp Gly Val Tyr Thr Ala Asp Pro Arg Ile Val Pro Asn Ala Gln Lys180 185 190ctg gaa aag ctc agc ttc gaa gaa atg ctg gaa ctt gct gct gtt ggc624Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly195 200 205tcc aag att ttg gtg ctg cgc agt gtt gaa tac gct cgt gca ttc aat672Ser Lys Ile Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn210 215 220gtg cca ctt cgc gta cgc tcg tct tat agt aat gat ccc ggc act ttg720Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu225 230 235 240att gcc ggc tct atg gag gat att cct gtg gaa gaa gca gtc ctt acc768Ile Ala Gly Ser Met Glu Asp Ile Pro Val Glu Glu Ala Val Leu Thr245 250 255ggt gtc gca acc gac aag tcc gaa gcc aaa gta acc gtt ctg ggt att816Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly Ile260 265 270tcc gat aag cca ggc gag gct gcg aag gtt ttc cgt gcg ttg gct gat864Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp275 280 285gca gaa atc aac att gac atg gtt ctg cag aac gtc tct tct gta gaa912Ala Glu Ile Asn Ile Asp Met Val Leu Gln Asn Val Ser Ser Val Glu290 295 300gac ggc acc acc gac atc atc ttc acc tgc cct cgt tcc gac ggc cgc960Asp Gly Thr Thr Asp Ile Ile Phe Thr Cys Pro Arg Ser Asp Gly Arg305 310 315 320cgc gcg atg gag atc ttg aag aag ctt cag gtt cag ggc aac tgg acc 1008Arg Ala Met Glu Ile Leu Lys Lys Leu Gln Val Gln Gly Asn Trp Thr325 330 335aat gtg ctt tac gac gac cag gtc ggc aaa gtc tcc ctc gtg ggt gct 1056Asn Val Leu Tyr Asp Asp Gln Val Gly Lys Val Ser Leu Val Gly Ala340 345 350ggc atg aag tct cac cca ggt gtt acc gca gag ttc atg gaa gct ctg 1104Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu355 360 365cgc gat gtc aac gtg aac atc gaa ttg att tcc acc tct gag att cgt 1152Arg Asp Val Asn Val Asn Ile Glu Leu Ile Ser Thr Ser Glu Ile Arg370 375 380att tcc gtg ctg atc cgt gaa gat gat ctg gat gct gct gca cgt gca 1200Ile Ser Val Leu Ile Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala385 390 395 400ttg 
cat gag cag ttc cag ctg ggc ggc gaa gac gaa gcc gtc gtt tat 1248Leu His Glu Gln Phe Gln Leu Gly Gly Glu Asp Glu Ala Val Val Tyr405 410 415
gca ggc acc gga cgc taa 1266
Ala Gly Thr Gly Arg
420
<210> 76
<211> 421
<212> PRT
<213> LysC mutant
<400> 76
Val Ala Leu Val Val Gln Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala
1 5 10 15
Glu Arg Ile Arg Asn Val Ala Glu Arg Ile Val Ala Thr Lys Lys Ala
20 25 30
Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp
35 40 45
Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg
50 55 60
Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg Ile Ser Asn Ala Leu
65 70 75 80
Val Ala Met Ala Ile Glu Ser Leu Gly Ala Glu Ala Gln Ser Phe Thr
85 90 95
Gly Ser Gln Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg
100 105 110
Ile Val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly
115 120 125
Lys Ile Cys Ile Val Ala Gly Phe Gln Gly Val Asn Lys Glu Thr Arg
130 135 140
Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala
145 150 155 160
Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu Ile Tyr Ser Asp Val
165 170 175
Asp Gly Val Tyr Thr Ala Asp Pro Arg Ile Val Pro Asn Ala Gln Lys
180 185 190
Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly
195 200 205
Ser Lys Ile Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn
210 215 220
Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu
225 230 235 240
Ile Ala Gly Ser Met Glu Asp Ile Pro Val Glu Glu Ala Val Leu Thr
245 250 255
Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly Ile
260 265 270
Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp
275 280 285
Ala Glu Ile Asn Ile Asp Met Val Leu Gln Asn Val Ser Ser Val Glu
290 295 300
Asp Gly Thr Thr Asp Ile Ile Phe Thr Cys Pro Arg Ser Asp Gly Arg
305 310 315 320
Arg Ala Met Glu Ile Leu Lys Lys Leu Gln Val Gln Gly Asn Trp Thr
325 330 335
Asn Val Leu Tyr Asp Asp Gln Val Gly Lys Val Ser Leu Val Gly Ala
340 345 350
Gly Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu
355 360 365
Arg Asp Val Asn Val Asn Ile Glu Leu Ile Ser Thr Ser Glu Ile Arg
370 375 380
Ile Ser Val Leu Ile Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala
385 390 395 400
Leu His Glu Gln Phe Gln Leu Gly Gly Glu Asp Glu Ala Val Val Tyr
405 410 415
Ala Gly Thr Gly Arg
420
<210> 77
<211> 5860
<212> DNA
<213> Plasmid
<400> 77
cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240caagaaggct ggaaatgatg tcgtggttgt ctgctccgca atgggagaca ccacggatga 300acttctagaa cttgcagcgg cagtgaatcc cgttccgcca gctcgtgaaa tggatatgct 360cctgactgct ggtgagcgta tttctaacgc tctcgtcgcc atggctattg agtcccttgg 420cgcagaagcc caatctttca cgggctctca ggctggtgtg ctcaccaccg agcgccacgg 480aaacgcacgc attgttgatg tcactccagg tcgtgtgcgt gaagcactcg atgagggcaa 540gatctgcatt gttgctggtt tccagggtgt taataaagaa acccgcgatg tcaccacgtt 600gggtcgtggt ggttctgaca ccactgcagt tgcgttggca gctgctttga acgctgatgt 660gtgtgagatt tactcggacg ttgacggtgt gtataccgct gacccgcgca tcgttcctaa 720tgcacagaag ctggaaaagc tcagcttcga agaaatgctg gaacttgctg ctgttggctc 780caagattttg gtgctgcgca gtgttgaata cgctcgtgca ttcaatgtgc cacttcgcgt 840acgctcgtct tatagtaatg atcccggcac tttgattgcc ggctctatgg aggatattcc 900tgtggaagaa gcagtcctta ccggtgtcgc aaccgacaag tccgaagcca aagtaaccgt 960tctgggtatt tccgataagc caggcgaggc tgcgaaggtt ttccgtgcgt tggctgatgc1020agaaatcaac attgacatgg ttctgcagaa cgtctcttct gtagaagacg gcaccaccga1080catcaccttc acctgccctc gttccgacgg ccgccgcgcg atggagatct tgaagaagct1140tcaggttcag ggcaactgga ccaatgtgct ttacgacgac caggtcggca aagtctccct1200cgtgggtgct ggcatgaagt ctcacccagg tgttaccgca gagttcatgg aagctctgcg1260cgatgtcaac gtgaacatcg aattgatttc cacctctgag attcgtattt ccgtgctgat1320ccgtgaagat gatctggatg ctgctgcacg tgcattgcat gagcagttcc agctgggcgg1380cgaagacgaa gccgtcgttt atgcaggcac cggacgctaa agttttaaag gagtagtttt1440acaatgacca ccatcgcagt tgttggtgca accggccagg tcggccaggt tatgcgcacc1500cttttggaag agcgcaattt cccagctgac actgttcgtt tctttgcttc cccacgttcc1560gcaggccgta agattgaatt cgtcgacatc gatgctcttc tgcgttaatt aacaattggg1620atcctctaga cccgggattt aaatcgctag cgggctgcta aaggaagcgg aacacgtaga1680aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg 
gctatctgga1740caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt acatggcgat1800agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct ggggcgccct1860
ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct1920gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg1980aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg2040actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg2100ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg2160aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg2220ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc2280tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc2340tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc2400gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc2460aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg2520atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct2580tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt2640tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc2700tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt2760tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc2820acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg2880ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc2940tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc3000gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc3060ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata3120acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg3180cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct3240caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa3300gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc3360tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt3420aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg3480ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg3540cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 
acagagttct3600tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc3660tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg3720
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc3780aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt3840aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg3900gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt3960gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc4020tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata tagcttgtaa4080tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat4140cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc4200cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta atgccgtcaa4260tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga4320cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt4380ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag4440ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc4500ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc4560cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc4620tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt4680gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca4740aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat4800gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg4860aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc4920tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg ccgacttttt4980gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga5040cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt5100cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag5160aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa5220tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt5280gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa5340actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc5400ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa 
agacctaaaa5460tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg5520ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt5580
tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca5640gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta5700tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa5760tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg5820gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860210782115860212DNA213質粒40078cccggtacca cgcgtcccag tggctgagac gcatccgcta aagccccagg aaccctgtgc 60agaaagaaaa cactcctctg gctaggtaga cacagtttat aaaggtagag ttgagcgggt 120aactgtcagc acgtagatcg aaaggtgcac aaaggtggcc ctggtcgtac agaaatatgg 180cggttcctcg cttgagagtg cggaacgcat tagaaacgtc gctgaacgga tcgttgccac 240caagaaggct ggaaatgatg tcgtggttgt ctgctccgca atgggagaca ccacggatga 300acttctagaa cttgcagcgg cagtgaatcc cgttccgcca gctcgtgaaa tggatatgct 360cctgactgct ggtgagcgta tttctaacgc tctcgtcgcc atggctattg agtcccttgg 420cgcagaagcc caatctttca cgggctctca ggctggtgtg ctcaccaccg agcgccacgg 480aaacgcacgc attgttgatg tcactccagg tcgtgtgcgt gaagcactcg atgagggcaa 540gatctgcatt gttgctggtt tccagggtgt taataaagaa acccgcgatg tcaccacgtt 600gggtcgtggt ggttctgaca ccactgcagt tgcgttggca gctgctttga acgctgatgt 660gtgtgagatt tactcggacg ttgacggtgt gtataccgct gacccgcgca tcgttcctaa 720tgcacagaag ctggaaaagc tcagcttcga agaaatgctg gaacttgctg ctgttggctc 780caagattttg gtgctgcgca gtgttgaata cgctcgtgca ttcaatgtgc cacttcgcgt 840acgctcgtct tatagtaatg atcccggcac tttgattgcc ggctctatgg aggatattcc 900tgtggaagaa gcagtcctta ccggtgtcgc aaccgacaag tccgaagcca aagtaaccgt 960tctgggtatt tccgataagc caggcgaggc tgcgaaggtt ttccgtgcgt tggctgatgc1020agaaatcaac attgacatgg ttctgcagaa cgtctcttct gtagaagacg gcaccaccga1080catcatcttc acctgccctc gttccgacgg ccgccgcgcg atggagatct tgaagaagct1140tcaggttcag ggcaactgga ccaatgtgct ttacgacgac caggtcggca aagtctccct1200cgtgggtgct ggcatgaagt ctcacccagg tgttaccgca gagttcatgg aagctctgcg1260cgatgtcaac gtgaacatcg aattgatttc cacctctgag attcgtattt ccgtgctgat1320ccgtgaagat gatctggatg ctgctgcacg tgcattgcat gagcagttcc agctgggcgg1380
cgaagacgaa gccgtcgttt atgcaggcac cggacgctaa agttttaaag gagtagtttt1440acaatgacca ccatcgcagt tgttggtgca accggccagg tcggccaggt tatgcgcacc1500cttttggaag agcgcaattt cccagctgac actgttcgtt tctttgcttc cccacgttcc1560gcaggccgta agattgaatt cgtcgacatc gatgctcttc tgcgttaatt aacaattggg1620atcctctaga cccgggattt aaatcgctag cgggctgcta aaggaagcgg aacacgtaga1680aagccagtcc gcagaaacgg tgctgacccc ggatgaatgt cagctactgg gctatctgga1740caagggaaaa cgcaagcgca aagagaaagc aggtagcttg cagtgggctt acatggcgat1800agctagactg ggcggtttta tggacagcaa gcgaaccgga attgccagct ggggcgccct1860ctggtaaggt tgggaagccc tgcaaagtaa actggatggc tttcttgccg ccaaggatct1920gatggcgcag gggatcaaga tctgatcaag agacaggatg aggatcgttt cgcatgattg1980aacaagatgg attgcacgca ggttctccgg ccgcttgggt ggagaggcta ttcggctatg2040actgggcaca acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg2100ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa ctgcaggacg2160aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg2220ttgtcactga agcgggaagg gactggctgc tattgggcga agtgccgggg caggatctcc2280tgtcatctca ccttgctcct gccgagaaag tatccatcat ggctgatgca atgcggcggc2340tgcatacgct tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc2400gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac gaagagcatc2460aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc gcgcatgccc gacggcgagg2520atctcgtcgt gacccatggc gatgcctgct tgccgaatat catggtggaa aatggccgct2580tttctggatt catcgactgt ggccggctgg gtgtggcgga ccgctatcag gacatagcgt2640tggctacccg tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc2700tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt2760tcttctgagc gggactctgg ggttcgaaat gaccgaccaa gcgacgccca acctgccatc2820acgagatttc gattccaccg ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg2880ggacgccggc tggatgatcc tccagcgcgg ggatctcatg ctggagttct tcgcccacgc2940tagcggcgcg ccggccggcc cggtgtgaaa taccgcacag atgcgtaagg agaaaatacc3000gcatcaggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc3060ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 
tcaggggata3120acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg3180cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct3240
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa3300gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc3360tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt3420aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg3480ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg3540cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct3600tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc3660tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg3720ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc3780aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt3840aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaaggccg3900gccgcggccg ccatcggcat tttcttttgc gtttttattt gttaactgtt aattgtcctt3960gttcaaggat gctgtctttg acaacagatg ttttcttgcc tttgatgttc agcaggaagc4020tcggcgcaaa cgttgattgt ttgtctgcgt agaatcctct gtttgtcata tagcttgtaa4080tcacgacatt gtttcctttc gcttgaggta cagcgaagtg tgagtaagta aaggttacat4140cgttaggatc aagatccatt tttaacacaa ggccagtttt gttcagcggc ttgtatgggc4200cagttaaaga attagaaaca taaccaagca tgtaaatatc gttagacgta atgccgtcaa4260tcgtcatttt tgatccgcgg gagtcagtga acaggtacca tttgccgttc attttaaaga4320cgttcgcgcg ttcaatttca tctgttactg tgttagatgc aatcagcggt ttcatcactt4380ttttcagtgt gtaatcatcg tttagctcaa tcataccgag agcgccgttt gctaactcag4440ccgtgcgttt tttatcgctt tgcagaagtt tttgactttc ttgacggaag aatgatgtgc4500ttttgccata gtatgctttg ttaaataaag attcttcgcc ttggtagcca tcttcagttc4560cagtgtttgc ttcaaatact aagtatttgt ggcctttatc ttctacgtag tgaggatctc4620tcagcgtatg gttgtcgcct gagctgtagt tgccttcatc gatgaactgc tgtacatttt4680gatacgtttt tccgtcaccg tcaaagattg atttataatc ctctacaccg ttgatgttca4740aagagctgtc tgatgctgat acgttaactt gtgcagttgt cagtgtttgt ttgccgtaat4800gtttaccgga gaaatcagtg tagaataaac ggatttttcc gtcagatgta aatgtggctg4860aacctgacca ttcttgtgtt tggtctttta ggatagaatc atttgcatcg aatttgtcgc4920tgtctttaaa gacgcggcca gcgtttttcc agctgtcaat agaagtttcg 
ccgacttttt4980gatagaacat gtaaatcgat gtgtcatccg catttttagg atctccggct aatgcaaaga5040cgatgtggta gccgtgatag tttgcgacag tgccgtcagc gttttgtaat ggccagctgt5100
cccaaacgtc caggcctttt gcagaagaga tatttttaat tgtggacgaa tcaaattcag5160aaacttgata tttttcattt ttttgctgtt cagggatttg cagcatatca tggcgtgtaa5220tatgggaaat gccgtatgtt tccttatatg gcttttggtt cgtttctttc gcaaacgctt5280gagttgcgcc tcctgccagc agtgcggtag taaaggttaa tactgttgct tgttttgcaa5340actttttgat gttcatcgtt catgtctcct tttttatgta ctgtgttagc ggtctgcttc5400ttccagccct cctgtttgaa gatggcaagt tagttacgca caataaaaaa agacctaaaa5460tatgtaaggg gtgacgccaa agtatacact ttgcccttta cacattttag gtcttgcctg5520ctttatcagt aacaaacccg cgcgatttac ttttcgacct cattctatta gactctcgtt5580tggattgcaa ctggtctatt ttcctctttt gtttgataga aaatcataaa aggatttgca5640gactacgggc ctaaagaact aaaaaatcta tctgtttctt ttcattctct gtatttttta5700tagtttctgt tgcatgggca taaagttgcc tttttaatca caattcagaa aatatcataa5760tatctcattt cactaaataa tagtgaacgg caggtatatg tgatgggtta aaaaggatcg5820gcggccgctc gatttaaatc tcgagaggcc tgacgtcggg 5860210792118787212DNA213質粒40079tcgagggaag gtgaatcgaa tttcggggct ttaaagcaaa aatgaacagc ttggtctata 60gtggctaggt accctttttg ttttggacac atgtagggtg gccgaaacaa agtaatagga 120caacaacgct cgaccgcgat tatttttgga gaatcgtgcg ttctccccgg gacgtcccac 180gacgggcggc accgggcaga ggcaaagccg acagccgtcg catcctaggg agccctttca 240tggcctcgtc gccatccacc ccgcccgccg acacccgcac ccgcgtgtcc gccctccgag 300aggccctcgc cacccgcgtg gtggtcgccg acggcgccat gggcaccatg ctccaggccc 360agaaccccac gctggacgac ttccagcagc tcgaagggtg caacgaggtc ctgaacctca 420cccggcccga catcgtccgc tcggtgcacg aggagtactt cgcggccggc gtcgactgcg 480tcgagaccaa caccttcggc gccaaccact ccgccctggg cgagtacgac atccccgagc 540gcgtccacga actgtccgag gccggcgccc gcgtcgcccg cgaggtcgcc gacgagttcg 600gcgcccgcga cggccggcag cgctgggtgc tgggctccat gggccccggc accaagctcc 660ccaccctcgg ccacgccccg tacaccgtcc tgcgcgacgc ctaccagcgc aacgccgagg 720gactggtcgc gggcggcgcg gacgcactgc tggtggagac cacgcaggac ctgctccaga 780ccaaggcctc ggtgctcggc gcccggcgcg ccctggacgt cctcggcctc gacctgccgc 840tcatcgtgtc cgtcaccgtc gagaccaccg gcaccatgct gctcggctcg gagatcggcg 900
ccgcgctcac cgcgctggaa ccgctcggca tcgacatgat cggcctgaac tgcgccaccg 960gccccgccga gatgagcgag cacctgcgct acctcgcccg gcactcccgc atcccgctga1020cctgcatgcc caacgccggt ctgcccgtcc tcggcaagga cggcgcccac tacccgctga1080ccgcgcccga gctggccgac gcacacgaga ccttcgtgcg cgagtacggc ctgtccctgg1140tcggcggctg ctgcggcacc acgcccgagc acctgcgcca ggtcgtcgag cgggtccggg1200acaccgcccc caccgcacgc gacccgcgcc ccgagcccgg cgccgcctcg ctctaccaga1260ccgtgccctt ccgccaggac acctcctacc tggccatcgg cgagcgcacc aacgccaacg1320ggtccaagaa gttccgcgag gccatgctgg acggccgctg ggacgactgc gtcgagatgg1380cccgcgacca gatccgcgaa ggcgcgcaca tgctcgacct ctgcgtcgac tacgtcggcc1440gggacggcgt cgccgacatg gaggaactgg ccggccggtt cgccaccgcc tccacgctgc1500cgatcgtcct cgactccacc gaggtcgacg tcatccgggc cggcctggag aagctcggcg1560gccgcgcggt gatcaactcg gtcaactacg aggacggcgc cggccccgag tcccggttcg1620cccgcgtcac gaagctcgcc cgggagcacg gcgccgcgct gatcgcgctg accatcgacg1680aggtgggaca ggcccgcacc gccgagaaga aggtcgagat cgccgaacgg ctcatcgacg1740acctcaccgg caactggggc atccacgagt ccgacatcct cgtcgactgc ctgaccttca1800ccatctgcac cggccaggag gagtcccgca aggacggcct ggccaccatc gagggcatcc1860gggaactcaa gcggcgccac ccggacgtgc agaccacgct cggcctgtcg aacatctcct1920tcggcctcaa cccggccgcc cgcatcctgc tcaactccgt cttcctcgac gaatgcgtca1980aggccggcct ggactcggcc atcgtgcacg cgagcaagat cctgccgatc gcccgcttcg2040acgaggagca ggtcaccacc gccctcgact tgatctacga ccgccgccgc gagggctacg2100accccctgca aaagctcatg cagctcttcg agggcgccac cgccaagtcg ctgaaggcct2160ccaaggccga ggaactggcc gccctcccgc tggaggagcg cctcaagcgc cgcatcatcg2220acggcgagaa gaacggcctc gaacaggacc tcgacgaggc cctccgggag cgcccggccc2280tcgagatcgt caacgacacc ctgctcgacg gtatgaaggt cgtcggcgag ctgttcggct2340ccggccagat gcagctgccg ttcgtgctcc agtccgccga ggtcatgaag accgcggtgg2400cccacctgga gccgcacatg gagaagaccg acgacgacgg caagggcacg atcgtgctgg2460ccaccgtccg cggcgacgtc cacgacatcg gcaagaacct cgtcgacatc atcctgtcca2520acaacggcta caacgtcgtc aacctcggca tcaagcagcc cgtctccgcg atcctggaag2580cggccgacga gcaccgggcc gacgtcatcg gcatgtccgg cctcctcgtc 
aagtccacgg2640tgatcatgaa ggagaacctg gaggagctga accagcgcaa gctggccgcc gactacccgg2700tcatcctcgg cggcgccgcc ctcaccaggg cctacgtcga acaggacctg cacgagatct2760
acgacggcga ggtccgctac gcccgcgacg ccttcgaggg cctgcgcctc atggacgccc2820tcatcggcat caagcgcggc gtgcccggcg ccaagctgcc ggagctgaag cagcgccggg2880tgcgggccgc caccgtcgag atcgacgagc gccccgagga aggccacgtc cgctccgacg2940tcgccaccga caacccggtc ccgaccccgc ccttccgcgg cacccgcgtc gtcaagggca3000tccagctcaa ggagtacgcc tcctggctcg acgagggcgc cctcttcaag ggccagtggg3060gcctcaagca ggcccgcacc ggcgagggac cctcctacga ggaactggtc gagtccgagg3120gccggccgcg gctgcgcggc ctgctcgacc ggctccagac ggacaacctt ttggaggcgg3180ccgtggtcta cggctacttc ccctgcgtct ccaaggacga cgacctgatc gtcctcgacg3240acgacggcaa cgaacgcacc cgcttcacct tcccccgcca gcgccgcggc cggcgcctgt3300gcctggccga cttcttccgc ccggaggagt ccggcgagac cgacgtggtc ggcttccagg3360tcgtcaccgt cggctcccgc atcggcgagg agacggcccg catgttcgag gccaacgcct3420accgcgacta tctcgagctg cacggcctgt ccgtgcagct cgccgaggcc ctcgccgagt3480actggcacgc gcgcgtgcgc tcggaactcg gcttcgccgg ggaggacccg gccgagatgg3540aggacatgtt cgccctgaag taccggggtg cccgcttctc cctcggctac ggcgcctgcc3600ccgacctgga ggaccgcgcc aagatcgccg ccctgctgga gcccgagcgc atcggcgtcc3660acctatccga ggagttccag ctccaccccg agcagtccac cgacgccatc gtcatccacc3720acccggaggc caagtacttc aacgcccgct gagggatatc gtcgacatcg atgctcttct3780gcgttaatta acaattggga tcctctagac ccgggattta aatcgctagc gggctgctaa3840aggaagcgga acacgtagaa agccagtccg cagaaacggt gctgaccccg gatgaatgtc3900agctactggg ctatctggac aagggaaaac gcaagcgcaa agagaaagca ggtagcttgc3960agtgggctta catggcgata gctagactgg gcggttttat ggacagcaag cgaaccggaa4020ttgccagctg gggcgccctc tggtaaggtt gggaagccct gcaaagtaaa ctggatggct4080ttcttgccgc caaggatctg atggcgcagg ggatcaagat ctgatcaaga gacaggatga4140ggatcgtttc gcatgattga acaagatgga ttgcacgcag gttctccggc cgcttgggtg4200gagaggctat tcggctatga ctgggcacaa cagacaatcg gctgctctga tgccgccgtg4260ttccggctgt cagcgcaggg gcgcccggtt ctttttgtca agaccgacct gtccggtgcc4320ctgaatgaac tgcaggacga ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct4380tgcgcagctg tgctcgacgt tgtcactgaa gcgggaaggg actggctgct attgggcgaa4440gtgccggggc aggatctcct gtcatctcac cttgctcctg ccgagaaagt 
atccatcatg4500gctgatgcaa tgcggcggct gcatacgctt gatccggcta cctgcccatt cgaccaccaa4560gcgaaacatc gcatcgagcg agcacgtact cggatggaag ccggtcttgt cgatcaggat4620
gatctggacg aagagcatca ggggctcgcg ccagccgaac tgttcgccag gctcaaggcg4680cgcatgcccg acggcgagga tctcgtcgtg acccatggcg atgcctgctt gccgaatatc4740atggtggaaa atggccgctt ttctggattc atcgactgtg gccggctggg tgtggcggac4800cgctatcagg acatagcgtt ggctacccgt gatattgctg aagagcttgg cggcgaatgg4860gctgaccgct tcctcgtgct ttacggtatc gccgctcccg attcgcagcg catcgccttc4920tatcgccttc ttgacgagtt cttctgagcg ggactctggg gttcgaaatg accgaccaag4980cgacgcccaa cctgccatca cgagatttcg attccaccgc cgccttctat gaaaggttgg5040gcttcggaat cgttttccgg gacgccggct ggatgatcct ccagcgcggg gatctcatgc5100tggagttctt cgcccacgct agcggcgcgc cggccggccc ggtgtgaaat accgcacaga5160tgcgtaagga gaaaataccg catcaggcgc tcttccgctt cctcgctcac tgactcgctg5220cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta5280tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc5340aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag5400catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac5460caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc5520ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt5580aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc5640gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga5700cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta5760ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag aaggacagta5820tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga5880tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg5940cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag6000tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc6060tagatccttt taaaggccgg ccgcggccgc gcaaagtccc gcttcgtgaa aattttcgtg6120ccgcgtgatt ttccgccaaa aactttaacg aacgttcgtt ataatggtgt catgaccttc6180acgacgaagt actaaaattg gcccgaatca tcagctatgg atctctctga tgtcgcgctg6240gagtccgacg cgctcgatgc tgccgtcgat ttaaaaacgg tgatcggatt tttccgagct6300ctcgatacga cggacgcgcc agcatcacga gactgggcca gtgccgcgag 
cgacctagaa6360actctcgtgg cggatcttga ggagctggct gacgagctgc gtgctcggcc agcgccagga6420ggacgcacag tagtggagga tgcaatcagt tgcgcctact gcggtggcct gattcctccc6480
cggcctgacc cgcgaggacg gcgcgcaaaa tattgctcag atgcgtgtcg tgccgcagcc6540agccgcgagc gcgccaacaa acgccacgcc gaggagctgg aggcggctag gtcgcaaatg6600gcgctggaag tgcgtccccc gagcgaaatt ttggccatgg tcgtcacaga gctggaagcg6660gcagcgagaa ttatcgcgat cgtggcggtg cccgcaggca tgacaaacat cgtaaatgcc6720gcgtttcgtg tgccgtggcc gcccaggacg tgtcagcgcc gccaccacct gcaccgaatc6780ggcagcagcg tcgcgcgtcg aaaaagcgca caggcggcaa gaagcgataa gctgcacgaa6840tacctgaaaa atgttgaacg ccccgtgagc ggtaactcac agggcgtcgg ctaaccccca6900gtccaaacct gggagaaagc gctcaaaaat gactctagcg gattcacgag acattgacac6960accggcctgg aaattttccg ctgatctgtt cgacacccat cccgagctcg cgctgcgatc7020acgtggctgg acgagcgaag accgccgcga attcctcgct cacctgggca gagaaaattt7080ccagggcagc aagacccgcg acttcgccag cgcttggatc aaagacccgg acacggagaa7140acacagccga agttataccg agttggttca aaatcgcttg cccggtgcca gtatgttgct7200ctgacgcacg cgcagcacgc agccgtgctt gtcctggaca ttgatgtgcc gagccaccag7260gccggcggga aaatcgagca cgtaaacccc gaggtctacg cgattttgga gcgctgggca7320cgcctggaaa aagcgccagc ttggatcggc gtgaatccac tgagcgggaa atgccagctc7380atctggctca ttgatccggt gtatgccgca gcaggcatga gcagcccgaa tatgcgcctg7440ctggctgcaa cgaccgagga aatgacccgc gttttcggcg ctgaccaggc tttttcacat7500aggctgagcc gtggccactg cactctccga cgatcccagc cgtaccgctg gcatgcccag7560cacaatcgcg tggatcgcct agctgatctt atggaggttg ctcgcatgat ctcaggcaca7620gaaaaaccta aaaaacgcta tgagcaggag ttttctagcg gacgggcacg tatcgaagcg7680gcaagaaaag ccactgcgga agcaaaagca cttgccacgc ttgaagcaag cctgccgagc7740gccgctgaag cgtctggaga gctgatcgac ggcgtccgtg tcctctggac tgctccaggg7800cgtgccgccc gtgatgagac ggcttttcgc cacgctttga ctgtgggata ccagttaaaa7860gcggctggtg agcgcctaaa agacaccaag ggtcatcgag cctacgagcg tgcctacacc7920gtcgctcagg cggtcggagg aggccgtgag cctgatctgc cgccggactg tgaccgccag7980acggattggc cgcgacgtgt gcgcggctac gtcgctaaag gccagccagt cgtccctgct8040cgtcagacag agacgcagag ccagccgagg cgaaaagctc tggccactat gggaagacgt8100ggcggtaaaa aggccgcaga acgctggaaa gacccaaaca gtgagtacgc ccgagcacag8160cgagaaaaac tagctaagtc cagtcaacga caagctagga aagctaaagg 
aaatcgcttg8220accattgcag gttggtttat gactgttgag ggagagactg gctcgtggcc gacaatcaat8280gaagctatgt ctgaatttag cgtgtcacgt cagaccgtga atagagcact taaggtctgc8340
gggcattgaa cttccacgag gacgccgaaa gcttcccagt aaatgtgcca tctcgtaggc8400agaaaacggt tcccccgtag ggtctctctc ttggcctcct ttctaggtcg ggctgattgc8460tcttgaagct ctctaggggg gctcacacca taggcagata acgttcccca ccggctcgcc8520tcgtaagcgc acaaggactg ctcccaaaga tcttcaaagc cactgccgcg actgccttcg8580cgaagccttg ccccgcggaa atttcctcca ccgagttcgt gcacacccct atgccaagct8640tctttcaccc taaattcgag agattggatt cttaccgtgg aaattcttcg caaaaatcgt8700cccctgatcg cccttgcgac gttggcgtcg gtgccgctgg ttgcgcttgg cttgaccgac8760ttgatcagcg gccgctcgat ttaaatc878
Claims
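1. A method for the fermentative production of at least one sulfur-containing fine chemical, comprising the following steps: a) fermenting a culture of coryneform bacteria that produces the desired sulfur-containing fine chemical, the coryneform bacteria expressing at least one heterologous nucleotide sequence encoding a protein with methionine synthase (metF) activity; b) concentrating the sulfur-containing fine chemical in the medium or in the bacterial cells; and c) isolating the sulfur-containing fine chemical.
2. The method according to claim 1, wherein the sulfur-containing fine chemical comprises L-methionine.
3. The method according to any one of the preceding claims, wherein the heterologous metF-coding nucleotide sequence has less than 100% homology to the metF-coding sequence from Corynebacterium glutamicum ATCC 13032.
4. The method according to claim 3, wherein the metF-coding sequence is derived from any one of the following organisms:
5. The method according to any one of the preceding claims, wherein the metF-coding sequence comprises a coding sequence according to SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51 or 53, or a nucleotide sequence homologous thereto that encodes a protein with metF activity.
6. The method according to any one of the preceding claims, wherein the metF-coding sequence encodes a protein with metF activity, the protein comprising an amino acid sequence according to SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52 or 54, or an amino acid sequence homologous thereto that represents a protein with metF activity.
7. The method according to any one of the preceding claims, wherein the metF-coding sequence is a DNA or an RNA that can be replicated in coryneform bacteria or stably integrated into the chromosome.
8. The method according to claim 7, wherein a) a bacterial strain transformed with a plasmid vector is used, the plasmid vector carrying at least one copy of the metF-coding sequence under the control of regulatory sequences, or b) a strain is used in which the metF-coding sequence has been integrated into the bacterial chromosome.
9. The method according to any one of the preceding claims, wherein the metF-coding sequence is overexpressed.
10. The method according to any one of the preceding claims, wherein bacteria are fermented in which at least one further gene of the biosynthetic pathway of the desired sulfur-containing fine chemical has also been amplified, or has been mutated so that its activity is not influenced by metabolites.
11. The method according to any one of the preceding claims, wherein bacteria are fermented in which at least one metabolic pathway that reduces production of the desired sulfur-containing fine chemical has been at least partially switched off.
12. The method according to any one of the preceding claims, wherein coryneform bacteria are fermented in which, at the same time, at least one gene selected from a) the aspartate kinase-encoding gene lysC, b) the glyceraldehyde-3-phosphate dehydrogenase-encoding gene gap, c) the 3-phosphoglycerate kinase-encoding gene pgk, d) the pyruvate carboxylase-encoding gene pyc, e) the triosephosphate isomerase-encoding gene tpi, f) the homoserine O-acetyltransferase-encoding gene metA, g) the cystathionine γ-synthase-encoding gene metB, h) the cystathionine γ-lyase-encoding gene metC, i) the serine hydroxymethyltransferase-encoding gene glyA, j) the O-acetylhomoserine sulfhydrylase-encoding gene metY, k) the vitamin B12-dependent methionine synthase-encoding gene metH, l) the phosphoserine aminotransferase-encoding gene serC, m) the phosphoserine phosphatase-encoding gene serB, n) the serine acetyltransferase-encoding gene cysE, and o) the homoserine dehydrogenase-encoding gene hom is overexpressed, or is mutated such that the activity of the corresponding protein is influenced by metabolites to a lesser extent, if at all, than the activity of the unmutated protein.
13. The method according to any one of the preceding claims, wherein coryneform bacteria are fermented in which, at the same time, at least one gene selected from a) the homoserine kinase-encoding gene thrB, b) the threonine dehydratase-encoding gene ilvA, c) the threonine synthase-encoding gene thrC, d) the meso-diaminopimelate D-dehydrogenase-encoding gene ddh, e) the phosphoenolpyruvate carboxykinase-encoding gene pck, f) the glucose-6-phosphate 6-isomerase-encoding gene pgi, g) the pyruvate oxidase-encoding gene poxB, h) the dihydrodipicolinate synthase-encoding gene dapA, i) the dihydrodipicolinate reductase-encoding gene dapB, and j) the diaminopicolinate decarboxylase-encoding gene is attenuated by altering its rate of expression or by introducing a specific mutation.
14. The method according to one or more of the preceding claims, wherein the microorganism Corynebacterium glutamicum is used.
15. A method for producing an L-methionine-containing animal feed additive from a fermentation broth, comprising the following steps: a) culturing and fermenting an L-methionine-producing microorganism in a fermentation medium; b) removing water from the L-methionine-containing fermentation broth; c) removing 0 to 100% by weight of the biomass formed during fermentation; and d) drying the fermentation broth obtained according to b) and/or c) to obtain the animal feed additive in the desired powder or granule form.
16. The method according to claim 15, wherein a microorganism as defined in any one of claims 1 to 14 is used.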
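Abstract
The invention relates to a method for the fermentative production of sulfur-containing fine chemicals, in particular L-methionine, using bacteria that express a nucleotide sequence encoding the methionine synthase (metH) gene.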
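Document No.: C12R1/15 GK1653186 SQ03811344
Publication date: August 10, 2005. Filing date: April 16, 2003. Priority date: April 17, 2002.
Inventors: B. Kröger, O. Zelder, C. Klopprogge, H. Schröder, S. Haefner. Applicant: BASF AG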
