新四季網

穩定化的流感血凝素莖區三聚體及其用途的製作方法

2023-08-11 00:34:56


本發明提供了新的基於血凝素(HA)蛋白的流感疫苗,其是易於製造的,有力的並且引發針對流感HA蛋白的莖區的廣泛中和性流感抗體。特別地,本發明提供了融合前構象的修飾的流感HA莖區蛋白及其部分,其可用於誘導中和性抗體的產生。本發明還提供在其表面上表達流感HA蛋白的新型基於納米顆粒(np)的疫苗。此類納米顆粒包含融合蛋白,每個融合蛋白包含連接到來自流感HA蛋白的莖區的抗原性或免疫原性部分的鐵蛋白的單體亞基。因為此類納米顆粒在其表面上展示流感HA蛋白莖區,所以它們可以用於針對流感病毒對個體接種疫苗。

發明背景

通過針對流感病毒接種疫苗誘導的保護性免疫應答主要針對病毒HA蛋白,其是病毒表面上負責病毒與宿主細胞受體的相互作用的糖蛋白。病毒表面上的HA蛋白是HA蛋白單體的三聚體,其被酶促切割以產生氨基端末端HA1和羧基端末端HA2多肽。球狀頭部僅由HA1多肽的主要部分組成,而將HA蛋白錨定到病毒脂質包膜中的莖由HA2和HA1的部分組成。HA蛋白的球狀頭部包括兩個結構域:受體結合結構域(RBD),包括唾液酸結合位點的約148個胺基酸殘基結構域,和退化酯酶結構域(vestigial esterase domain),剛好低於RBD的較小的約75個胺基酸殘基區域。球狀頭部牽涉幾個包括免疫顯性表位的抗原位點。實例包括Sa,Sb,Ca1,Ca2和Cb抗原位點(參見例如Caton AJ et al,1982,Cell 31,417-427)。RBD-A區包括Sa抗原位點和Sb抗原位點的部分。

針對流感的抗體通常靶向HA球狀頭中的可變抗原位點,其圍繞保守的唾液酸結合位點,因此僅中和抗原緊密相關的病毒。HA頭部的可變性是由於流感病毒的恆定抗原漂移所致,並且造成流感的季節性流行病。相比之下,HA莖是高度保守的,並且經歷很少的抗原漂移。不幸的是,不同於免疫顯性頭部,保守的HA莖不是非常免疫原性的。此外,病毒基因組的基因區段可以在宿主物種中進行重配(抗原漂移),創建具有改變的抗原性的能夠變成大流行的新病毒[Salomon,R.et al.Cell 136,402-410(2009)]。直到現在,每年更新流感疫苗以反映即將到來的流行病毒的預測的HA和神經氨酸酶(NA)。

最近,分離了一類全新的針對流感病毒的廣泛中和性抗體,其識別高度保守的HA莖[Corti,D.et al.J Clin Invest 120,1663-1673(2010);Ekiert,D.C.et al.Science 324,246-251(2009);Kashyap,A.K.et al.Proc Natl Acad Sci USA105,5986-5991(2008);Okuno,Y.et al.J Virol 67,2552-2558(1993);Sui,J.et al.Nat Struct Mol Biol 16,265-273(2009);Ekiert,D.C.et al.Science 333,843-850(2011);Corti,D.et al.Science 333,850-856(2011)]。與毒株特異性抗體不同,那些抗體能夠中和多種抗原性獨特的病毒,因此誘導此類抗體已成為下一代通用疫苗開發的焦點[Nabel,G.J.et al.Nat Med 16,1389-1391(2010)]。然而,通過疫苗接種用此類異源中和概況強力引發這些抗體是困難的[Steel,J.et al.MBio 1,e0018(2010);Wang,T.T.et al.PLoS Pathog 6,e1000796(2010);Wei,C.J.et al.Science 329,1060-1064(2010)]。通過遺傳操作除去HA(其含有競爭性表位)的免疫顯性頭部區和穩定化所得莖結構域是改善這些廣泛中和性莖抗體的引發的一種潛在方式。

目前用於流感的疫苗策略使用化學滅活或減毒活流感病毒。兩種疫苗通常在含胚卵中產生,其由於耗時的方法和有限的生產能力而存在主要的製造限制。當前疫苗的另一個更關鍵的限制是其高度毒株特異性功效。在2009年H1N1大流行的出現期間,這些挑戰變得顯著,從而驗證了能夠克服這些限制的新疫苗平臺的必要性。病毒樣顆粒代表了這種替代方法之一,目前正在臨床試驗中進行評估[Roldao,A.et al.Expert Rev Vaccines 9,1149-1176(2010);Sheridan,C.Nat Biotechnol 27,489-491(2009)]。代替含胚卵,通常包含HA,NA和基質蛋白1(M1)的VLP可以在哺乳動物或昆蟲細胞表達系統中大規模生產[Haynes,J.R.Expert Rev Vaccines 8,435-445(2009)]。這種方法的優點是其顆粒,多價性質和正確摺疊、三聚體HA刺突的真實展示,其忠實模擬感染性病毒體。相比之下,由於其組裝的性質,有包膜的VLP含有小的但有限的宿主細胞組分,其可以在重複使用該平臺後呈現潛在的安全性,免疫原性挑戰[Wu,C.Y.et al.PLoS One 5,e9784(2010)]。此外,VLP誘導的免疫與當前疫苗基本相同,因此不可能顯著改善疫苗誘導的保護性免疫的效力和廣度。除了VLP外,重組HA蛋白也已經在人體中進行了評估[Treanor,J.J.et al.Vaccine 19,1732-1737(2001);Treanor,J.J.JAMA 297,1577-1582(2007)],儘管誘導保護性中和性抗體滴度的能力有限。在這些試驗中使用的重組HA蛋白在昆蟲細胞中產生並且可能不優先形成天然三聚體[Stevens,J.Science303,1866-1870(2004)]。

儘管常規流感疫苗有幾種替代,但在過去幾十年中生物技術的進步已經允許利用生物材料的工程化來產生新的疫苗平臺。鐵蛋白,幾乎所有活生物體中發現的鐵貯存蛋白,是已經廣泛研究和工程化以用於許多潛在的生物化學/生物醫學目的的實例[Iwahori,K.U.S.Patent 2009/0233377(2009);Meldrum,F.C.et al.Science 257,522-523(1992);Naitou,M.et al.U.S.Patent2011/0038025(2011);Yamashita,I.Biochim Biophys Acta 1800,846-857(2010)],包括用於展示外源表位肽的潛在疫苗平臺[Carter,D.C.et al.U.S.Patent 2006/0251679(2006);Li,C.Q.et al.Industrial Biotechnol 2,143-147(2006)]。其作為疫苗平臺的用途是特別有趣的,這是由於其自身組裝和抗原的多價呈遞,這比單價形式誘導更強的B細胞應答以及誘導T細胞非依賴性抗體應答[Bachmann,M.F.et al.Annu Rev Immunol 15,235-270(1997);Dintzis,H.M.et al.Proc Natl Acad Sci USA 73,3671-3675(1976)]。此外,鐵蛋白的分子結構,其由組裝成具有432對稱的八面體籠的24個亞基組成,具有在其表面上展示多聚體抗原的潛力。

仍然需要提供強力的針對流感病毒的保護的有效的流感疫苗。特別地,仍然需要保護個體免受流感病毒異源株,包括進化中的未來的季節性和大流行性流感病毒株的流感疫苗。本發明通過提供新穎的基於納米顆粒的疫苗來滿足這種需要,所述疫苗由新的HA穩定化的莖(SS)組成,沒有遺傳上融合到納米顆粒表面的可變免疫顯性頭部區(gen6HA-SS np),從而產生流感疫苗,其是易於製造的,有力的,並且引發廣泛異亞型保護性的抗體。

附圖簡述

圖1a顯示了HA頭部的基於結構的除去允許保留莖免疫原抗原性。帶狀模型描繪了HA-SS設計途徑,開始於融合到T4摺疊物(foldon)三聚化結構域(在HA胞外域下方為綠色)的HA胞外域的模型。最後三個HA-SS設計(Gen4-6)遺傳融合到鐵蛋白納米顆粒(下圖)。每個HA三聚體的一個單體被遮蔽。用於創建Gen6的核心穩定化突變顯示為球體。每種HA-SS免疫原設計下方顯示了三聚化百分比(包括摺疊物)和對規定mAb的抗原親和常數(KD,M)。ND,未確定;NA,不適用。圖1b分別顯示沒有摺疊物結構域的H1N1HA胞外域(PDB ID 1GBN),Gen4HA-SS和Gen6HA-SS的HA部分的表面呈現,其通過與H5N1 2004VN的序列保守加陰影(深灰色,可變;白色,保守)。分別對於Gen4和Gen6HA-SS,無摺疊物結構域的免疫原的HA莖百分比增加。*進一步評估此免疫原,並且在本公開的實施例部分中稱為H1-SS-np。圖1c顯示了描繪在Gen6HA-SS中Glu103-Lys51鹽橋替換為Leu103-Met51疏水對的橫截面圖的帶狀圖。虛線(左)指示橫截面的位置。圖1d顯示了以其可溶性和納米顆粒形式呈現的Gen6HA-SS的抗原性。三個圖顯示了一個頭(CH65)和三個莖特異性抗體(CR6261,CR9114,FI6v3)對Gen6』HA-SS(左圖),H1-SS-np(中圖)和H1-SS-np』(右圖)的ELISA結合。濃度範圍為10-6.40×10-4μg/mL的抗體的ELISA結合。圖1e和圖1f顯示了H1-SS-np(圖1e)和H1-SS-np』(圖1f)與HA莖定向性bNAb結合的Octet傳感圖。將H1-SS-np固定在Octet探針上,並與不同濃度的抗體結合片段Fab或scFv莖定向性抗體溫育,其在每個傳感圖的頂部指示。圖1g顯示通過抗IgM(=總受體活性),空np,HA-np(HA含有Y98F突變,以消除與唾液酸的非特異性結合)和H1-SS-np』的野生型IGHV1-69v-基因逆轉的CR6261BCR(左圖)對雙重Ile53Ala/Phe54Ala CDRH2突變體BCR(右圖)的刺激通過流式細胞術測量為Ca2+敏感染料FuraRed的Ca2+結合/未結合狀態的比率。

圖2a顯示三聚體,而不是納米顆粒莖免疫原,展示HA莖展開。左圖描述了Gen3HA-SS(黑色和灰色)和mAb C179(標記)之間的複合物的晶體結構的帶狀圖。圖2a的中間圖示出了在兩個不同視圖(側面和底部)中比較晶體結構(光)與模型(暗)的展開的草圖。圖2a的右圖顯示了Gen3HA-SS/C179結合界面與1957H2N2HA/C179結合界面(PDB ID 4HLZ)的重疊。抗體CDR環對於重鏈用「H」標記,對於輕鏈用「L」標記。重鏈框架3環標記為FR3。RMSD,均方根偏差。圖2b描繪了與圖2a中相同的圖格式,顯示了Gen4HA-SS,並且在右圖中,Gen4HA-SS/CR6261重鏈結合界面與1918H1N1HA/CR6261結合界面(PDB ID 3GBN)的重疊。圖2c顯示H1-SS-np冷凍電子顯微術(cryo-electron microscopy)分析。前兩個圖分別顯示了Gen4HA-SS晶體結構(剪切(cropped))和H1-SS-np模型,分別適合於一個H1-SS-np刺突的冷凍電子顯微術圖。圖2c的接下來兩個圖顯示了適合到H1-SS-np低溫電子顯微術圖中的整個H1-SS-np模型的兩個不同視圖。圖2d顯示分別用Superdex 20010/300和Superose 610/300柱得到的HA,Gen4HA-SS和H1-SS-np』(左圖)和HA np,Gen4HA-SS-np np和H1-SS-np』和H1-SS-np(右圖)的大小排阻層析中流感病毒HA和HA-SS不溶性和納米顆粒形式的表徵。圖2e是HA-np(左圖)和Gen4HA-SS-np(中圖)和H1-SS-np(右圖)的負染色透射電子顯微術圖像。最初以67,000×放大率記錄圖像。圖2f顯示H1-SS-np場的低溫EM圖像。箭頭描繪一些環樣納米顆粒;比例尺為20nm。圖2g顯示了通過納米顆粒(插圖)的全局圓形平均值的2D徑向密度概況(曲線)對H1-SS-np的大小分析。該概況示出了兩層結構,其具有以距離顆粒中心約為中心的基峰和跨越約至範圍的第二峰。峰高度的差異對於以含有幾個離散刺突的層為頂部的更連續的蛋白質層是一致的。圖2h顯示H1-SS-np的無參考的2D類平均值,沒有施加對稱。類別指示具有蛋白質殼和突出的刺突密度的顆粒的不同視圖,並且視圖與預期的八面體對稱一致。圖2i通過傅立葉殼關聯(FSC)圖的H1-SS-np 3D重建的解析度評估。遵循如在RELION軟體包中實施的金標準程序(gold-standard procedure),使用FSC(0.143)作為截留值。

圖3a顯示免疫的小鼠和雪貂的免疫應答。左圖顯示針對多種多樣的HA蛋白的抗體端點滴度,並且右圖顯示來自用SAS佐劑化(SAS-adjuvanted)的H1-SS-np免疫的小鼠(每組n=10)的血清的中和滴度。圖3b顯示了用SAS-佐劑化的空np(n=5),H1-SS-np』(n=6),2006-07TIV(n=6)或H5HA(2xDNA/1xMIV;n=6)免疫的雪貂的免疫應答。圖3b的左圖顯示了H1-SS-np』免疫血清對多種多樣的HA蛋白的抗體端點滴度,並且右圖顯示了來自四種免疫方案的血清的HA莖反應性。圖3c顯示了用三種施用方案免疫的雪貂的血清的中和滴度。在加強後兩周對每個個體動物顯示抗體端點和IC50滴度。虛線指示ELISA和假型化慢病毒報告物測定法兩者的基線(1:25稀釋)。誤差棒表示平均值±s.d。使用雙尾學生t檢驗(two-tailed student’s t-test)進行統計分析。

圖4a顯示了在小鼠和雪貂中針對致死性H5N1 2004VN流感病毒攻擊賦予的免疫保護。在第0、8和11周,用SAS-佐劑化的空np或H1-SS-np對BALB/c小鼠(每組n=10)接種疫苗三次,或保持未接種疫苗(未處理)。最後一次疫苗接種後四周,用高劑量(25LD50)的H5N1 2004VN病毒攻擊小鼠,並監測體重減輕(左圖)和存活(右圖)達14天。圖4b顯示用SAS-佐劑化的空np(n=5),H1-SS-np』(n=6),2006-07TIV(n=6)或H5HA(DNA/MIV;n=6),並在用1000TCID50的H5N1 2004VN最後免疫後6周攻擊。監測體重減輕(左圖)和存活(右圖)達14天。圖4c顯示在用高劑量(25LD50)的H5N1 2004VN流感病毒攻擊之前24小時用來自未處理或H1-SS-np-免疫動物的10mg Ig被動免疫(腹膜內)的BALB/c小鼠(每組n=10)。監測體重減輕(左圖)和存活(右圖)達14天。在圖4a,4b和4c的每一個中,黑色虛線(右圖)指示50%存活。使用時序(Mantel-Cox)檢驗進行統計分析。圖4d顯示了未處理和H1-SS-np免疫Ig的表徵。通過ELISA,未處理Ig(左)和H1-SS-np-免疫Ig(右)對空鐵蛋白np和各種HA蛋白的結合。圖4e顯示在輸注多克隆Ig後24小時的小鼠血清中Gen6HA-SS特異性Ig的估計濃度。

圖5-24提供了用於產生本發明的肽構建體的質粒圖譜和序列。如本公開表2中詳細描述的,圖5顯示了包含SEQ ID NO:266的Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q的圖譜。圖6顯示了包含SEQ ID NO:273的Gen6_H1CA09_K394M/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q的圖譜。圖7顯示包含SEQ ID NO:280的Gen6_H2Sing57_K394M/M445L/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q的圖譜。圖8顯示包含SEQ ID NO:287的Gen6_H5Ind05K394M/M445L/E446L/E448Q/R449W/D452L/Y437D/N438L/S49bW_N19Q的圖譜。圖9顯示包含SEQ ID NO:294的Gen6_H1NC99_K394M/E446L_N19Q的圖譜。圖10顯示包含SEQ ID NO:301的Gen6_H1NC99_K394M/E446L/Y437D/N438L_N19Q的圖譜。圖11顯示包含SEQ ID NO:308的Gen6_H1NC99_K394I/E446I/Y437D/N438L_N19Q的圖譜。圖12顯示包含SEQ ID NO:315的Gen6H1NC99K394L/E446I/Y437D/N438L_N19Q的圖譜。圖13顯示包含SEQ ID NO:322的Gen6_H1NC99_K394L/E446L/Y437D/N438L_N19Q的圖譜。圖14顯示包含SEQ ID NO:329的Gen6_H1NC99_K394M/E446M/Y437D/N438L_N19Q的圖譜。圖15顯示包含SEQ ID NO:336的Gen6H1NC99K394Q/E446Q/Y437D/N438L_N19Q的圖譜。圖16顯示包含SEQ ID NO:343的Gen6H1NC99K394M/E446L/Y437D/N438L/H45N/V47T_N19Q的圖譜。圖17顯示包含SEQ ID NO:350的Gen6H1NC99V36I/K394M/L445M/E446L/E448Q/R449F/D452L/Y437D/N438L N19Q的圖譜。圖18顯示包含SEQ ID NO:357的Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402aN/G402cT/S402dG/T402fA/Y437D/N438L_N19Q的圖譜。圖19顯示包含SEQ ID NO:364的Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402bG/G402cN/S402eT/T402fA/Y437D/N438L_N19Q的圖譜。圖20顯示包含SEQ ID NO:371的Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402eN/Y437D/N438L_N19Q的圖譜。圖21顯示了包含SEQ ID NO:378的Gen6H1NC99K394M/E446L/E448Q/R449W/D452L/G402cN/G402eT/T402fA/Q370N/E372T/Y437D/N438L_S21T的圖譜。圖22顯示了包含SEQ ID NO:386的Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/G402cN/G402eT/T402fA/Q370N/E372T/Y437D/N438L_S21T/Q69N的圖。圖23顯示了包含SEQ ID NO:392的Gen6_H1NC99_K394M/E446L/Y437D/N438L/Δ172-174的圖。圖24顯示了包含SEQ ID NO:399的Gen6_H1NC99_rpk3_Dloop2的圖。

發明詳述

本發明涉及用於流感病毒的新型疫苗。更具體地,本發明涉及新的基於流感HA蛋白的疫苗,其引發針對來自一大批流感病毒的HA蛋白的莖區的免疫應答。它還涉及自組裝納米顆粒,所述自組裝納米顆粒在其表面上展示來自流感HA蛋白的莖區的融合前構象的免疫原性部分。此類納米顆粒可用於針對流感病毒對個體接種疫苗。因此,本發明還涉及用於產生此類納米顆粒的蛋白質構建體和編碼此類蛋白質的核酸分子。另外,本發明涉及生產本發明的納米顆粒的方法,以及使用此類納米顆粒對個體接種疫苗的方法。

在進一步描述本發明前,應當理解本發明不限於描述的具體實施方案,因此當然可以有所變化。還應當理解,本文中使用的術語僅為了描述具體的實施方案,而並不意圖為限制性的,因為本發明的範圍僅會以權利要求書為限。

應當注意到,如本文中及所附權利要求書中使用的,單數形式「一個」、「一種」和「該/所述」包括複數提及物,除非上下文另有明確規定。例如,核酸分子指一種或多種核酸分子。因此,術語「一個」、「一種」、「一個/種或多個/種」和「至少一個/種」可以互換使用。類似地,術語「包含」、「包括」和「具有」可以互換使用。進一步注意到,權利要求書可以撰寫為排除任何任選要素。因此,此陳述意圖充當與權利要求要素的敘述結合使用排除術語,如「單獨」、「僅」等,或者使用「負」限定的前置基礎。

在上文外,除非另有明確定義,本文中公開的各個實施方案共同的下列術語和短語如下定義:

如本文中使用的,蛋白質構建體是由人工製備的蛋白質,其中兩個或更多個胺基酸序列以自然界中未發現的方式共價連接。被連接的胺基酸序列可以是相關的或不相關的。如本文所使用的,如果通常沒有發現多肽序列的胺基酸序列在其天然環境(例如細胞內)中通過共價鍵連接在一起,則它們是不相關的。例如,通常沒有發現構成鐵蛋白的單體亞基的胺基酸序列和流感HA蛋白的胺基酸序列通過共價鍵連接在一起。因此,此類序列被認為是不相關的。

蛋白質構建體還可以包含相關的胺基酸序列。例如,流感HA蛋白的結構使得頭部區胺基酸序列在兩端側翼為莖區胺基酸序列。通過遺傳手段,可以通過從頭部區的中間除去胺基酸殘基,同時保持側翼為莖區序列的頭部區的部分,來創建HA蛋白的缺失形式。雖然最終分子中序列的順序保持相同,但胺基酸之間的空間關係將不同於天然蛋白。因此,此類分子將被認為是蛋白質構建體。根據本發明,蛋白質構建體也可以稱為融合蛋白。

蛋白質構建體中的胺基酸序列可以彼此直接連接,或者它們可以使用接頭序列連接。接頭序列,肽或多肽是用於連接具有期望特徵(例如,結構,表位,免疫原性,活性等)的兩種蛋白質的短(例如,2-20)胺基酸序列。接頭序列通常不具有其自身的活性,並且通常用於允許蛋白質構建體的其它部分呈現期望的構象。接頭序列通常由小胺基酸殘基和/或其運行(runs),例如絲氨酸,丙氨酸和甘氨酸製備,儘管不排除使用其它胺基酸殘基。

如本文中使用的,術語免疫原性是指特定蛋白質或其特定區域引發對特定蛋白質或包含與特定蛋白質具有高度同一性的胺基酸序列的蛋白質的免疫應答的能力。根據本發明,具有高同一性程度的兩種蛋白質具有至少80%相同,至少85%相同,至少87%相同,至少90%相同,至少92%相同,至少93%相同,至少94%相同,至少95%相同,至少96%相同,至少97%相同,至少98%相同或至少99%相同的胺基酸序列。測定兩個胺基酸或核酸序列之間的百分比同一性的方法是本領域已知的。

如本文中使用的,對本發明的疫苗或納米顆粒的免疫應答是受試者中形成對疫苗中存在的HA蛋白的體液和/或細胞免疫應答。為了本發明的目的,「體液免疫應答」是指由抗體分子(包括分泌型(IgA)或IgG分子)介導的免疫應答,而「細胞免疫應答」是由T淋巴細胞和/或其它白血細胞介導的。細胞免疫的一個重要方面涉及溶細胞性T細胞(「CTL」)的抗原特異性應答。CTL對肽抗原具有特異性,所述肽抗原與由主要組織相容性複合物(MHC)編碼並且在細胞表面上表達的蛋白質聯合呈現。CTL有助於誘導和促進細胞內微生物的破壞或被此類微生物感染的細胞的溶解。細胞免疫的另一方面涉及輔助T細胞的抗原特異性應答。輔助T細胞作用為幫助刺激非特異性效應細胞針對細胞的功能,並且聚焦非特異性效應細胞針對細胞的活性,所述細胞在其表面上展示與MHC分子聯合的肽抗原。細胞免疫應答還指由活化的T細胞和/或其它白細胞(包括源自CD4+和CD8+T細胞的那些)產生的細胞因子,趨化因子和其它此類分子的產生。

因此,免疫應答可以是刺激CTL和/或輔助T細胞的產生或激活的應答。也可以刺激趨化因子和/或細胞因子的產生。疫苗還可以引發抗體介導的免疫應答。因此,免疫應答可以包括一種或多種以下效應:由B細胞產生抗體(例如IgA或IgG);和/或特異性針對存在於疫苗中的HA蛋白的抑制物(suppressor),細胞毒性或輔助T細胞和/或T細胞的活化。這些應答可用來中和感染性(例如抗體依賴性保護),和/或介導抗體-補體或抗體依賴性細胞細胞毒性(ADCC)以向免疫的個體提供保護。此類反應可以使用本領域熟知的標準免疫測定法和中和測定法來測定。

如本文中使用的,術語抗原性的,抗原性等是指由抗體或一組抗體結合的蛋白質。類似地,蛋白質的抗原部分是被抗體或一組抗體識別的任何部分。根據本發明,通過抗體識別蛋白質是指抗體選擇性地與蛋白質結合。如本文中使用的,短語選擇性地結合,選擇性結合等是指抗體與同HA無關的結合蛋白或樣品或測定法中非蛋白質組分形成對比優先結合HA蛋白的能力。優先結合HA的抗體是結合HA但不顯著結合可能存在於樣品或測定法中的其它分子或組分的抗體。認為顯著的結合是例如抗HA抗體與非HA分子的結合,其親和力或親合力大到足以幹擾測定法檢測和/或測定樣品中抗流感抗體,或HA蛋白水平的能力。可存在於樣品或測定法中的其它分子和化合物的實例包括但不限於非HA蛋白,如白蛋白,脂質和碳水化合物。根據本發明,非HA蛋白是具有與本文公開的流感HA蛋白的序列共享小於60%同一性的胺基酸序列的蛋白質。在一些實施方案中,一種或多種抗體提供廣泛的異亞型保護。在一些實施方案中,一種或多種抗體是中和性的。

如本文中使用的,中和性抗體是防止流感病毒完成一輪複製的抗體。如本文所定義的,一輪複製指病毒的生命周期,從病毒附著到宿主細胞開始,並以從宿主細胞出芽新形成的病毒結束。該生命周期包括但不限於以下步驟:附著於細胞,進入細胞,HA蛋白的切割和重排,病毒膜與內體膜的融合,病毒核糖核蛋白向細胞質的釋放,形成新病毒顆粒和自宿主細胞膜的病毒顆粒的出芽。根據本發明,中和性抗體是抑制一個或多個此類步驟的抗體。

如本文中使用的,廣泛中和性抗體是中和流感病毒的多於一種類型,亞型和/或毒株的抗體。例如,針對來自A型流感病毒的HA蛋白引發的廣泛中和性抗體可以中和B型或C型病毒。作為另一個實例,針對來自I型流感病毒的HA蛋白引發的廣泛中和性抗體可以中和組2病毒。作為另一個實例,針對來自病毒的一種亞型或株的HA蛋白引發的廣泛中和性抗體可以中和病毒的另一種亞型或株。例如,針對來自H1流感病毒的HA蛋白引發的廣泛中和性抗體可以中和來自一種或多種選自下組的亞型的病毒:H2,H3,H4,H5,H6,H7,H8,H8,H10,H11,H12,H13,H14,H15,H16,H17或H18。

根據本發明,用於分類流感病毒的所有命名法是本領域技術人員通常使用的。因此,流感病毒的類型或組是指A型流感,B型流感或C型流感。本領域技術人員應當理解,病毒作為特定類型的命名涉及在各自的M1(基質)蛋白質或NP(核蛋白)中的序列差異。A型流感病毒進一步分為組1和組2。這些組進一步分為亞型,其指基於其HA蛋白的序列的病毒分類。目前普遍認可的亞型的實例是H1,H2,H3,H4,H5,H6,H7,H8,H8,H10,H11,H12,H13,H14,H15,H16,H17或H18。組1流感亞型是H1,H2,H5,H6,H8,H9,H11,H12,H13,H16,H17和H18。組2流感亞型是H3,H4,H7,H10,H14,和H15。最後,術語毒株是指亞型內彼此不同之處在於它們在其基因組中具有小的遺傳變異的病毒。

如本文中使用的,流感血凝素蛋白或HA蛋白是指全長流感血凝素蛋白或其任何部分,其可用於產生本發明的蛋白質構建體和納米顆粒或能夠引發免疫應答。優選的HA蛋白是能夠形成三聚體的那些。全長流感HA蛋白的表位是指此類蛋白質的部分,其可以引發針對同源流感病毒株,即衍生HA的菌株的抗體應答。在一些實施方案中,此類表位也可以引發針對異源流感病毒株,即具有與免疫原的HA不同的HA的毒株的抗體應答。在一些實施方案中,表位引發廣泛異亞型保護性應答。在一些實施方案中,表位引發中和性抗體。

如本文中使用的,變體指在序列上與參照序列相似但不相同的蛋白質或核酸分子,其中變體蛋白質(或由變體核酸分子編碼的蛋白質)的活性沒有顯著改變。這些序列變異可以是天然存在的變異或者它們可以經由使用本領域技術人員已知的遺傳工程化技術來工程化改造。此類技術的例子可見Sambrook J,Fritsch E F,Maniatis T等,於Molecular Cloning--A Laboratory Manual,2nd Edition,Cold Spring Harbor Laboratory Press,1989,pp.9.31-9.57),或於Current Protocols in Molecular Biology,John Wiley&Sons,N.Y.(1989),6.3.1-6.3.6,這兩篇的完整內容通過提及併入本文。

就變體而言,胺基酸或核酸序列的任何類型變化是可允許的,只要所得的變體蛋白質保留引發針對流感病毒的中和性或非中和性抗體的能力。此類變異的例子包括但不限於缺失、插入、取代及其組合。例如,就蛋白質而言,本領域技術人員公知的是,一個或多個(例如2,3,4,5,6,7,8,9或10)胺基酸經常可以從蛋白質的氨基和/或羧基端末端除去,而不顯著影響所述蛋白質的活性。類似地,一個或多個(例如2,3,4,5,6,7,8,9或10)胺基酸經常可以插入蛋白質中,而不顯著影響蛋白質的活性。在已經進行插入的變體中,插入的胺基酸可以通過參考其後進行插入的胺基酸殘基來提及。例如,在胺基酸殘基402之後插入四個胺基酸殘基可以稱為402a-402d。此外,如果那些插入的胺基酸之一隨後被另一個胺基酸取代,則這種變化可以參考字母位置提及。例如,用蘇氨酸取代插入的甘氨酸(在插入物的另一個位置中)可以稱為S402dT。

如記錄的,相對於本文中公開的流感HA蛋白,本發明的變體蛋白質可以含有胺基酸取代。任何胺基酸取代是可允許的,只要蛋白質的活性不受顯著影響。在這點上,本領域中應當理解,胺基酸可以基於其物理特性而分成組。此類組的例子包括但不限於帶電荷的胺基酸、不帶電荷的胺基酸、極性不帶電荷的胺基酸、和疏水性胺基酸。含有取代的優選變體是那些其中的胺基酸用來自相同組的胺基酸取代的變體。此類取代稱為保守取代。

天然存在的殘基可以基於共同的側鏈特性而分成類:

1)疏水性:Met,Ala,Val,Leu,Ile;

2)中性親水性:Cys,Ser,Thr;

3)酸性:Asp,Glu;

4)鹼性:Asn,Gln,His,Lys,Arg;

5)影響鏈取向的殘基:Gly,Pro;和

6)芳香基:Trp,Tyr,Phe。

例如,非保守取代可以牽涉用這些類別之一的成員替換來自另一類別的成員。

在進行胺基酸變化中,可以考慮胺基酸的親水指數。基於每種胺基酸的疏水性和電荷性質,已給每種胺基酸的親水指數賦值。親水指數是:異亮氨酸(+4.5);纈氨酸(+4.2);亮氨酸(+3.8);苯丙氨酸(+2.8);半胱氨酸/胱氨酸(+2.5);甲硫氨酸(+1.9);丙氨酸(+1.8);甘氨酸(-0.4);蘇氨酸(-0.7);絲氨酸(-0.8);色氨酸(-0.9);酪氨酸(-1.3);脯氨酸(-1.6);組氨酸(-3.2);穀氨酸(-3.5);穀氨醯胺(-3.5);天冬氨酸(-3.5);天冬醯胺(-3.5);賴氨酸(-3.9);和精氨酸(-4.5)。本領域一般了解親水胺基酸指數在賦予蛋白質相互作用性生物學功能中的重要性(Kyte等,1982,J.Mol.Biol.157:105-31)。已知可以用某些胺基酸替代其它具有相似親水指數或分值的胺基酸,而仍然保留相似的生物學活性。在進行基於親水指數的變化中,親水指數在±2之內的胺基酸取代是優選的,在±1之內的那些胺基酸取代是特別優選的,且在±0.5之內的那些胺基酸取代是甚至更特別優選的。

本領域還了解可以基於疏水性有效地進行類似胺基酸的取代,特別是在意圖將由此產生的生物功能等同性蛋白質或肽用於結合免疫學發明(本案就是如此)的情況中。蛋白質的最大局部平均親水性(如由其相鄰胺基酸的親水性所決定的)與其免疫原性和抗原性,即與蛋白質的生物學特性相關聯。已將下列親水性數值(hydrophilicity value)賦予這些胺基酸殘基:精氨酸(+3.0);賴氨酸(+3.0);天冬氨酸(+3.0±1);穀氨酸(+3.0±1);絲氨酸(+0.3);天冬醯胺(+0.2);穀氨醯胺(+0.2);甘氨酸(0);蘇氨酸(-0.4);脯氨酸(-0.5±1);丙氨酸(-0.5);組氨酸(-0.5);半胱氨酸(-1.0);甲硫氨酸(-1.3);纈氨酸(-1.5);亮氨酸(-1.8);異亮氨酸(-1.8);酪氨酸(-2.3);苯丙氨酸(-2.5);和色氨酸(-3.4)。在進行基於相似親水性數值的變化時,親水性數值在±2之內的胺基酸取代是優選的,在±1之內的那些胺基酸取代是特別優選的,且在±0.5之內的那些胺基酸取代是甚至更特別優選的。還可以基於親水性鑑定來自一級胺基酸序列的表位。

在期望此類取代時,本領域技術人員可以確定期望的胺基酸取代(無論是保守的還是非保守的)。例如,可以使用胺基酸取代來鑑定HA蛋白的重要殘基,或者提高或降低本文中描述的HA蛋白的免疫原性、溶解度或穩定性。下文在表I中顯示了例示性的胺基酸取代。

表1

胺基酸取代

如本文中使用的,短語顯著影響蛋白質活性指將蛋白質活性降低至少10%,至少20%,至少30%,至少40%或至少50%。就本發明而言,此類活性可以例如以蛋白質引發針對流感病毒的保護性抗體的能力測量。此類活性可以通過測量針對流感病毒的此類抗體的效價,此類抗體針對流感感染提供保護的能力或者通過測量由引發的抗體中和的類型、亞型或毒株的數目測量。測定抗體效價,實施保護測定法,和實施病毒中和測定法的方法是本領域技術人員已知的。在上文描述的活性外,可以測量的其它活性包括凝集紅細胞的能力和蛋白質對細胞的結合親和力。測量此類活性的方法是本領域技術人員已知的。

術語個體、受試者和患者是本領域中公知的,並且在本文中可互換使用,指對流感感染易感的任何人或其它動物。例子包括但不限於人和其它靈長類,包括非人靈長類,諸如黑猩猩及其它猿和猴物種;家畜,諸如牛、綿羊、豬、海豹、山羊和馬;馴養哺乳動物,諸如犬和貓;實驗室動物,包括嚙齒類,諸如小鼠、大鼠和豚鼠;禽類,包括馴養禽類、野生禽類和獵禽,諸如雞、火雞和其它雞形目(gallinaceous)禽類、鴨、鵝,等等。術語個體、受試者和患者單獨不表示特定年齡、性別、人種,等等。因此,任何年齡的個體(無論雄性或雌性)意圖為本公開內容覆蓋,並且包括但不限於老年人、成人、兒童、嬰孩(babies)、嬰兒(infant)、和幼童(toddler)。同樣地,本發明的方法可以適用於任何人種,包括例如高加索人(Caucasian)(白種人)、非洲裔美國人(African-American)(黑人)、美洲原住民(Native American)、夏威夷原住民(Native Hawaiian)、西班牙裔(Hispanic)、拉美裔(Latino)、亞裔(Asian)、和歐洲裔。感染的受試者是已知在其體內具有流感病毒的受試者。

如本文中使用的,接種疫苗的受試者是已經施用意圖提供針對流感病毒的保護性效果的疫苗的受試者。

如本文中使用的,術語暴露指受試者已經與已知感染流感病毒的動物個體接觸。

本文中討論的出版物僅提供其在本申請的提交日前的公開內容。本文中的任何內容不應解釋為承認憑藉在先發明,本發明沒有資格早於此類出版物。此外,提供的出版日期可以與實際出版日期不同,這可能需要獨立確認。

除非另有定義,本文中使用的所有技術和科學術語與本發明所屬領域的普通技術人員的通常理解具有相同的意義。雖然與本文中描述的方法和材料類似或等同的任何方法和材料也可以用於實施或測試本發明,現在描述優選的方法和材料。本文中提及的所有出版物通過提及收入本文以公開並描述與結合出版物引用的方法和/或材料。

應當領會,本發明的某些特徵(為了清楚,其在不同實施方案的背景中描述)也可以在單一實施方案中組合提供。相反,本發明的各個特徵(為了簡潔,其在單一實施方案的背景中描述)也可以分開或在任何合適的亞組合中提供。實施方案的所有組合是本發明明確涵蓋的,並且在本文中公開,就像每種組合單獨且明確公開一樣。另外,所有亞組合也是本發明明確涵蓋的,並且在本文中公開,就像每種此類亞組合在本文中單獨且明確公開一樣。

本發明的一個實施方案是包含流感HA蛋白的蛋白質構建體,其中流感HA蛋白的頭部區已被包含距HA蛋白頭部區少於5個連續胺基酸殘基的胺基酸序列替換。如本文中使用的,HA蛋白是指可用於產生本發明的蛋白質構建體和納米顆粒的全長流感HA蛋白或其任何一個或多個部分和/或變體。因此,本發明涉及能夠引發對流感HA蛋白的莖區的免疫應答的分子。在一些實施方案中,HA蛋白構建體的序列已經進一步改變(即突變),以穩定蛋白的莖區,其形式可以呈遞給免疫系統。此類HA蛋白的一些代表性實例和由其製備的蛋白質構建體示於下表2中。

表2

病毒表面上的三聚體HA蛋白包含球狀頭部區和莖或柄區域,其將HA蛋白錨定到病毒脂質包膜中。流感HA的頭部區僅由HA1多肽的主要部分形成,而柄區由HA1和HA2的區段製成。根據本發明,頭部區大致由對應於流感H1N1NC的全長HA蛋白(SEQ ID NO:8)的胺基酸59-291的HA蛋白的胺基酸組成。類似地,如本文所使用的,莖區大約由胺基酸1-58和對應於流感H1N1NC的全長HA蛋白(SEQ ID NO:8)的胺基酸328-564的HA蛋白的胺基酸組成。如本文中使用的,關於頭部和莖區的術語大約是指上述序列在長度上可以改變幾個胺基酸,而不影響本發明的性質。因此,例如,頭部區可以由胺基酸50-291,胺基酸59-296或胺基酸59-285組成。通常,頭部和莖區域將不會從上述位置改變超過十個胺基酸;然而,在一個實施方案中,頭部區的羧基端末端可以延伸得遠達對應於SEQ ID NO:8的胺基酸327的胺基酸。在一個實施方案中,頭部區由在對應於流感A/新喀裡多尼亞/20/1999(SEQ ID NO:8)的Cys59和Cys291的胺基酸殘基之間的胺基酸序列組成,並且包括所述胺基酸殘基。關於HA蛋白,本領域技術人員應當理解,來自不同流感病毒的HA蛋白可能由於蛋白質中的突變(插入,缺失)而具有不同的長度。因此,提及相應的區域是指與所比較的區域在序列、結構和/或功能上相同或幾乎相同(例如,至少90%相同,至少95%相同,至少98%相同或至少99%相同)的另一種蛋白質的區域。例如,關於HA蛋白的莖區,另一HA蛋白中的相應區域可以不具有相同的殘基數,但是將具有幾乎相同的序列並且將執行相同的功能。作為實例,在上述實施方案中,來自A/新喀裡多尼亞/20/1999的HA蛋白(SEQ ID NO:8)的頭部區在胺基酸C291處結束。A/加利福尼亞/4/2009(H1)(SEQ ID NO:11)中頭部區末端的相應胺基酸是半胱氨酸292。為了更好地闡明病毒之間的序列比較,本領域技術人員使用編號系統,其將胺基酸位置與參考序列相關。因此,來自不同流感毒株的HA蛋白中的相應胺基酸殘基相對於其與蛋白質的n-末端胺基酸的距離可能不具有相同的殘基數。例如,使用H3編號系統,參考A/新喀裡多尼亞/20/1999(1999NC,H1)中的殘基100並不意味著它是距離N-末端胺基酸的第100個殘基。相反,A/新喀裡多尼亞/20/1999(1999NC,H1)的殘基100與流感H3N2毒株的殘基100對齊。本領域技術人員理解這種編號系統的使用。雖然H3編號系統可用於鑑定胺基酸的位置,除非另有說明,HA蛋白中胺基酸殘基的位置將通過一般性參考來自本文公開的序列的相應胺基酸的位置來鑑定。

本發明人還發現,通過將流感病毒HA蛋白的特定序列與能夠將HA蛋白呈遞給免疫系統的不相關分子組合,可以引發對HA蛋白的靶向區域的免疫應答。本發明的一個實施方案是包含與單體亞基蛋白的至少部分連接的流感HA蛋白的蛋白質構建體,其中流感HA蛋白的頭部區已被包含來自HA蛋白的頭部區的少於5個連續胺基酸殘基的胺基酸序列替換,並且其中所述蛋白質構建體能夠形成納米顆粒。

通過至少將流感HA蛋白的部分與單體亞基連接,本發明的蛋白質構建體能夠組裝成在其表面上表達HA的三聚體的納米顆粒。應當理解,構成此類三聚體的HA蛋白是融合前形式,並且與單體亞基的連接和在納米顆粒上的表達使融合前蛋白以其三聚體形式穩定化。這是重大的,因為HA蛋白以更天然的形式呈現,意味著莖多肽的某些表面不被暴露,從而降低莖多肽可能誘導不利抗體應答的風險。

在一個實施方案中,HA蛋白包含來自流感HA蛋白的莖區的至少一個免疫原性部分,其中所述蛋白引發針對流感病毒的保護性抗體。在一個實施方案中,HA蛋白包含來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒的HA蛋白的莖區的至少一個免疫原性部分,其中蛋白質引發針對流感病毒的保護性抗體。在一個實施方案中,HA蛋白包含來自選自以下的HA蛋白的莖區的至少一個免疫原性部分:H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,流感H4病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白和H18流感病毒HA蛋白。

在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含與選自下組的序列至少80%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含選自下組的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含與選自下組的序列至少80%相同的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含選自下組的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,包含HA蛋白的免疫原性部分的此類蛋白質引發針對流感病毒的廣泛保護性抗體的產生。

蛋白質的免疫原性部分包含表位,其是被免疫系統識別的胺基酸殘基的簇,從而引發免疫應答。此類表位可以由連續的胺基酸殘基(即,在蛋白質中彼此相鄰的胺基酸殘基)組成,或者它們可以由非連續的胺基酸殘基(即,蛋白質中彼此不相鄰的胺基酸殘基),但在最終摺疊的蛋白質中緊密空間接近。本領域技術人員完全理解,表位需要最少六個胺基酸殘基,以便被免疫系統識別。因此,在一個實施方案中,來自流感HA蛋白的免疫原性部分包含至少一個表位。在一個實施方案中,HA蛋白包含來自流感HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸。在一個實施方案中,HA蛋白包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸,所述HA蛋白來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒。在一個實施方案中,HA蛋白包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸,所述HA蛋白來自選自H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白和H18流感病毒HA蛋白。在一個實施方案中,HA蛋白包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸,所述HA蛋白來自於選自下組的病毒株:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)及其變體。在一個實施方案中,胺基酸是來自HA蛋白的莖區的連續胺基酸。在一個實施方案中,包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸的此類蛋白質引發針對流感病毒的廣泛保護性抗體的產生。本發明的一個實施方案是包含蛋白質構建體,其包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸,所述HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的胺基酸序列。本發明的一個實施方案是蛋白質構建體,其包含來自HA蛋白的莖區的至少6個胺基酸,至少10個胺基酸,至少25個胺基酸,至少50個胺基酸,至少75個胺基酸或至少100個胺基酸,所述HA蛋白包含選自下組的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,胺基酸是來自HA蛋白的莖區的連續胺基酸。在一個實施方案中,胺基酸是非連續的,但在最終蛋白質中緊密空間接近。

雖然本申請例示了來自幾種示例性HA蛋白的莖區序列的使用,但是本發明也可以使用來自包含所公開的HA序列的變異的蛋白質的莖區來實施。因此,在一個實施方案中,HA蛋白來自選自下組的病毒:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)及其變體。在一個實施方案中,HA蛋白包含與HA蛋白的莖區至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的胺基酸序列,所述HA蛋白包含選自下組的胺基酸序列:SEQ ID NO:8,SEQ ID NO:,SEQ ID NO:11,SEQ ID NO:14,SEQ ID NO:17。在一個實施方案中,HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:,SEQ ID NO:11,SEQ ID NO:14,SEQ ID NO:17的胺基酸序列。

在一個實施方案中,HA蛋白的頭部區序列被接頭序列替換。可以使用任何接頭序列,只要莖區序列能夠形成期望的結構。雖然任何胺基酸可用於製備接頭序列,但優選使用缺少大的或帶電荷的側鏈的胺基酸。優選的胺基酸包括但不限於絲氨酸,甘氨酸和丙氨酸。在一個實施方案中,接頭由絲氨酸和甘氨酸殘基製成。接頭序列的長度可以變化,但是優選的實施方案使用最短的可能序列,以允許莖序列形成期望的結構。在一個實施方案中,接頭序列的長度小於10個胺基酸。在一個實施方案中,接頭序列的長度小於5個胺基酸。在優選的實施方案中,接頭序列缺乏來自HA蛋白的頭部區的連續胺基酸序列。在一個實施方案中,接頭序列包含來自HA蛋白頭部區的少於5個連續胺基酸。

如上所述,HA序列與單體亞基蛋白的部分連接。如本文中使用的,單體亞基蛋白是指能夠結合其它單體亞基蛋白的蛋白單體,使得單體亞基蛋白自組裝成納米顆粒。任何單體亞基蛋白可以用於產生本發明的蛋白質構建體,只要該蛋白質構建體能夠形成在其表面上展示HA蛋白的多聚體結構。在一個實施方案中,單體亞基是鐵蛋白。

鐵蛋白是在所有動物,細菌和植物中發現的球狀蛋白,其主要通過將水合鐵離子和質子運輸到礦化核心和從礦化核心運輸來控制多核Fe(III)2O3形成的速率和位置起作用。鐵蛋白的球狀形式由單體亞基蛋白(也稱為單體鐵蛋白亞基)組成,其是具有約17-20kDa的分子量的多肽。一個此類單體鐵蛋白亞基的序列的實例由SEQ ID NO:2表示。每個單體鐵蛋白亞基具有螺旋束的拓撲結構,其包括四個反向平行螺旋基序,具有大致垂直於4螺旋束的長軸的第五較短螺旋(c-端螺旋)。根據慣例,螺旋分別從N-末端標記為「A,B,C和D&E」。N-末端序列位於納米顆粒三折軸附近並延伸到表面,而E螺旋在四摺疊軸上聚集在一起,C-末端延伸到顆粒核心中。這種包裝的結果在納米顆粒表面上創建兩個孔。預期這些孔中的一個或兩個代表水合鐵擴散進入和離開納米顆粒的點。產生後,這些單體鐵蛋白亞基蛋白自組裝成球狀鐵蛋白蛋白。因此,鐵蛋白的球狀形式包含24個單體,鐵蛋白亞基蛋白,並具有432對稱的殼體樣結構。

根據本發明,本發明的單體鐵蛋白亞基是鐵蛋白蛋白的全長單一多肽或其任何部分,其能夠指導單體鐵蛋白亞基自組裝成蛋白質的球狀形式。此類蛋白質的實例包括但不限於SEQ ID NO:2和SEQ ID NO:5。來自任何已知的鐵蛋白蛋白的單體鐵蛋白亞基的胺基酸序列可以用於產生本發明的蛋白質構建體,只要單體鐵蛋白亞基能夠自組裝成在其表面上展示HA的納米顆粒。在一個實施方案中,單體亞基來自選自下組的鐵蛋白蛋白:細菌鐵蛋白蛋白,植物鐵蛋白蛋白,藻鐵蛋白蛋白,昆蟲鐵蛋白蛋白,真菌鐵蛋白蛋白和哺乳動物鐵蛋白蛋白。在一個實施方案中,所述鐵蛋白蛋白來自幽門螺桿菌(Helicobacter pylori)。

本發明的蛋白質構建體不需要包含鐵蛋白蛋白的單體亞基多肽的全長序列。可以使用單體鐵蛋白亞基蛋白的部分或區域,只要該部分包含指導單體鐵蛋白亞基自組裝成蛋白的球形形式的胺基酸序列。此類區域的一個實例位於幽門螺桿菌鐵蛋白蛋白的胺基酸5和167之間。更具體的區域描述於Zhang,Y.Self-Assembly in the Ferritin Nano-Cage Protein Super Family.2011,Int.J.Mol.Sci.,12,5406-5421,其通過引用整體併入本文。

在一個實施方案中,HA蛋白與來自鐵蛋白的至少50個,至少100個或至少150個胺基酸連接,其中所述蛋白質構建體能夠形成納米顆粒。在一個實施方案中,HA蛋白與來自SEQ ID NO:2或SEQ ID NO:5的至少50,至少100或至少150個胺基酸連接,其中所述蛋白質構建體能夠形成納米顆粒。在一個實施方案中,HA蛋白與蛋白質連接,所述蛋白質包含與鐵蛋白序列至少85%,至少90%或至少95%相同的胺基酸序列,其中蛋白質構建體能夠形成納米顆粒。在一個實施方案中,HA蛋白與蛋白質連接,所述蛋白質包含與SEQ ID NO:2或SEQ ID NO:5至少85%,至少90%,至少95%相同的胺基酸序列,其中所述蛋白質構建體形成納米顆粒。

在一個實施方案中,單體亞基是2,4-二氧四氫蝶啶合成酶(lumazine synthase)。在一個實施方案中,HA蛋白與來自2,4-二氧四氫蝶啶合酶的至少50個,至少100個或至少150個胺基酸連接,其中所述蛋白質構建體能夠形成納米顆粒。因此,在一個實施方案中,HA蛋白與蛋白質連接,所述蛋白質與2,4-二氧四氫蝶啶合酶至少85%,至少90%,至少95%相同,其中蛋白質構建體能夠形成納米顆粒。

如本文中使用的,本發明的納米顆粒是指通過本發明的蛋白質構建體(融合蛋白)的自組裝形成的三維顆粒。本發明的納米顆粒通常是球形形狀的,儘管不排除其它形狀,並且通常直徑為約20nm至約100nm。本發明的納米顆粒可以但不需要包含除了蛋白質構建體外的分子,如蛋白質,脂質,碳水化合物等,它們從所述蛋白質構建體中形成。

可以使用重組技術製備本發明的蛋白質構建體以將HA蛋白,接頭和單體亞基的各部分連接在一起。以這種方式,可以產生僅包含產生納米顆粒疫苗所必需的那些序列的蛋白質構建體。因此,本發明的一個實施方案是蛋白質構建體(也稱為融合蛋白),其包含來自流感病毒HA蛋白的莖區的第一胺基酸序列和來自流感病毒HA蛋白的莖區的第二胺基酸序列,所述第一和第二胺基酸序列通過接頭序列共價連接,

其中所述第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸殘基;

其中所述第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少20個連續胺基酸殘基;和,

其中所述第一或第二胺基酸序列與單體亞基結構域的至少部分連接,使得所述蛋白質構建體能夠形成納米顆粒。

在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第一胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,第一胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含選自下組的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。

在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒和H18流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,HA蛋白包含來自蛋白質的至少一個免疫原性部分,所述蛋白質包含選自下組的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。

如上所述,第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸殘基。根據本發明,術語上遊指與頭部區的第一個胺基酸殘基的氨基端末端連接的胺基酸序列的全部。在一個實施方案中,頭部區的氨基端末端位於對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白(SEQ ID NO:8)的Cys59的胺基酸殘基。因此,在一個實施方案中,第一胺基酸序列包含來自對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基1-58的HA蛋白區域的至少20個連續胺基酸殘基。在一個實施方案中,第一胺基酸序列包含來自與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少20個連續胺基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少20個連續胺基酸殘基。

在一個實施方案中,第一胺基酸序列包含來自對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基1-58的HA蛋白的胺基酸區域的至少40個連續胺基酸殘基。在一個實施方案中,第一胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少40個連續胺基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少40個連續胺基酸殘基。

在一個實施方案中,第一胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列。

在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,H3流感病毒,H4流感病毒,H5流感病毒病毒,H6流感病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒病毒,H16流感病毒,H17流感病毒和H18流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的序列。

如上所述,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少20個連續胺基酸殘基。根據本發明,術語下遊指與頭部區的羧基端末端胺基酸殘基連接的整個胺基酸序列。在一個實施方案中,頭部區的羧基端末端位於對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白(SEQ ID NO:8)的Cys291的胺基酸位置。因此,在一個實施方案中,第二胺基酸序列包含來自對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基292-517的HA蛋白的胺基酸區域的至少20個連續胺基酸。在一個實施方案中,第二胺基酸序列包含來自對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基328-517的HA蛋白的胺基酸區域的至少20個連續胺基酸。在一個實施方案中,第二胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少20個連續胺基酸殘基:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含來自選自下組的序列的至少20個連續胺基酸殘基:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。

在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少40個,至少60個,至少75個,至少100個或至少150個連續胺基酸。在一個實施方案中,第二胺基酸序列包含來自HA蛋白的胺基酸區的至少40個,至少60個,至少75個,至少100個或至少150個連續胺基酸,所述胺基酸區對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基292-517。在一個實施方案中,第二胺基酸序列包含來自HA蛋白的胺基酸區域的至少40個,至少60個,至少75個,至少100個或至少150個連續胺基酸,所述胺基酸區域對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基328-517。在一個實施方案中,第二胺基酸序列包含來自序列的至少40,至少60,至少75,至少100或至少150個連續胺基酸,所述序列與選自下組的序列至少85%,至少90%,至少95%或至少97%相同:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含來自下組的至少40,至少60,至少75,至少100或至少150個連續胺基酸:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含選自下組的序列:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。

如上所述,蛋白質構建體的第一和第二胺基酸序列可以通過接頭序列連接。可以使用任何接頭序列,只要該接頭序列具有距HA蛋白的頭部區少於5個連續的胺基酸殘基,並且只要第一和第二胺基酸能夠形成期望的構象即可。在一個實施方案中,接頭序列長度小於10個胺基酸,小於7個胺基酸或小於5個胺基酸。在一個實施方案中,接頭序列包含甘氨酸和絲氨酸。在一個實施方案中,接頭序列將第一胺基酸序列的羧基端末端連接到第二胺基酸序列的氨基端末端。在一個實施方案中,接頭序列將第二胺基酸序列的羧基端末端連接到第一胺基酸序列的氨基端末端。

如上所述,蛋白質構建體的第一或第二胺基酸序列與單體亞基蛋白的至少部分連接,使得蛋白質構建體能夠形成納米顆粒。在一個實施方案中,單體亞基蛋白的至少部分連接到第二胺基酸序列。在優選的實施方案中,單體亞基蛋白的至少部分連接到第二胺基酸序列的羧基端末端。在一個實施方案中,所述部分包含來自單體亞基的至少50個,至少100個或至少150個胺基酸。在一個實施方案中,單體亞基是鐵蛋白。在一個實施方案中,單體亞基是2,4-二氧四氫蝶啶合成酶。在一個實施方案中,所述部分包含來自SEQ ID NO:2,SEQ ID NO:5或SEQ ID NO:194的至少50,至少100或至少150個胺基酸。在一個實施方案中,單體亞基包含與SEQ ID NO:2,SEQ ID NO:5或SEQ ID NO:194具有至少85%相同,至少90%相同或至少95%相同的序列。在一個實施方案中,單體亞基包含選自SEQ ID NO:2,SEQ ID NO:5和SEQ ID NO:194的序列。

發明人已經發現,上述蛋白質構建體的流感HA序列的修飾導致蛋白質構建體的改進的穩定性。例如,本發明人已經發現,從對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白(SEQ ID NO:8)的胺基酸N403-W435的胺基酸區的HA蛋白的缺失導致更穩定的蛋白質構建體。在該區域缺失時,該區域側翼的胺基酸序列可以直接連接在一起,或者它們可以用接頭序列如例如甘氨酸-絲氨酸-甘氨酸連接。因此,在一個實施方案中,第二胺基酸序列包含與來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的多肽序列,其中所述多肽序列缺少對應於來自流感A/新喀裡多尼亞1999的HA蛋白(SEQ ID NO:8)的SEQ ID NO:133,SEQ ID NO:134,SEQ ID NO:135或SEQ ID NO:136的區域。在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基,其中多肽序列缺乏對應於流感A/新喀裡多尼亞1999的HA蛋白(SEQ ID NO:8)的SEQ ID NO:133,SEQ ID NO:134,SEQ ID NO:135或SEQ ID NO:136的區域。

在一個實施方案中,第二胺基酸序列包含與來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的多肽序列,其中所述多肽序列缺少對應於流感A/加利福尼亞/4/2009的HA蛋白(SEQ ID NO:10)的SEQ ID NO:137,SEQ ID NO:138,SEQ ID NO:139或SEQ ID NO:140的區域。在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基,其中多肽序列缺少對應於流感A/加利福尼亞/4/2009的HA蛋白(SEQ ID NO:10)的SEQ ID NO:137,SEQ ID NO:138,SEQ ID NO:139或SEQ ID NO:140的區域。

在一個實施方案中,第二胺基酸序列包含與來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的胺基酸序列,其中多肽序列缺少對應於流感A/新加坡/1957的HA蛋白(SEQ ID NO:12)的SEQ ID NO:141,SEQ ID NO:142,SEQ ID NO:143或SEQ ID NO:144的區域。在一個實施方案中,第二胺基酸序列包含與來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的胺基酸序列,其中多肽序列缺少對應於流感A/新加坡/1957的HA蛋白(SEQ ID NO:12)的SEQ ID NO:141,SEQ ID NO:142,SEQ ID NO:143或SEQ ID NO:144的區域。

在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基,其中多肽序列缺少對應於流感A/印度尼西亞/05/2005(H5)的HA蛋白(SEQ ID NO:16)的SEQ ID NO:145,SEQ ID NO:146,SEQ ID NO:147或SEQ ID NO:148的區域。在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基,其中多肽序列缺少對應於流感A/印度尼西亞/05/2005(H5)的HA蛋白(SEQ ID NO:16)的SEQ ID NO:145,SEQ ID NO:146,SEQ ID NO:147或SEQ ID NO:148的區域。

在一個實施方案中,第二胺基酸序列包含與SEQ ID NO:23,SEQ ID NO:26或SEQ ID NO:29的100個連續胺基酸至少85%,至少90%或至少95%相同的序列,其中所述100個連續胺基酸不包含選自SEQ ID NO:133,SEQ ID NO:134,SEQ ID NO:135和SEQ ID NO:136的序列。在一個實施方案中,第二胺基酸序列包含來自SEQ ID NO:23,SEQ ID NO:26或SEQ ID NO:29的100個連續胺基酸,其中所述100個連續胺基酸不包含選自下組的序列:SEQ ID NO:133,SEQ ID NO:134,SEQ ID NO:135和SEQ ID NO:136。

在一個實施方案中,第二胺基酸序列包含與SEQ ID NO:38,SEQ ID NO:41或SEQ ID NO:44的100個連續胺基酸至少85%,至少90%或至少95%相同的序列,其中所述100個連續胺基酸不包含選自SEQ ID NO:137,SEQ ID NO:138,SEQ ID NO:139和SEQ ID NO:140的序列。在一個實施方案中,第二胺基酸序列包含來自SEQ ID NO:38,SEQ ID NO:41或SEQ ID NO:44的100個連續胺基酸,其中所述100個連續胺基酸不包含選自下組的序列:SEQ ID NO:137,SEQ ID NO:138,SEQ ID NO:139和SEQ ID NO:140。

在一個實施方案中,第二胺基酸序列包含與SEQ ID NO:53,SEQ ID NO:56或SEQ ID NO:59的100個連續胺基酸至少85%,至少90%或至少95%相同的序列,其中所述100個連續胺基酸不包含選自SEQ ID NO:141,SEQ ID NO:142,SEQ ID NO:143和SEQ ID NO:144的序列。在一個實施方案中,第二胺基酸序列包含來自SEQ ID NO:53,SEQ ID NO:56或SEQ ID NO:59的100個連續胺基酸,其中所述100個連續胺基酸不包含選自下組的序列:SEQ ID NO:141,SEQ ID NO:142,SEQ ID NO:143和SEQ ID NO:144。

在一個實施方案中,第二胺基酸序列包含與SEQ ID NO:68,SEQ ID NO:71或SEQ ID NO:74的100個連續胺基酸至少85%,至少90%或至少95%相同的序列,其中所述100個連續胺基酸不包含選自SEQ ID NO:145,SEQ ID NO:146,SEQ ID NO:147和SEQ ID NO:148的序列。在一個實施方案中,第二胺基酸序列包含來自SEQ ID NO:68,SEQ ID NO:71或SEQ ID NO:74的100個連續胺基酸,其中所述100個連續胺基酸不包含選自下組的序列:SEQ ID NO:145,SEQ ID NO:146,SEQ ID NO:147和SEQ ID NO:148。

在一個實施方案中,第二胺基酸序列包含與來自選自下組的序列的100個連續胺基酸至少85%,至少90%或至少95%相同的序列:SEQ ID NO:26,SEQ ID NO:28,SEQ ID NO:32,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:71和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含來自選自下組的序列的至少100個連續胺基酸:SEQ ID NO:26,SEQ ID NO:32,SEQ ID NO:41,SEQ ID NO:47,SEQ ID NO:56,SEQ ID NO:62,SEQ ID NO:71和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列包含選自下組的序列:SEQ ID NO:26,SEQ ID NO:32,SEQ ID NO:41,SEQ ID NO:47,SEQ ID NO:56,SEQ ID NO:62,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。

本發明人還發現了,HA莖區序列的序列改變導致更穩定的蛋白質構建體。例如,在摺疊的HA蛋白中,對應於流感A新喀裡多尼亞/20/1999(H1)的K394和E446(對應於SEQ ID NO:149的K1和E53)的胺基酸殘基形成鹽橋,有助於穩定摺疊的蛋白質。本發明人已經發現,通過用合適的胺基酸取代賴氨酸和穀氨酸殘基,可以加強兩個胺基酸殘基之間的相互作用,這改善了分子的穩定性並允許對其進行更廣泛的操作。因此,本發明的一個實施方案是蛋白質構建體,其包含來自流感病毒HA蛋白的莖區的第一胺基酸序列和來自流感病毒HA蛋白的莖區的第二胺基酸序列,所述第一和第二胺基酸酸序列通過接頭序列共價連接,

其中所述第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸性殘基,

其中所述第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少60個連續胺基酸,

其中所述60個連續胺基酸包含對應於來自A/新喀裡多尼亞/20/1999的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,且

其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的所述多肽序列中的胺基酸殘基被除賴氨酸以外的胺基酸取代,

並且對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的胺基酸殘基被除穀氨酸之外的胺基酸殘基取代,使得取代的胺基酸殘基之間的相互作用的強度大於在野生型蛋白中的相互作用的強度。

如上所述,對應於流感A新喀裡多尼亞/20/1999(H1)的K394和E446的胺基酸殘基形成鹽橋,其是一類鍵。本領域已知存在胺基酸之間的其它類型的鍵,其強度根據鍵的類型而變化。此類鍵的實例包括但不限於疏水鍵和氫鍵,二者通常比鹽橋更強。因此,在一個實施方案中,對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽中的胺基酸殘基和對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽中的胺基酸殘基被改變,使得它們在最終摺疊的蛋白質中形成氫鍵。在一個實施方案中,對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽中的胺基酸殘基和對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽中的胺基酸殘基被改變,使得它們在最終摺疊的蛋白質中形成疏水鍵。

對應於SEQ ID NO:149的K1,SEQ ID NO:150的K1,SEQ ID NO:149的E53或SEQ ID NO:150的E20的胺基酸可以被任何胺基酸殘基取代,只要兩個胺基酸之間的所得相互作用比未改變的蛋白質中的鹽橋更強。增加對應於流感A新喀裡多尼亞/20/1999(H1)的K394和E446(SEQ ID NO:149的K1和E53)的胺基酸之間的相互作用強度的取代的實例包括但不限於:

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被甲硫氨酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被亮氨酸取代;

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被甲硫氨酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被甲硫氨酸取代;

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被亮氨酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被亮氨酸取代;

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被異亮氨酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被異亮氨酸取代;

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被亮氨酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被異亮氨酸取代;

其中對應於SEQ ID NO:149的K1的多肽序列中的胺基酸殘基被穀氨醯胺取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基被穀氨醯胺取代。

在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實施方案中,第一胺基酸序列來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第一胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,第一胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。

在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實施方案中,第二胺基酸序列來自來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。

在一個實施方案中,第一胺基酸序列包含來自對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基1-58的HA蛋白的區域的至少20個連續胺基酸殘基。在一個實施方案中,第一胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少20個連續胺基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少20個連續胺基酸殘基。

在一個實施方案中,第一胺基酸序列包含來自HA蛋白的胺基酸區域的至少40個連續胺基酸殘基,所述胺基酸區域對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基1-58。在一個實施方案中,第一胺基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少40個連續胺基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少40個連續胺基酸殘基。在一個實施方案中,第一胺基酸序列包含與選自下組的序列至少85%,至少90%或至少95%相同的序列:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實施方案中,第一胺基酸序列包含選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列。

在一個實施方案中,第二胺基酸序列來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒的HA蛋白的莖區。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:H1流感病毒,H2流感病毒,H3流感病毒,H4流感病毒,H5流感病毒病毒,H6流感病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒病毒,H16流感病毒,H17流感病毒和H18流感病毒。在一個實施方案中,第二胺基酸序列來自選自下組的病毒的HA蛋白的莖區:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實施方案中,第二胺基酸序列來自HA蛋白的莖區,所述HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的序列。

在一個實施方案中,第二胺基酸序列的至少60個連續胺基酸來自HA蛋白的胺基酸區,所述胺基酸區對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基292-517。在一個實施方案中,第二胺基酸序列的至少60個連續胺基酸來自HA蛋白的胺基酸區,其對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基328-517。在一個實施方案中,第二胺基酸序列的至少60個連續胺基酸來自與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。在一個實施方案中,第二胺基酸序列的至少60個連續胺基酸來自選自下組的序列

SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。

在一個實施方案中,第二胺基酸序列包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少75個,至少100個,至少150個或至少200個連續胺基酸,其中至少75個,至少100個,至少150個或至少200個連續胺基酸包含對應於H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,並且其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的胺基酸殘基,和多肽序列中對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的胺基酸殘基已經分別被除了賴氨酸和穀氨酸外的胺基酸取代,使得取代的胺基酸殘基之間的相互作用的強度大於野生型蛋白質中的相互作用的強度。在一個實施方案中,第二胺基酸序列包含來自HA蛋白的胺基酸區域的至少75,至少100,至少150或至少200個連續胺基酸,所述HA蛋白對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基292-517,其中所述至少75個,至少100個,至少150個或至少200個連續胺基酸包含對應於H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,並且其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的胺基酸殘基,和對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的胺基酸殘基分別被除了賴氨酸和穀氨酸之外的胺基酸取代,使得取代的胺基酸殘基之間的相互作用的強度大於強度的野生型蛋白中的相互作用。在一個實施方案中,第二胺基酸序列包含來自HA蛋白的胺基酸區域的至少75個,至少100個,至少150個或至少200個連續胺基酸,所述胺基酸區域對應於流感A新喀裡多尼亞/20/1999(H1)(SEQ ID NO:8)的胺基酸殘基328-517,其中至少75,至少100,至少150或至少200個連續胺基酸包含對應於H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,並且其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的胺基酸殘基,和對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的胺基酸殘基分別被除了賴氨酸和穀氨酸之外的胺基酸取代,使得取代的胺基酸殘基之間的相互作用的強度大於強度的野生型蛋白中的相互作用。在一個實施方案中,第二胺基酸序列包含來自與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少75,至少100,至少150或至少200個連續胺基酸:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77,其中所述至少75個,至少100個,至少150個或至少200個連續胺基酸包含對應於H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,和其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的胺基酸殘基和對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的胺基酸殘基分別被除了賴氨酸和穀氨酸之外的胺基酸取代,使得取代的胺基酸殘基之間的相互作用的強度大於野生型蛋白質中相互作用的強度。在一個實施方案中,第二胺基酸序列包含來自下組的至少75,至少100,至少150或至少200個連續胺基酸:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。其中所述至少75個,至少100個,至少150個或至少200個連續胺基酸包含對應於H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,並且其中對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的胺基酸殘基,並且多肽序列中對應於SEQ ID NO:149的E53或SEQ ID NO:150的E20的胺基酸殘基已經分別被除了賴氨酸和穀氨酸之外的胺基酸取代,使得取代的胺基酸殘基之間的相互作用的強度大於野生型蛋白質中的相互作用的強度。

含有規定位點特異性突變的蛋白質構建體可用於通過將本發明的納米顆粒連接到單體亞基來製備本發明的納米顆粒。因此,在一個實施方案中,將含有所公開的位點特異性突變(例如,SEQ ID NO:149或SEQ ID NO:150的K1和SEQ ID NO:149的E53或SEQ ID NO:150的E20)的蛋白質構建體連接到單體亞基蛋白的至少部分,其中所述單體亞基蛋白的所述部分能夠指導蛋白質構建體的自組裝。在一個實施方案中,單體亞基蛋白的至少部分連接到第二胺基酸序列。在優選的實施方案中,單體亞基蛋白的至少部分連接到第二胺基酸序列的羧基端末端。在一個實施方案中,所述部分包含來自單體亞基的至少50個,至少100個或至少150個胺基酸。在一個實施方案中,單體亞基是鐵蛋白。在一個實施方案中,單體亞基是2,4-二氧四氫蝶啶合成酶。在一個實施方案中,單體亞基包含與SEQ ID NO:2,SEQ ID NO:5或SEQ ID NO:194至少85%相同,至少90%相同或至少95%相同的序列。在一個實施方案中,單體亞基包含選自SEQ ID NO:2,SEQ ID NO:5和SEQ ID NO:194的序列。

儘管對本文公開的HA蛋白進行的修飾已經描述為單獨的實施方案,但是應當理解,所有此類修飾可以包含在單一蛋白質構建體中。例如,可以製備蛋白質構建體,其中第一胺基酸序列通過接頭連接到第二胺基酸序列,其中第二胺基酸序列包含來自頭部區的羧基端末端下遊的區域的胺基酸序列,但是缺乏由SEQ ID NO:133-148表示的內部環序列,並且其中對應於SEQ ID NO:149的K1或SEQ ID NO:50的K1和SEQ ID NO:149的E53或SEQ ID NO:150的E20的第二胺基酸序列中的胺基酸分別被除了賴氨酸和穀氨酸之外的胺基酸取代,以增加摺疊蛋白中這些胺基酸殘基之間的相互作用的強度。因此,本發明的一個實施方案是蛋白質構建體,其包含來自流感病毒HA蛋白的莖區的第一胺基酸序列和來自流感病毒HA蛋白的莖區的第二胺基酸序列,所述第一和第二胺基酸酸序列通過接頭序列共價連接,

其中所述第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸殘基;

其中所述第二胺基酸序列包含與來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的多肽序列,

其中所述多肽序列包含對應於由SEQ ID NO:150代表的流感A新喀裡多尼亞/20/1999(H1)中的序列,由SEQ ID NO:152代表的流感A加利福尼亞/2009(H1)中的序列,由SEQ ID NO:154表示的流感A新加坡/1957(H2)中的序列和由SEQ ID NO:156表示的流感A印度尼西亞/2005H5)中的序列和,

其中對應於SEQ ID NO:150的K1的多肽序列中的胺基酸殘基已經被除了賴氨酸之外的胺基酸取代,並且對應於SEQ ID NO:150的E20的胺基酸殘基已經被除了穀氨酸外的胺基酸取代。

在一個實施方案中,多肽包含來自頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸。在一個實施方案中,所述至少100個連續胺基酸包含SEQ ID NO:150。在一個實施方案中,所述至少100個連續胺基酸包含SEQ ID NO:152。在一個實施方案中,所述至少100個連續胺基酸序列包含SEQ ID NO:154。在一個實施方案中,所述至少100個連續胺基酸包含SEQ ID NO:156。應當理解,在上述構建體中,當除去內部環區時,剩餘的HA蛋白的各個末端可以直接連接在一起。然而,在一些情況下,此類直接連接可能降低肽主鏈的柔性。因此,在一些情況下,用接頭序列替代內部環區域可能是有益的。作為實例,如果六個胺基酸接頭序列插入SEQ ID NO:150,則最終序列可表現如下:VNSVIEKMGSGGSGTYNAELLVLL。

因此,在一個實施方案中,蛋白質構建體的多肽序列包含SEQ ID NO:150,其中插入短接頭序列。在一個實施方案中,蛋白質構建體的多肽序列包含SEQ ID NO:152,其中插入短接頭序列。在一個實施方案中,蛋白質構建體的多肽序列包含SEQ ID NO:154,其中插入短接頭序列。在一個實施方案中,蛋白質構建體的多肽序列包含SEQ ID NO:156,其中插入短接頭序列。在一個實施方案中,接頭由絲氨酸和甘氨酸殘基製成。在一個實施方案中,接頭的長度少於10個胺基酸。在一個實施方案中,接頭的長度少於5個胺基酸。在一個實施方案中,接頭的長度少於3個胺基酸。

儘管上文所述的蛋白質構建體可用於產生能夠產生針對一種或多種流感病毒的免疫應答的納米顆粒,但是在一些實施方案中,可能有用的是將進一步的突變工程化改造到本發明的蛋白質的胺基酸序列中。例如,可以有用的是改變單體亞基蛋白,三聚化結構域或接頭序列中的位點,如酶識別位點或糖基化位點,以便對蛋白質給予有益的性質(例如溶解度,半衰期,免於免疫監視的蛋白質的掩蔽部分)。在這方面,已知鐵蛋白的單體亞基不是天然糖基化的。然而,如果其在哺乳動物或酵母細胞中作為分泌性蛋白質表達,則其可以被糖基化。因此,在一個實施方案中,來自單體鐵蛋白亞基的胺基酸序列中的潛在N連接的糖基化位點被突變,使得突變的鐵蛋白亞基序列在突變位點不再被糖基化。突變的單體鐵蛋白亞基的一個此類序列由SEQ ID NO:5表示。

也可以改變蛋白質構建體序列以包括其它有用的突變。例如,在一些情況下,可以期望阻斷針對蛋白質構建體中的某些胺基酸序列的免疫應答的產生。這可以通過在待阻斷的位點附近添加糖基化位點來完成,使得聚糖在空間上阻礙免疫系統到達阻斷位點的能力。因此,在一個實施方案中,蛋白質構建體的序列已經改變為包括一個或多個糖基化位點。這樣的位點的實例包括但不限於Asn-X-Ser,Asn-X-Thr和Asn-X-Cys。在一些情況下,可以將糖基化位點引入接頭序列中。引入糖基化位點的有用位點的其它實例包括但不限於對應於來自流感A新喀裡多尼亞/20/1999(H1)的胺基酸45-47或胺基酸370-372的胺基酸。引入糖基化位點的方法是本領域技術人員已知的。

本文的公開內容證明在HA或單體亞基蛋白中的特定位置處的突變產生有用的蛋白質構建體,並因此產生本發明的納米顆粒。引入突變的鐵蛋白蛋白質中有用位置的實例包括對應於選自下組的胺基酸位置的胺基酸:SEQ ID NO:2的胺基酸位置18,胺基酸位置20和胺基酸位置68。引入突變的有用位置的實例包括HA蛋白中對應於選自下組的胺基酸位置的胺基酸:流感A新喀裡多尼亞/20/1999(H1)的HA蛋白(SEQ ID NO:8)的胺基酸位置36,胺基酸位置45,胺基酸位置47,胺基酸位置49,胺基酸位置339,胺基酸位置340,胺基酸位置341,胺基酸位置342,胺基酸位置361,胺基酸位置372,胺基酸位置394,胺基酸位置402,胺基酸位置437,胺基酸位置438,胺基酸位置445,胺基酸位置446,胺基酸位置448,胺基酸449,胺基酸位置450和胺基酸位置452。表2中列出了此類突變的一些實例。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置36的位置處包含異亮氨酸或與其具有相似性質的胺基酸殘基A。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置45的位置處包含天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999的HA蛋白的胺基酸位置47的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置49的位置處包含色氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置339的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置340的位置處包含精氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置341的位置包含穀氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置342的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基(H1)。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置372的位置處包含蘇氨酸或與其具有相似性質的胺基酸殘基(H1)。在一個實施方案中,蛋白質構建體的HA部分在對應於在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置394的位置包含甲硫氨酸,異亮氨酸,亮氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置402的位置處包含天冬醯胺,蘇氨酸,甘氨酸,天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置437的位置包含天冬氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置438的位置包含亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置445的位置包含亮氨酸,甲硫氨酸或具有與其相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置446的位置包含異亮氨酸,亮氨酸,甲硫氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置448的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置449的位置包含色氨酸,苯丙氨酸或具有與其相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置450的位置處包含丙氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置452的位置處包含亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分缺少對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸515-517的一個或多個胺基酸。

本發明的一個實施方案是蛋白質構建體,所述蛋白質構建體包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。

在一個實施方案中,對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的胺基酸殘基的胺基酸殘基被除了賴氨酸之外的胺基酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基或SEQ ID NO:20的E20的胺基酸殘基被穀氨酸以外的胺基酸取代,使得在摺疊蛋白中取代的胺基酸之間的相互作用的強度增加。

本發明的一個實施方案是蛋白質構建體,所述蛋白質構建體包含選自下組的序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,當與單體亞基蛋白連接時,蛋白構建體能夠形成納米顆粒,其中納米顆粒能夠引發針對流感病毒的免疫應答。

如前已經描述,由流感HA蛋白製成的蛋白質構建體可以用於通過將其連接到單體亞基來製備本發明的納米顆粒。因此,在一個實施方案中,蛋白質構建體與單體亞基蛋白質的至少一部分連接,其中單體亞基蛋白質的部分能夠指導蛋白質構建體的自組裝。在一個實施方案中,單體亞基蛋白的至少一部分連接到第二胺基酸序列。在優選的實施方案中,單體亞基蛋白的至少一部分連接到第二胺基酸序列的羧基末端。在一個實施方案中,所述部分包含來自單體亞基的至少50個,至少100個或至少150個胺基酸。在一個實施方案中,單體亞基是鐵蛋白。在一個實施方案中,單體亞基是2,4-二氧四氫蝶啶合成酶。在一個實施方案中,單體亞基包含與SEQ ID NO:2,SEQ ID NO:5或SEQ ID NO:194至少85%相同,至少90%相同或至少95%相同的序列。在一個實施方案中,單體亞基包含選自SEQ ID NO:2,SEQ ID NO:5和SEQ ID NO:194的序列。

本發明的一個實施方案是蛋白質構建體,其包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的胺基酸序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397。在一個實施方案中,對應於SEQ ID NO:149的K1或SEQ ID NO:150的K1的胺基酸殘基被除了賴氨酸之外的胺基酸取代,並且對應於SEQ ID NO:149的E53的胺基酸殘基或SEQ ID NO:20的E20被除了穀氨酸之外的胺基酸取代,使得在摺疊蛋白中取代的胺基酸之間的相互作用的強度增加。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置36的位置包含異亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置45的位置處包含天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置47的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置49的位置處包含色氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置339的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置340的位置處包含精氨酸或與其具有相似性質的胺基酸殘基(H1)。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/20的HA蛋白的胺基酸位置341的位置包含穀氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置342的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置372的位置處包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置394的位置包含甲硫氨酸,異亮氨酸,亮氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置402的位置處包含天冬醯胺,蘇氨酸,甘氨酸,天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置437的位置包含天冬氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置438的位置包含亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置445的位置包含亮氨酸,甲硫氨酸或具有與其相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置446的位置包含異亮氨酸,亮氨酸,甲硫氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置448的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置449的位置包含色氨酸,苯丙氨酸或具有與其相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置450的位置處包含丙氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置452的位置處包含亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分缺少對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸515-517的一個或多個胺基酸。

本發明的一個實施方案是蛋白質構建體,所述蛋白質構建體包含選自下組的序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397。

本發明的一個實施方案是由核酸分子編碼的蛋白質構建體,所述核酸分子包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的核酸序列:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。本發明的一個實施方案是由核酸分子編碼的蛋白質構建體,所述核酸分子包含選自下組的核酸序列:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。

本發明的蛋白質和蛋白質構建體由本發明的核酸分子編碼。此外,它們由本發明的核酸構建體表達。如本文所使用的,核酸構建體是重組表達載體,即連接到編碼蛋白質的核酸分子的載體,使得當將核酸構建體施用於,例如,受試者或器官,組織或細胞時,核酸分子可以實現蛋白質表達。載體還能夠將核酸分子轉運到環境內的細胞,例如但不限於生物體,組織或細胞培養物。本公開的核酸構建體通過人類幹預產生。核酸構建體可以是DNA,RNA或其變體。載體可以是DNA質粒,病毒載體或其它載體。在一個實施方案中,載體可以是巨細胞病毒(CMV),逆轉錄病毒,腺病毒,腺伴隨病毒,皰疹病毒,牛痘病毒,脊髓灰質炎病毒,辛德畢斯病毒或任何其它DNA或RNA病毒載體。在一個實施方案中,載體可以是假型化的慢病毒或逆轉錄病毒載體。在一個實施方案中,載體可以是DNA質粒。在一個實施方案中,載體可以是包含能夠進行核酸分子遞送和表達的病毒組分和質粒組分的DNA質粒。構建本公開的核酸構建體的方法是公知的。參見,例如,Molecular Cloning:a Laboratory Manual,3rd edition,Sambrook et al.2001Cold Spring Harbor Laboratory Press,以及Current Protocols in Molecular Biology,Ausubel et al.eds.,John Wiley&Sons,1994。在一個實施方案中,載體是DNA質粒,如CMV/R質粒,如CMV/R或CMV/R 8KB(本文也稱為CMV/R 8kb)。本文提供了CMV/R和CMV/R 8kb的實例。CMV/R也在2006年8月22日授權的US 7,094,598B2中描述。

如本文中使用的,核酸分子包含編碼本發明的蛋白質構建體的核酸序列。核酸分子可以重組地,合成地或通過重組和合成程序的組合產生。本公開的核酸分子可以具有野生型核酸序列或密碼子修飾的核酸序列,以例如摻入由人翻譯系統更好識別的密碼子。在一個實施方案中,核酸分子可以被遺傳工程化以引入或消除編碼不同胺基酸的密碼子,如引入編碼N-連接的糖基化位點的密碼子。產生本公開核酸分子的方法是本領域已知的,特別是一旦知道核酸序列。應當理解,核酸構建體可以包含一個核酸分子或多於一個核酸分子。還應當理解,核酸分子可以編碼一種蛋白質或多於一種蛋白質。

一個實施方案是編碼流感HA蛋白的核酸分子,所述流感HA蛋白包含與選自SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154和SEQ ID NO:156的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的胺基酸序列。一個實施方案是編碼流感HA蛋白的核酸分子,所述流感HA蛋白包含選自SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154和SEQ ID NO:156的胺基酸序列。

在一個實施方案中,核酸分子編碼流感HA蛋白,其包含與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的胺基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實施方案中,核酸分子編碼流感HA蛋白,其包含選自下組的胺基酸:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。

本發明的一個實施方案是核酸分子,所述核酸分子包含與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的核酸序列:SEQ ID NO:79,SEQ ID NO:82,SEQ ID NO:85,SEQ ID NO:88,SEQ ID NO:91,SEQ ID NO:94,SEQ ID NO:97,SEQ ID NO:100,SEQ ID NO:103,SEQ ID NO:157,SEQ ID NO:163,SEQ ID NO:169,SEQ ID NO:175,SEQ ID NO:181,SEQ ID NO:187,SEQ ID NO:196,SEQ ID NO:202,SEQ ID NO:208,SEQ ID NO:216,SEQ ID NO:234,SEQ ID NO:260,SEQ ID NO:267,SEQ ID NO:274,SEQ ID NO:281,SEQ ID NO:288,SEQ ID NO:295,SEQ ID NO:302,SEQ ID NO:309,SEQ ID NO:316,SEQ ID NO:323,SEQ ID NO:330,SEQ ID NO:337,SEQ ID NO:344,SEQ ID NO:351,SEQ ID NO:358,SEQ ID NO:365,SEQ ID NO:372,SEQ ID NO:379,SEQ ID NO:386和SEQ ID NO:393。本發明的一個實施方案是核酸分子,其包含選自下組的核酸:SEQ ID NO:79,SEQ ID NO:82,SEQ ID NO:85,SEQ ID NO:88,SEQ ID NO:91,SEQ ID NO:94,SEQ ID NO:97,SEQ ID NO:100,SEQ ID NO:103,SEQ ID NO:157,SEQ ID NO:163,SEQ ID NO:169,SEQ ID NO:175,SEQ ID NO:181,SEQ ID NO:187,SEQ ID NO:196,SEQ ID NO:202,SEQ ID NO:208,SEQ ID NO:216,SEQ ID NO:234,SEQ ID NO:260,SEQ ID NO:267,SEQ ID NO:274,SEQ ID NO:281,SEQ ID NO:288,SEQ ID NO:295,SEQ ID NO:302,SEQ ID NO:309,SEQ ID NO:316,SEQ ID NO:323,SEQ ID NO:330,SEQ ID NO:337,SEQ ID NO:344,SEQ ID NO:351,SEQ ID NO:358,SEQ ID NO:365,SEQ ID NO:372,SEQ ID NO:379,SEQ ID NO:386和SEQ ID NO:393。

優選的核酸分子是編碼單體亞基,HA蛋白和/或包含與流感HA蛋白連接的單體亞基蛋白的蛋白質構建體的那些。因此,本發明的一個實施方案是包含編碼蛋白質的核酸序列的核酸分子,所述蛋白質包含與流感HA蛋白連接的鐵蛋白蛋白的單體亞基。在一個實施方案中,單體亞基包含與選自SEQ ID NO:2和SEQ ID NO:5的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%序列相同的胺基酸序列。在一個實施方案中,單體亞基包含選自SEQ ID NO:2和SEQ ID NO:5的胺基酸序列。

本發明的一個實施方案是包含編碼蛋白質的核酸序列的核酸分子,所述蛋白質包含與流感HA蛋白連接的2,4-二氧四氫蝶啶合酶單體亞基。在一個實施方案中,單體亞基包含與SEQ ID NO:194至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的胺基酸序列。在一個實施方案中,單體亞基包含SEQ ID NO:194。

本發明的一個實施方案是編碼蛋白質構建體的核酸分子,所述蛋白質構建體包含與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的胺基酸序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397。本發明的一個實施方案是編碼蛋白質構建體的核酸分子,所述蛋白質構建體包含選自下組的序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397。

本發明的一個實施方案是包含核酸序列的核酸分子,所述核酸序列與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同:SEQ ID NO:106,SEQ ID NO:109,SEQ ID NO:112,SEQ ID NO:115,SEQ ID NO:118,SEQ ID NO:121,SEQ ID NO:124,SEQ ID NO:127,SEQ ID NO:130,SEQ ID NO:160,SEQ ID NO:166,SEQ ID NO:172,SEQ ID NO:178,SEQ ID NO:184,SEQ ID NO:190,SEQ ID NO:199,SEQ ID NO:205,SEQ ID NO:211,SEQ ID NO:219,SEQ ID NO:237,SEQ ID NO:263,SEQ ID NO:270,SEQ ID NO:277,SEQ ID NO:284,SEQ ID NO:291,SEQ ID NO:298,SEQ ID NO:305,SEQ ID NO:312,SEQ ID NO:319,SEQ ID NO:326,SEQ ID NO:333,SEQ ID NO:340,SEQ ID NO:347,SEQ ID NO:354,SEQ ID NO:361,SEQ ID NO:368,SEQ ID NO:375,SEQ ID NO:382,SEQ ID NO:389和SEQ ID NO:396。本發明的一個實施方案是包含選自下組的核酸序列的核酸分子:SEQ ID NO:106,SEQ ID NO:109,SEQ ID NO:112,SEQ ID NO:115,SEQ ID NO:118,SEQ ID NO:121,SEQ ID NO:124,SEQ ID NO:127,SEQ ID NO:130,SEQ ID NO:160,SEQ ID NO:166,SEQ ID NO:172,SEQ ID NO:178,SEQ ID NO:184,SEQ ID NO:190,SEQ ID NO:199,SEQ ID NO:205,SEQ ID NO:211,SEQ ID NO:219,SEQ ID NO:237,SEQ ID NO:263,SEQ ID NO:270,SEQ ID NO:277,SEQ ID NO:284,SEQ ID NO:291,SEQ ID NO:298,SEQ ID NO:305,SEQ ID NO:312,SEQ ID NO:319,SEQ ID NO:326,SEQ ID NO:333,SEQ ID NO:340,SEQ ID NO:347,SEQ ID NO:354,SEQ ID NO:361,SEQ ID NO:368,SEQ ID NO:375,SEQ ID NO:382,SEQ ID NO:389和SEQ ID NO:396。

本發明還涵蓋用於產生本發明的蛋白質構建體的表達系統。在一個實施方案中,本發明的核酸分子可操作地連接於啟動子。如本文中使用的,操作連接是指當連接的啟動子被激活時,可以表達由連接的核酸分子編碼的蛋白質。用於實施本發明的啟動子是本領域技術人員已知的。本發明的一個實施方案是包含核酸序列的核酸分子,所述核酸序列與選自下組的序列至少85%,至少90%,至少95%或至少97%相同:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。本發明的一個實施方案是包含選自下組的核酸序列的核酸分子:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。

本發明的一個實施方案是包含本發明的核酸分子的重組細胞。本發明的一個實施方案是包含本發明的核酸分子的重組病毒。

如上指示,本發明的蛋白質構建體的重組生產可以使用本領域目前已知的任何合適的常規重組技術來完成。例如,可以如下在大腸桿菌(E.coli)中進行編碼融合蛋白的核酸分子的產生,即使用編碼合適的單體亞基蛋白(如幽門螺桿菌鐵蛋白單體亞基)的核酸分子,並且將其融合到編碼本文公開的合適的流感蛋白的核酸分子。然後,可以將構建體轉化成蛋白質表達細胞,培養至合適的大小,並誘導產生融合蛋白。

如已經描述的,因為本發明的蛋白質構建體包含單體亞基蛋白質,所以它們可以自組裝。根據本發明,由此類自組裝產生的超分子被稱為HA表達性、基於單體亞基的納米顆粒。為了便於討論,將HA表達性、基於單體亞基的納米顆粒簡稱為納米顆粒(np)。本發明的納米顆粒具有與製備它們的單體蛋白質的納米顆粒相似的結構特徵。例如,關於鐵蛋白,基於鐵蛋白的納米顆粒含有24個亞基並且具有432對稱性。在本發明的納米顆粒的情況下,亞基是包含與流感HA蛋白連接的單體亞基(例如,鐵蛋白,2,4-二氧四氫蝶啶合酶等)的蛋白質構建體。此類納米顆粒在其表面上以HA三聚體展示HA蛋白的至少一部分。在此類構建中,HA三聚體對於免疫系統是可及的,並且因此可以引發免疫應答。因此,本發明的一個實施方案是包含本發明的蛋白構建體的納米顆粒,其中所述蛋白構建體包含來自與單體亞基蛋白連接的HA蛋白的莖區的胺基酸。在一個實施方案中,納米顆粒在其表面上以HA三聚體展示HA蛋白。在一個實施方案中,流感HA蛋白能夠引發針對流感病毒的保護性抗體。

在本發明的一個實施方案中,納米顆粒包含蛋白質構建體,其包含來自流感病毒HA蛋白的莖區的第一胺基酸序列和來自流感病毒HA蛋白的莖區的第二胺基酸序列,所述第一和第二胺基酸序列通過接頭序列共價連接,

其中所述第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸殘基;

其中所述第二胺基酸序列包含來自所述頭部區序列的羧基端末端下遊的胺基酸序列的至少20個連續胺基酸殘基;且

其中所述第一或第二胺基酸序列與單體亞基結構域的至少一部分連接。

在本發明的一個實施方案中,納米顆粒包含蛋白質構建體,其包含來自流感病毒HA蛋白的莖區的第一胺基酸序列和來自流感病毒HA蛋白的莖區的第二胺基酸序列,所述第一和第二胺基酸序列通過接頭序列共價連接,

其中所述第一胺基酸序列包含來自頭部區序列的氨基端末端上遊的胺基酸序列的至少20個連續胺基酸殘基;

其中所述第二胺基酸序列包含與頭部區序列的羧基端末端下遊的胺基酸序列的至少100個連續胺基酸殘基至少85%,至少90%或至少95%相同的多肽序列,

其中所述多肽序列包含與由SEQ ID NO:150代表的流感A新喀裡多尼亞/20/1999(H1)中的序列,由SEQ ID NO:150代表的流感A加利福尼亞/2009中的序列,由SEQ ID NO:154代表的流感A新加坡/1957(H2)中的序列和由SEQ ID NO:156代表的流感A印度尼西亞/2005H5中的序列對應的序列;和

其中所述第一或第二胺基酸序列與單體亞基蛋白連接。

在另一個實施方案中,多肽序列中對應於SEQ ID NO:150的K1的胺基酸殘基已被除賴氨酸以外的胺基酸取代,並且對應於SEQ ID NO:150的E20的胺基酸殘基已經被除穀氨酸以外的胺基酸取代。

在一個實施方案中,在構成納米顆粒的蛋白質構建體的單體亞基部分和/或第一和/或第二胺基酸序列中進行了另外的突變。引入突變的鐵蛋白蛋白質中有用位置的實例包括對應於選自下組的胺基酸位置的胺基酸:SEQ ID NO:2的胺基酸位置18,胺基酸位置20和胺基酸位置68。在一個實施方案中,蛋白質構建體包含在對應於選自下組的胺基酸位置的胺基酸位置處的突變:流感A新喀裡多尼亞/20/1999(H1)的HA蛋白(SEQ ID NO:8)的胺基酸位置36,胺基酸位置45,胺基酸位置47,胺基酸位置49,胺基酸位置339,胺基酸位置340,胺基酸位置341,胺基酸位置342,胺基酸位置361,胺基酸位置372,胺基酸位置394,胺基酸位置402,胺基酸位置437,胺基酸位置438,胺基酸位置445,胺基酸位置446,胺基酸位置448,胺基酸449,胺基酸位置450和胺基酸位置452。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置36的位置包含異亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置45的位置處包含天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置47的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置49的位置處包含色氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置339的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置340的位置處包含精氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置341的位置包含穀氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置342的位置包含蘇氨酸或與其具有相似性質的胺基酸殘基(H1)。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置372的位置處包含蘇氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置394的位置包含甲硫氨酸,異亮氨酸,亮氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置402的位置處包含天冬醯胺,蘇氨酸,甘氨酸,天冬醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置437的位置包含天冬氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置438的位置包含亮氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置445的位置包含亮氨酸,甲硫氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置446的位置包含異亮氨酸,亮氨酸,甲硫氨酸,穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置448的位置處包含穀氨醯胺或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置449的位置包含色氨酸,苯丙氨酸或具有與其相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置450的位置處包含丙氨酸或與其具有相似性質的胺基酸殘基。在一個實施方案中,蛋白質構建體的HA部分在對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸位置452的位置處包含亮氨酸或與其具有相似性質的胺基酸殘基(H1)。在一個實施方案中,蛋白質構建體的HA部分缺少對應於流感A新喀裡多尼亞/20/1999(H1)的HA蛋白的胺基酸515-517的一個或多個胺基酸。

在一個實施方案中,本發明的納米顆粒包含單體亞基蛋白,其包含來自2,4-二氧四氫蝶啶合酶的至少50個胺基酸,至少100個胺基酸或至少150個胺基酸。在一個實施方案中,單體亞基蛋白包含來自選自SEQ ID NO:194的胺基酸序列的至少50個胺基酸,至少100個胺基酸或至少150個胺基酸,和/或包含與SEQ ID NO:194至少85%,至少90%,至少95%,至少97%,至少99%相同的胺基酸序列。在一個實施方案中,單體亞基包含SEQ ID NO:194。

在一個實施方案中,單體亞基蛋白包含來自鐵蛋白蛋白的至少50個胺基酸,至少100個胺基酸或至少150個胺基酸。在一個實施方案中,單體亞基蛋白包含來自選自SEQ ID NO:2和SEQ ID NO:5的胺基酸序列的至少50個胺基酸,至少100個胺基酸或至少150個胺基酸,和或包含與選自SEQ ID NO:2和SEQ ID NO:5的胺基酸序列至少85%,至少90%,至少95%,至少97%,至少99%相同的胺基酸序列。在一個實施方案中,單體鐵蛋白亞基包含SEQ ID NO:2或SEQ ID NO:5。

在一個實施方案中,納米顆粒包含蛋白質構建體,其包含與來自病毒的HA蛋白的至少一個免疫原性部分連接的本發明的單體蛋白質,所述病毒選自A型流感病毒,B型流感病毒和C型流感病毒。在一個實施方案中,蛋白質構建體包含與選自下組的HA蛋白的至少一個免疫原性部分連接的本發明的單體蛋白:H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白,和H18流感病毒HA蛋白。在一個實施方案中,免疫原性部分包含至少一個表位。

在一個實施方案中,納米顆粒包含包含蛋白質構建體,所述蛋白質構建體包含與胺基酸序列連接的本發明的單體蛋白,所述胺基酸序列與選自下組的序列是至少約80%,至少約85%,至少約90%,至少約95%,至少約97%或至少約99%相同的:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400,其中蛋白質構建體能夠選擇性結合抗流感抗體。在一個實施方案中,納米顆粒包含蛋白質構建體,所述蛋白質構建體包含與胺基酸序列連接的本發明的單體蛋白,所述胺基酸序列選自下組:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400,其中蛋白質構建體能夠選擇性結合抗流感抗體。

在本發明的一個實施方案中,納米顆粒包含蛋白質構建體,所述蛋白質構建體包含與選自下組的序列至少80%,至少約85%,至少約90%,至少約95%,至少約97%或至少約99%相同的胺基酸序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397,其中蛋白質構建體能夠選擇性結合抗流感抗體。在本發明的一個實施方案中,納米顆粒包含蛋白質構建體,所述蛋白質構建體包含選自下組的胺基酸序列:SEQ ID NO:107,SEQ ID NO:110,SEQ ID NO:113,SEQ ID NO:116,SEQ ID NO:119,SEQ ID NO:122,SEQ ID NO:125,SEQ ID NO:128,SEQ ID NO:131,SEQ ID NO:161,SEQ ID NO:167,SEQ ID NO:173,SEQ ID NO:179,SEQ ID NO:185,SEQ ID NO:191,SEQ ID NO:200,SEQ ID NO:206,SEQ ID NO:212,SEQ ID NO:215,SEQ ID NO:220,SEQ ID NO:223,SEQ ID NO:225,SEQ ID NO:227,SEQ ID NO:229,SEQ ID NO:231,SEQ ID NO:233,SEQ ID NO:238,SEQ ID NO:241,SEQ ID NO:243,SEQ ID NO:245,SEQ ID NO:247,SEQ ID NO:249,SEQ ID NO:251,SEQ ID NO:253,SEQ ID NO:255,SEQQ ID NO:257,SEQ ID NO:259,SEQ ID NO:264,SEQ ID NO:271,SEQ ID NO:278,SEQ ID NO:285,SEQ ID NO:292,SEQ ID NO:299,SEQ ID NO:306,SEQ ID NO:313,SEQ ID NO:320,SEQ ID NO:327,SEQ ID NO:334,SEQ ID NO:341,SEQ ID NO:348,SEQ ID NO:355,SEQ ID NO:362,SEQ ID NO:369,SEQ ID NO:376,SEQ ID NO:383,SEQ ID NO:390和SEQ ID NO:397。

在一個實施方案中,本發明的納米顆粒包含由核酸分子編碼的蛋白質構建體,所述核酸分子包含與選自下組的序列至少85%,至少90%,至少95%or至少97%相同的核酸序列:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。在一個實施方案中,本發明的納米顆粒包含由核酸分子編碼的蛋白質構建體,所述核酸分子包含選自下組的核酸序列:SEQ ID NO:266,SEQ ID NO:273,SEQ ID NO:SEQ ID NO:280,SEQ ID NO:287,SEQ ID NO:294,SEQ ID NO:301,SEQ ID NO:308,SEQ ID NO:315,SEQ ID NO:322,SEQ ID NO:329,SEQ ID NO:336,SEQ ID NO:343,SEQ ID NO:350,SEQ ID NO:357,SEQ ID NO:364,SEQ ID NO:371,SEQ ID NO:378,SEQ ID NO:385SEQ ID NO:392和SEQ ID NO:399。

本發明的納米顆粒可用於引發對流感病毒的免疫應答。一類免疫應答是B細胞應答,其導致產生針對引發免疫應答的抗原的抗體。因此,在一個實施方案中,納米顆粒引發結合來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒的流感HA蛋白的莖區的抗體。本發明的一個實施方案是納米顆粒,其引發結合流感HA蛋白的莖區的抗體,所述流感HA蛋白選自H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白,和H18流感病毒HA蛋白。本發明的一個實施方案是納米顆粒,其引發結合來自病毒株的流感HA蛋白的莖區的抗體,所述病毒株選自流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)及其變體。

儘管所有抗體能夠結合引發導致抗體產生的免疫應答的抗原,但優選的抗體是那些提供針對流感病毒的廣泛的異亞型保護的抗體。因此,本發明的一個實施方案是引發保護性抗體的納米顆粒,所述保護性抗體結合來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒的流感HA蛋白的莖區。本發明的一個實施方案是引發與流感HA蛋白的莖區結合的保護性抗體的蛋白質,所述流感HA蛋白選自H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白,和H18流感病毒HA蛋白。本發明的一個實施方案是引發針對選自下組的病毒的抗體的納米顆粒:流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1)and B/布裡斯班/60/2008(2008Bris,B)。本發明的一個實施方案是引發結合蛋白質的抗體的納米顆粒,所述蛋白質包含與選自下組的序列至少80%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。本發明的一個實施方案是引發結合蛋白質的抗體的納米顆粒,所述蛋白質包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的胺基酸序列。

由本發明的蛋白質引發的保護性抗體可通過影響病毒生命周期中的任何步驟來提供保護免受病毒感染。例如,保護性抗體可以防止流感病毒附著於細胞,進入細胞,將病毒核糖核蛋白釋放到細胞質中,在受感染的細胞中形成新的病毒顆粒並從受感染的宿主細胞膜出芽新的病毒顆粒。在一個實施方案中,由本發明的蛋白質引發的保護性抗體防止流感病毒進入宿主細胞。在一個實施方案中,由本發明的蛋白質引發的保護性抗體防止病毒膜與內體膜的融合。在一個實施方案中,由本發明的蛋白質引發的保護性抗體防止核糖核蛋白釋放到宿主細胞的細胞質中。在一個實施方案中,由本發明的蛋白質引發的保護性抗體防止新病毒在感染的宿主細胞中的裝配。在一個實施方案中,由本發明的蛋白質引發的保護性抗體防止新形成的病毒從感染的宿主細胞釋放。

因為流感病毒的莖區的胺基酸序列是高度保守的,所以由本發明的納米顆粒引發的保護性抗體可以是廣泛保護性的。也就是說,本發明的納米顆粒引發的保護性抗體可以針對多於一種類型,亞型和/或毒株的流感病毒提供保護。因此,本發明的一個實施方案是引發結合流感HA蛋白莖區的廣泛保護性抗體的蛋白質。一個實施方案是引發結合來自多於一種類型的流感病毒的HA蛋白的莖區的抗體的納米顆粒,所述流感病毒選自A型流感病毒,B型流感病毒和C型流感病毒。一個實施方案是引發結合來自多於一種亞型流感病毒的HA蛋白的莖區的抗體的納米顆粒,所述流感病毒選自H1流感病毒,H2流感病毒,H3流感病毒,H4流感病毒,H5流感病毒,H6流感病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒和H18流感病毒。一個實施方案是引發結合來自超過流感病毒株的HA蛋白的莖區的抗體的納米顆粒。本發明的一個實施方案是引發結合超過一種蛋白質的抗體的納米顆粒,所述蛋白質包含與選自下組的序列至少80%相同的胺基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。本發明的一個實施方案是引發結合多於一種蛋白質的抗體的納米顆粒,所述蛋白質包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的胺基酸序列。

因為本發明的納米顆粒可以引發對流感病毒的免疫應答,所以它們可用作保護個體免受流感病毒感染的疫苗。因此,本發明的一個實施方案是包含本發明的納米顆粒的疫苗。本發明的疫苗還可以含有其它成分,如佐劑,緩衝液等。儘管可以使用任何佐劑,但優選的實施方案可以含有:化學佐劑,如磷酸鋁,benzyalkonium chloride,烏苯美司(ubenimex)和QS21;遺傳佐劑如IL-2基因或其片段,粒細胞巨噬細胞集落刺激因子(GM-CSF)基因或其片段,IL-18基因或其片段,趨化因子(CC基序)配體21(CCL21)基因或其片段,IL-6基因或其片段,CpG,LPS,TLR激動劑和其它免疫刺激基因;蛋白質佐劑如IL-2或其片段,粒細胞巨噬細胞集落刺激因子(GM-CSF)或其片段,IL-18或其片段,趨化因子(CC基序)配體21(CCL21)或其片段,IL-6或其片段,CpG,LPS,TLR激動劑和其它免疫刺激性細胞因子或其片段;脂質佐劑如陽離子脂質體,N3(陽離子脂質),單磷醯脂質A(MPL1);其它佐劑,包括霍亂毒素,腸毒素,Fms樣酪氨酸激酶-3配體(Flt-3L),布比卡因(bupivacaine),丁哌卡因(marcaine)和左旋咪唑。

本發明的一個實施方案是包含多於一種流感HA蛋白的納米顆粒疫苗。此類疫苗可以包括在單個納米顆粒上或作為納米顆粒混合物的不同流感HA蛋白的組合,其中至少兩種具有獨特的流感HA蛋白。多價疫苗可包含與必要一樣多的流感HA蛋白,以便導致提供保護免於期望的病毒毒株寬度必需的免疫應答的產生。在一個實施方案中,疫苗包含來自至少兩種不同流感株(二價)的HA蛋白。在一個實施方案中,疫苗包含來自至少三種不同流感株(三價)的HA蛋白。在一個實施方案中,疫苗包含來自至少四種不同流感株(四價)的HA蛋白。在一個實施方案中,疫苗包含來自至少五種不同流感株(五價)的HA蛋白。在一個實施方案中,疫苗包含來自至少六種不同流感病毒株(六價)的HA蛋白。在各種實施方案中,疫苗包含來自7、8、9或10種不同流感病毒株之每種的HA蛋白。此類組合的實例是包含流感A組1HA蛋白,流感A組2HA蛋白,和流感B HA蛋白的納米顆粒疫苗。在一個實施方案中,流感HA蛋白是H1HA,H3HA和B HA。在一個實施方案中,流感HA蛋白是包括在2011-2012流感疫苗中的那些。多價疫苗的另一個實例是包含來自四種不同流感病毒的HA蛋白的納米顆粒疫苗。在一個實施方案中,多價疫苗包含來自流感A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1)和B/布裡斯班/60/2008(2008Bris,B)的HA蛋白。

本發明的一個實施方案是針對流感病毒對個體接種疫苗的方法,所述方法包括向個體施用納米顆粒,使得在個體中產生針對流感病毒的免疫應答,其中所述納米顆粒包含連接到流感HA蛋白的單體亞基蛋白,並且其中所述納米顆粒在其表面上展示所述流感HA。在一個實施方案中,納米顆粒是單價納米顆粒。在一個實施方案中,納米顆粒是多價納米顆粒。本發明的另一個實施方案是針對流感病毒感染對個體接種疫苗的方法,所述方法包括:

a)獲得包含單體亞基的納米顆粒,其中所述單體亞基與流感血凝素蛋白連接,並且其中所述納米顆粒在其表面上展示流感HA;並且,

b)將納米顆粒施用於個體,使得產生針對流感病毒的免疫應答。

本發明的一個實施方案是針對流感病毒對個體接種疫苗的方法,所述方法包括向個體施用實施方案的疫苗,使得在個體中產生針對流感病毒的免疫應答,其中所述疫苗包含至少一種納米顆粒,其包含與流感HA蛋白連接的單體亞基,並且其中所述納米顆粒在其表面上展示流感HA。在一個實施方案中,疫苗是單價疫苗。在一個實施方案中,疫苗是多價疫苗。本發明的另一個實施方案是針對流感病毒感染對個體接種疫苗的方法,所述方法包括:

a)獲得包含至少一種包含本發明的蛋白質構建體的納米顆粒的疫苗,其中所述蛋白質構建體包含與流感HA蛋白連接的單體亞基蛋白,並且其中所述納米顆粒在其表面上展示流感HA;並且,

b)將所述疫苗施用於個體,使得產生針對流感病毒的免疫應答。

在一個實施方案中,納米顆粒是單價納米顆粒。在一個實施方案中,納米顆粒是多價納米顆粒。

在一個實施方案中,納米顆粒具有八面體對稱。在一個實施方案中,流感HA蛋白能夠引發針對流感病毒的抗體。在一個實施方案中,流感HA蛋白能夠廣泛引發針對流感病毒的抗體。在優選的實施方案中,引發的抗體是保護性抗體。在優選的實施方案中,引發的抗體是廣泛異亞型保護性的。

本發明的疫苗可用於使用初免/加強方案對個體接種疫苗。此類方案在美國專利公開號20110177122中描述,其通過引用整體併入本文。在此類方案中,可以向個體施用第一疫苗組合物(初次),然後在一段時間後,可以向個體施用第二疫苗組合物(加強)。施用加強組合物通常是在施用引發組合物後數周或數月,優選約2-3周或4周,或8周,或16周,或20周,或24周,或28周,或32周。在一個實施方案中,配製加強組合物,用於在施用引發組合物後約1周,或2周,或3周,或4周,或5周,或6周,或7周,或8周,或9周,或16周,或20周,或24周,或28周,或32周施用。

第一和第二疫苗組合物可以是,但不需要是相同的組合物。因此,在本發明的一個實施方案中,施用疫苗的步驟包括施用第一疫苗組合物,然後在稍後時間施用第二疫苗組合物。在一個實施方案中,第一疫苗組合物包含本發明的納米顆粒。在一個實施方案中,第一疫苗組合物包含納米顆粒,其包含來自流感病毒的HA蛋白的胺基酸序列,所述流感病毒選自A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅裡達/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布裡斯班/59/2007(2007Bris,H1),B/布裡斯班/60/2008(2008Bris,B)。

在一個實施方案中,接種疫苗的個體已暴露於流感病毒。如本文所使用的,術語暴露的,暴露等指示受試者已經與已知感染流感病毒的動物對象接觸。可以使用本領域技術人員熟知的技術來施用本發明的疫苗。用於配製和施用的技術可以在例如「Remington’s Pharmaceutical Sciences」,18th ed.,1990,Mack Publishing Co.,Easton,PA中找到。疫苗可通過包括但不限於傳統注射器,無針注射裝置或微粒轟擊基因槍的手段施用。合適的施用途徑包括但不限於腸胃外遞送,如肌內,皮內,皮下,髓內注射以及鞘內,直接心室內,靜脈內,腹膜內,鼻內或眼內注射,僅舉幾個例子。對於注射,本發明的一個實施方案的化合物可以配製在水溶液中,優選在生理上相容的緩衝液如Hanks溶液,林格氏溶液或生理鹽水緩衝液中配製。

在一個實施方案中,本發明的疫苗或納米顆粒可用於保護個體免受異源流感病毒的感染。也就是說,使用來自流感病毒的一種毒株的HA蛋白製備的疫苗能夠保護個體免受不同流感病毒株的感染。例如,使用來自流感A/新喀裡多尼亞/20/1999(1999NC,H1)的HA蛋白製備的疫苗可以用於保護個體免受流感病毒感染,所述流感病毒包括但不限於A/新喀裡多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布裡斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005indo,H5),A/珀斯/16/2009(2009Per,H3),和/或A/布裡斯班/59/2007(2007Bris,H1)。

在一個實施方案中,本發明的疫苗或納米顆粒可用於保護個體免於抗原性趨異的流感病毒的感染。抗原性趨異的是指流感病毒株隨時間突變的趨勢,從而改變展示給免疫系統的胺基酸。此類隨時間的突變也稱為抗原漂移。因此,例如,使用來自A/新喀裡多尼亞/20/1999(1999NC,H1)流感病毒株的HA蛋白製備的疫苗能夠保護個體免受早期的,抗原性趨異的新喀裡多尼亞流感毒株的感染,和未來的進化(或趨異)流感毒株。

因為本發明的納米顆粒展示與完整HA在抗原性上相似的HA蛋白,所以它們可用於檢測針對流感病毒的抗體(抗流感抗體)的測定法中。

因此,本發明的一個實施方案是使用本發明的納米顆粒檢測抗流感病毒抗體的方法。本發明的檢測方法通常可以通過以下步驟實現:

a.使測試抗流感抗體的存在的樣品的至少部分與本發明的納米顆粒接觸;並且,

b.檢測納米顆粒/抗體複合物的存在;

其中納米顆粒/抗體複合物的存在指示樣品含有抗流感抗體。

在本發明的一個實施方案中,從待測試抗流感病毒抗體的存在的個體獲得或收集樣品。個體可以是或不是懷疑具有抗流感抗體或已經暴露於流感病毒的。樣品是從個體獲得的任何樣本,其可用於測試抗流感病毒抗體的存在。優選的樣品是可用於檢測抗流感病毒抗體的存在的體液。可用於實施本發明方法的體液的實例包括但不限於血液,血漿,血清,淚液和唾液。本領域技術人員可以容易地鑑定適於實施所公開的方法的樣品。

血液或血液衍生的液體如血漿,血清等特別適合作為樣品。可以使用本領域已知的方法從個體收集和製備此類樣品。樣品可以在測定前冷藏或冷凍。

本發明的任何納米顆粒可用於實施所公開的方法,只要納米顆粒結合抗流感病毒抗體。有用的納米顆粒及其製備方法已在本文中詳細描述。在優選的實施方案中,納米顆粒包含蛋白質構建體,其中蛋白質構建體包含連接到(融合到)來自流感HA蛋白的至少一個表位的來自單體亞基蛋白的至少25個,至少50個,至少75個,至少100個或至少150個連續胺基酸,使得納米顆粒在其表面上包含流感病毒HA蛋白表位的三聚體,並且其中蛋白質構建體能夠自組裝成納米顆粒。

如本文中使用的,術語接觸是指將測試抗流感抗體存在的樣品引入本發明的納米顆粒,例如通過組合或混合樣品和本發明的納米顆粒,使得納米顆粒能夠與樣品中的抗體(如果存在的話)物理接觸。當抗流感病毒抗體存在於樣品中時,然後形成抗體/納米顆粒複合物。此類複合物形成是指抗流感病毒抗體選擇性結合納米顆粒中蛋白質構建體的HA部分以形成可檢測的穩定複合物的能力。樣品中抗流感病毒抗體與納米顆粒的結合在適合形成複合物的條件下完成。此類條件(例如,適當的濃度,緩衝液,溫度,反應時間)以及優化此類條件的方法是本領域技術人員已知的。結合可以使用本領域標準的多種方法測量,包括但不限於凝集測定法,沉澱測定法,酶免疫測定法(例如ELISA),免疫沉澱測定法,免疫印跡測定法和其它免疫測定法,如例如記載於Sambrook et al.,Molecular Cloning:A Laboratory Manual,(Cold Spring Harbor Labs Press,1989)和Harlow et al.,Antibodies,a Laboratory Manual(Cold Spring Harbor Labs Press,1988),兩者均通過引用整體併入本文。這些參考文獻還提供了複合物形成條件的實例。

如本文所使用的,短語選擇性地結合HA,選擇性結合HA等,是指與結合與HA無關的蛋白質,或樣品或測定法中的非蛋白質組分形成對比,抗體優先結合HA蛋白的能力。選擇性結合HA的抗體是結合HA但不顯著結合可能存在於樣品或測定法中的其它分子或組分的抗體。顯著的結合被認為是例如抗HA抗體與非HA分子的結合,以大到足以幹擾測定法檢測和/或測定樣品中的抗流感抗體的水平的能力的親和力或親合力。可存在於樣品或測定法中的其它分子和化合物的實例包括但不限於非HA蛋白,例如白蛋白,脂質和碳水化合物。

在一個實施方案中,可以在溶液中形成抗流感病毒抗體/納米顆粒複合物(本文中也稱為抗體/納米顆粒複合物)。在一個實施方案中,可以形成抗體/納米顆粒複合物,其中納米顆粒固定在(例如,塗覆到)基底上。固定化技術是本領域技術人員已知的。合適的基底材料包括但不限於塑料,玻璃,凝膠,賽璐珞(celluloid),織物,紙和顆粒材料。基底材料的實例包括但不限於膠乳,聚苯乙烯,尼龍,硝化纖維素,瓊脂糖,棉,PVDF(聚偏氟乙烯)和磁性樹脂。用於基底材料的合適形狀包括但不限於孔(例如,微量滴定盤孔),微量滴定板,浸漬片,條,珠,側流裝置,膜,過濾器,管,盤,賽璐珞型基質,磁性顆粒和其它顆粒。特別優選的底物包括例如ELISA板,浸漬片,免疫斑點條,放射免疫測定板,瓊脂糖珠,塑料珠,乳膠珠,棉線,塑料晶片,免疫印跡膜,免疫印跡紙和流通膜。在一個實施方案中,基底,如顆粒,可以包括可檢測標記物。對於基底材料的實例的描述,參見例如Kemeny,D.M.(1991)APractical Guide to ELISA,Pergamon Press,Elmsford,NY pp 33-44,以及Price,C.and Newman,D.eds.Principles and Practice of Immunoassay,2nd edition(1997)Stockton Press,NY,NY,兩者通過引用整體併入本文。

根據本發明,一旦形成,就檢測抗流感病毒抗體/納米顆粒複合物。檢測可以是定性,定量或半定量的。如本文所使用的,短語檢測複合物形成,檢測複合物等是指鑑定與納米顆粒複合的抗流感病毒抗體的存在。如果形成複合物,則可以但不需要量化形成的複合物的量。假定的抗流感病毒抗體和納米顆粒之間的複合物形成或選擇性結合可以使用本領域標準的多種方法測量(即檢測,測定)(參見例如Sambrook等人,同上),其實例在本文中公開。可以以多種方式檢測複合物,包括但不限於使用一種或多種下列測定法:血凝抑制測定法,徑向擴散測定法,酶聯免疫測定法,競爭性酶聯免疫測定法,放射免疫測定法,螢光免疫測定法,化學發光測定法,側向流測定法,流通測定法,基於顆粒的測定法(例如,使用顆粒,如但不限於磁性顆粒或塑料聚合物,如膠乳或聚苯乙烯珠),免疫沉澱測定法,BioCoreJ測定法(例如,使用膠體金),免疫斑點測定法(例如,CMG=s免疫印跡系統(s Immunodot System),Friborg,Switzerland)和免疫印跡測定法(例如,western印跡),磷光測定法,流通測定法,層析測定法,基於PAGe的測定,表面等離振子共振測定法,分光光度測定法和電子感覺測定。此類測定法是本領域技術人員公知的。

測定法可用於根據其使用方式給出定性或定量結果。可以通過目視(例如,通過眼或通過機器,如密度計或分光光度計)觀察到一些測定法,如凝集,顆粒分離和沉澱測定法,而不需要可檢測的標記物。

在其它測定法中,可檢測標記物與納米顆粒或與選擇性結合納米顆粒的試劑的綴合(即,附著)有助於檢測複合物形成。可檢測標記物可以在不幹擾納米顆粒結合抗流感病毒抗體的能力的位點與納米顆粒或納米顆粒結合試劑綴合。綴合方法是本領域技術人員已知的。可檢測標記物的實例包括但不限於放射性標記物,螢光標記物,化學發光標記物,發色標記物,酶標記物,磷光標記物,電子標記物;金屬溶膠標記物,有色珠,物理標記物或配體。配體是指與另一分子選擇性結合的分子。優選的可檢測標記物包括但不限於螢光素,放射性同位素,磷酸酶(例如鹼性磷酸酶),生物素,抗生物素蛋白,過氧化物酶(例如辣根過氧化物酶),β-半乳糖苷酶和生物素相關化合物或抗生物素蛋白相關化合物(例如鏈黴抗生物素蛋白或ImmunoPure7NeutrAvidin)。

在一個實施方案中,可以通過使樣品與結合抗流感抗體,鐵蛋白或與抗體/納米顆粒複合物的特異性化合物(如抗體)接觸來檢測抗體/納米顆粒複合物,所述特異性化合物與可檢測標記物綴合。可檢測標記物可以以不阻斷化合物結合所檢測的複合物的能力的方式與特定化合物綴合。優選的可檢測標記物包括但不限於螢光素,放射性同位素,磷酸酶(例如鹼性磷酸酶),生物素,抗生物素蛋白,過氧化物酶(例如辣根過氧化物酶),β-半乳糖苷酶和生物素相關化合物或抗生物素蛋白相關化合物(例如鏈黴抗生物素蛋白或ImmunoPure7NeutrAvidin)。

在另一個實施方案中,通過使複合物與指示劑分子接觸來檢測複合物。合適的指示劑分子包括可以結合抗流感病毒抗體/納米顆粒複合物,抗流感病毒抗體或納米顆粒的分子。因此,指示劑分子可以包括例如結合抗流感病毒抗體的試劑,如識別免疫球蛋白的抗體。作為抗體的優選指示劑分子包括例如與來自其中產生抗流感病毒抗體的個體物種的抗體反應的抗體。指示劑分子本身可以附著到本發明的可檢測標誌物。例如,抗體可以與生物素,辣根過氧化物酶,鹼性磷酸酶或螢光素綴合。

本發明還可以包含能夠檢測指示劑分子存在的二級分子或其它結合分子的一個或多個層和/或類型。例如,選擇性結合指示劑分子的無標籤的(即,不與可檢測標誌物綴合的)二抗可以與選擇性結合二抗的有標籤的(即,與可檢測標誌物綴合的)三抗結合。合適的二抗,三抗和其它二級或三級分子可以容易地由本領域技術人員選擇。優選的三級分子也可以由本領域技術人員基於第二分子的特性來選擇。相同的策略可以應用於後續層。

優選地,指示劑分子與可檢測標誌物綴合。如果需要的話,加入顯影劑,並將底物送到檢測裝置進行分析。在一些方案中,在一個或兩個複合物形成步驟之後加入清洗步驟以除去過量的試劑。如果使用這些步驟,則它們牽涉本領域技術人員已知的條件,使得除去過量的試劑,但保留複合物。

因為本發明的測定法可以檢測樣品(包括血液樣品)中的抗流感病毒抗體,所以此類測定法可用於鑑定具有抗流感抗體的個體。因此,本發明的一個實施方案是鑑定具有抗流感病毒抗體的個體的方法,所述方法包括:

a.使來自測試抗流感抗體的個體的樣品與本發明的納米顆粒接觸;和,

b.分析接觸的樣品的納米顆粒/抗體複合物的存在,

其中納米顆粒/抗體複合物的存在指示所述個體具有抗流感抗體。

任何公開的測定形式可用於進行所公開的方法。有用的測定形式的實例包括但不限於徑向擴散測定法,酶聯免疫測定法,競爭性酶聯免疫測定法,放射免疫測定法,螢光免疫測定法,化學發光測定法,側向流測定法,通過測定法,基於顆粒的測定法(例如,使用顆粒,如但不限於磁性顆粒或塑料聚合物,如膠乳或聚苯乙烯珠),免疫沉澱測定法,BioCoreJ測定(例如,使用膠體金),免疫印跡測定法(例如,CMG=s免疫印跡系統,Fribourg,Switzerland)和免疫印跡測定法(例如,western印跡),磷光測定法,流通測定法,層析測定法,基於PAGe的測定法,表面等離振子共振測定法,生物層幹涉測定法,分光光度測定法和電子感覺測定法。

如果在樣品中沒有檢測到抗流感抗體,則此類結果指示個體不具有抗流感病毒抗體。測試的個體可以是或不是懷疑具有針對流感病毒的抗體的。所公開的方法還可以用於確定個體是否已經暴露於流感病毒的一種或多種特定類型,組,亞組或毒株。為了進行此類測定,從個體獲得樣品,所述個體在其過去(例如,大於約1年,大於約2年,大於約3年,大於約4年,大於約5年等)的某個時候在針對流感病毒的一種或多種特定類型,組,亞組或毒株的抗體測試呈陰性(即,缺少抗體)。然後使用本發明的基於納米顆粒的測定法測試樣品的針對流感病毒的一種或多種類型,組,亞組或毒株的抗流感病毒抗體的存在。如果測定法指示存在此類抗體,則在鑑定它們為抗流感抗體陰性的測試後的某個時候將個體鑑定為已經暴露於流感病毒的一種或多種類型,組亞組或毒株。因此,本發明的一個實施方案是鑑定已暴露於流感病毒的個體的方法,所述方法包括:

a.使來自正在測試抗流感抗體的個體的樣品的至少部分與本發明的納米顆粒接觸;和,

b.分析接觸的樣品的抗體/納米顆粒複合物的存在或水平,其中抗體/納米顆粒複合物的存在或水平指示最近的抗流感抗體的存在或水平;

c.將最近的抗流感抗體水平與過去的抗流感抗體水平進行比較;

其中最近的抗流感抗體水平相對於過去的抗流感抗體水平的增加指示個體在確定過去的抗流感抗體水平之後已經暴露於流感病毒。

本發明的方法還可用於確定個體對疫苗的響應。因此,一個實施方案是用於測量個體對流感疫苗的響應的方法,所述方法包括:

a.向個體施用流感病毒疫苗;

b.使來自所述個體的樣品的至少部分與本發明的納米顆粒接觸;

c.分析接觸的樣品的抗體/納米顆粒複合物的存在或水平,其中抗體/納米顆粒複合物的存在或水平指示最近的抗流感抗體的存在或水平

其中所述樣品中抗體的水平相對於所述個體中抗體的疫苗接種前水平的增加指示疫苗在所述個體中誘導免疫應答。

施用於個體的流感疫苗可以但不需要包含本發明的疫苗,只要納米顆粒包含可以結合由施用的疫苗誘導的抗流感抗體的HA蛋白。施用流感疫苗的方法是本領域技術人員已知的。

可以使用任何公開的測定形式進行對從個體獲得的樣品的分析。在一個實施方案中,使用選自以下的測定形式進行樣品的分析:徑向擴散測定法,酶聯免疫測定法,競爭性酶聯免疫測定法,放射免疫測定法,螢光免疫測定法,化學發光測定法,側向流測定法,流通測定法,基於顆粒的測定法(例如,使用顆粒,如但不限於磁性顆粒或塑料聚合物,如膠乳或聚苯乙烯珠),免疫沉澱測定法,BioCoreJ測定法(例如,使用膠體金),免疫斑點測定法(例如,CMG=s免疫印跡系統,Fribourg,Switzerland)和免疫印跡測定法(例如,western印跡),磷光測定法,流通測定法,層析測定法,基於PAGE的測定法,表面等離振子共振測定法,生物層幹涉測定測定法,分光光度測定法和電子感覺測定法。

在一個實施方案中,所述方法包括在施用疫苗之前測定個體中存在的抗流感抗體的水平的步驟。然而,如果此類信息可用,則也可以從先前的醫學記錄確定個體中存在的抗流感抗體的水平。

雖然不必實施所公開的方法,但優選在施用疫苗的步驟和確定個體中抗流感抗體的水平的步驟之間等待一段時間。在一個實施方案中,對個體中存在的抗流感抗體的水平的測定在使用疫苗後的至少1天,至少2天,至少3天,至少4天,至少5天,至少6天,至少1周,至少2周,至少3周,至少4周,至少2個月,至少3個月或至少6個月實施。

本發明還包括適用於檢測抗流感抗體的試劑盒。合適的檢測手段包括利用本發明的納米顆粒的本文公開的技術。試劑盒還可以包含可檢測標誌物,如選擇性結合納米顆粒的抗體或其它指示劑分子的抗體。試劑盒還可以包含相關聯的組分,如但不限於緩衝液,標記物,容器,插頁,管,小瓶,注射器等。

實施例

提出以下實施例以便向本領域普通技術人員提供如何製備和使用實施方案的完整公開和描述,並且不旨在限制發明人認為是其發明的範圍,它們也不意圖表示下面的實驗是所進行的全部或唯一的實驗。已經做出努力以確保關於使用的數字(例如量,溫度等)的準確性,但是應該考慮一些實驗誤差和偏差。除非另有指示,份數是重量份,分子量是重量平均分子量,並且溫度以攝氏度計。使用標準縮寫。

實施例1:HA穩定化莖(HA-SS)構建體的基於結構的迭代設計

該實施例顯示用於產生缺乏免疫顯性頭部結構域的HA穩定化莖(HA-SS)免疫原的基於結構的設計的六個迭代循環(Gen1-Gen6)。

流感A病毒包含18種HA亞型,其中兩種H1和H3目前導致大多數人類感染。季節性流感疫苗針對循環H1和H3株提供了一些保護,但很少提供針對趨異的H5,H7和H9亞型的保護,其導致人類感染的偶然暴發,作為來自禽類和/或豬庫的人畜共患病。本發明人假設聚焦於保守血凝素(HA)莖的免疫應答可能潛在地引發針對多種多樣毒株的廣泛的異亞型流感保護。因此,本發明人使用基於結構的迭代設計來開發缺乏免疫顯性HA頭部區的HA穩定化莖(HA-SS)糖蛋白(圖1)。

A/新喀裡多尼亞/20/1999(1999NC)HA的胞外域序列和A/南卡羅來納/1/1918(1918SC)的晶體結構(PDB ID 1GBN)用作設計模板,並且對每代HA-SS變體評估作為可溶性三聚體的表達,以及基於與野生型(wt)HA三聚體相似的莖特異性單克隆抗體(mAb)反應性評估抗原性。

使用人優選密碼子合成編碼來自1999NC,1986SG,2009CA,H2 2005CAN,H5 2005IND和H5 2004VN的全長HA和神經氨酸酶(NA)的質粒。通過重疊PCR和定點誘變產生不同形式的HA-SS。在freestyle 293(293F;Life Technologies)細胞或293GnTI-/-細胞(用於Gen4HA-SS結晶)中表達所有HA,HA-SS蛋白和mAb,並如前所述進行純化(Wei,C.J.,et al.Elicitation of broadly neutralizing influenza antibodies in animals with previous influenza exposure.Sci.Transl.Med.4,147ra114(2012))。如所述(Kanekiyo,M.,et al.Nature 499,102-106(2013))進行HA-np和Gen1-Gen6HA-SS和Gen4-6HA-SS-np的構建,純化和表徵。

第一代設計(Gen1HA-SS)用GSG接頭替換受體結合結構域(殘基HA1 51-277,H3編號)(圖1)。各自產生HA胞外域三聚體和所有三聚體HA-SS設計,C-末端跨膜和胞質殘基HA2 175-220(H3編號)替換為短接頭,T4摺疊物,凝血酶切割位點和His標籤。使HA1/HA2切割位點突變以防止切割。為了模擬HA-SS設計的結構,使用1918SC HA(PDB ID 1GBN)和噬菌體T4摺疊物三聚體(PDB ID 1RFO)作為模板,使用LOOPY(Xiang,et.al.Proc.Natl.Acad.Sci.U.S.A.99,7432-7437(2002))設計環和連接,使用SCAP(Xiang,et al.,J.Mol.Biol.311,421-430(2001))突變側鏈,並且使用LSQMAN(Kleywegt,et al.,in International Tables for Crystallography,Vol.F,353-367(Kluwer Academic Publishers,Dordrecht,The Netherlands,2001))實施結構重疊。使用Rosetta程序DDG_MONOMER(Kellogg,et al.,Proteins 79,830-838(2011))計算地評估特定突變的力能學(energetics)。使用Chimera(Pettersen,E.F.,et al.Journal of Computational Chemistry 25,1605-1612(2004))進行表面積計算。檢查蛋白質資料庫(PDB)中約700個三聚體結構,以找到合適的三聚化結構域,以進一步穩定化HA-SS免疫原。該搜索揭示了HIV-1gp41(PDB ID 1SZT)針對以下待被優化(i)其大小(每個單體小於70個胺基酸),(ii)其熱穩定性(Tm=70℃),(iii)容易移植,其中N-和C-末端位於三聚體的相同末端,和(iv)gp41的內部七價重複1(inner heptad repeat 1,HR1)螺旋的C-端末端與HA-SS三聚體的內部C螺旋之間的結構互補性。Gen1HA-SS不能表達為三聚體,儘管存在C末端摺疊物三聚化結構域。

為了增加第二代中的三聚體穩定性,本發明人將HA-SS的膜遠端區域處的HA2殘基66-85替換為熱穩定性HIV-1gp41三聚化結構域(參見Tan,et al.,Proc.Natl.Acad.Sci.U.S.A.94,12303-12308(1997)),其中內部七價重複1(HR1)螺旋在結構上與HA莖的內部C螺旋互補。連接gp41和HA-SS必需循環排列gp41螺旋HR1和HR2,其順序是顛倒的並用富含甘氨酸的接頭重新連接(圖1)。為了將HIV-1gp41的融合後形式的六螺旋束插入Gen2HA-SS中,將來自gp41的三個內部螺旋的殘基28-32(殘基573-577,HXBc2編號)疊加到HA內螺旋殘基HA2 81-85(來自PDB ID 1RU7)上,對於15個Cα原子具有的均方根偏差(RMSD)。HA2殘基66-85被gp41七價重複(HR)2螺旋(殘基628-654,HXBc2編號)替換,隨後是含有N-連接的糖基化位點的序列子的六殘基富含甘氨酸的接頭(NGTGGG)和gp41HR1螺旋(殘基548-577)。HR1設計成與HA2的螺旋C符合讀碼框,以產生長的中心嵌合螺旋。通過加入鹽橋,縮短環和降低其疏水性來穩定F』區的膜遠端部分的努力沒有改善Gen2HA-SS設計的三聚化或抗原性。Gen2HA-SS的表達導致29%的三聚化。

為了改善第三代中的三聚化,除去了具有不規則二級結構的HA1F』區的44個殘基的部分,並且HA-SS的內部螺旋C被截短了6個殘基,以在gp41和HA2之間具有更好的互補性。這導致具有77%三聚化的可溶性Gen3HA-SS,其被具有與可溶性HA三聚體(圖1)的親和力總體類似的親和力的HA莖廣泛中和性mAb(bNAb)識別。在Gen3中,用GWG接頭替換F』區的HA-SS HA2殘基43-50和278-313,並除去HA2殘基60-65和86-92。為了使gp41與HA莖的下部區域重比對,將來自gp41的三個內部螺旋的殘基30-34(575-579Hxbc2編號)疊加到HA內部螺旋殘基HA2 90-94上,對於15Cα原子具有的RMSD。對於CR6261和70-5B03觀察到更快的解離速率,這可能部分是由於可以與CR6261重鏈有限接觸的HAF』區域的喪失。

為了在原子水平表徵Gen3HA-SS,本發明人以解析度測定了與鼠bNAb C179的抗原結合片段(Fab)複合的Gen3HA-SS的晶體結構(參見Okuno,Y.,et al.J.Virol.67,2552-2558(1993))(圖2a,左圖);C179抗體是用異亞型中和發現的第一種廣泛中和性HA莖定向抗體。

將從雜交瘤細胞收穫的C179切割成Fab,如先前所述(Ofek,G.,et al.J.Virol.78,10724-10737(2004)),其中具有以下修改:LysC(Roche)與C179以1:20,000(w/w)比率使用,並且經由通過在50mM Tris pH 8.0中的巰基-乙基-吡啶柱(Pall Life Sciences)從消化溶液中除去可結晶片段(Fc),並且用50mM NaAc pH 5.0洗脫C179Fab。

通過使1:1.25(Gen3HA-SS/C179摩爾比)混合物通過Superdex 200 26/60(GE Healthcare)凝膠過濾柱,並且收集在152.0mL洗脫的峰獲得Gen3HA-SS(在293GnTI-/-細胞中表達)與C179Fab的複合物。將複合物在150mM NaCl,10mM Tris HCl pH7.5中濃縮至10mg/ml,並且通過在15%(W/V)聚乙二醇1500,5%(V/V)2-甲基-2,4-戊二醇,200mM NH4Cl和100mM Tris HCl pH8.5中的懸滴蒸汽擴散(hanging drop vapor diffusion)在20℃結晶,這來源於沉澱劑協同結晶篩選(Majeed,S.,et al.Structure 11,1061-1070(2003))。在沒有任何另外的冷凍保護劑的情況下將晶體冷凍,並在數據收集之前貯存在液氮中。

在Advanced Photon Source(APS),阿貢國家實驗室(Argonne National Laboratory)的東南地區協作訪問團隊(Southeast Regional Collaborative Access Team,SER-CAT)22-BM束線處,使用的波長,在100K的溫度下收集X射線數據到解析度用HKL2000在三角空間群H3中處理X射線數據,並且通過使用五個單獨的搜索模型的分子替換來確定複合物的結構。使用PHASER(Mccoy,A.J.,et al.J.Appl.Crystallogr.40,658-674(2007)),與來自1934PR8結構的HA莖單體(PDB ID 1RU7,殘基5-36,315-323HA1鏈A和殘基514-559,590-660HA2鏈B),HIV-1gp41單體(PDB ID 1SZT,殘基3-29,42-67),鼠抗體S25-2的重鏈可變域(PDB ID 1Q9K,殘基1-111),和鼠抗體MN16C13F4的輕鏈可變域(PDB ID 1UWX,殘基3-108)一起搜索。使用MOLREP(Collaborative Computational Project.Acta Crystallogr.D Biol.Crystallogr.50,760-763(1994))來定位T4摺疊物單體(PDB ID1RFO,鏈A),其證實了手工進行的獨立擬合。通過眼將C179Fab恆定結構域擬合入Fo-Fc密度中,之後使用上述Ab(PDB ID 1Q9K和1UWX)的恆定結構域作為模板精修。使用COOT(Emsley,P.&Cowtan,K.Coot:D Biol.Crystallogr.60,2126-2132(2004))和PHENIX(Adams,P.D.,et al.Acta Crystallogr.D Biol.Crystallogr.58,1948-1954(2002))及搭乘氫(riding hydrogen)實施模型建立和精修。除了HA切割環(殘基48-52),連接gp41螺旋的富含甘氨酸的環(殘基139-144),連接HA-SS到摺疊物的接頭(殘基256-259)和摺疊物域C端的凝血酶切割位點和His標籤(殘基286-302)外,將Gen3HA-SS的所有殘基建模成電子密度。觀察到糖並建立在Asn殘基23、119和236上。C179結構包括重鏈殘基1-213和輕鏈殘基1-214。如由PHENIX測定的Ramachandran統計學揭示了有利區域中91.64%的殘基,允許區域中的7.49%和作為異常值的0.86%。

共晶體結構揭示Gen3HA-SS的C179識別類似於在最近公布的C179與A/日本/305/1957(1957JP)HA的共晶結構中識別H2N2三聚體HA的識別(參見Dreyfus,et al.,J.Virol.87,7149-7154(2013))(圖2a,右圖)。雖然這些發現證實了Gen3HA-SS上的莖表位的保留;整體結構揭示了幾個意想不到的差異(圖2a,左圖和中圖)。首先,莖三聚體亞基在其C末端相對於HA分開約(圖2a,中間圖)。第二,C-末端摺疊物三聚化結構域倒轉並且在莖三聚體內部疊入到張開區域中(圖2a,左圖)。最後,HA莖的外部螺旋A與gp41六螺旋束的外部HR2螺旋形成連續螺旋,而不是形成由甘氨酸接頭分開的兩個單獨的螺旋。

為了解決這些問題,創建了含有三個突變(圖1中概述)的第四代HA-SS,以努力除去潛在的側鏈碰撞並且破壞HA2的螺旋B與gp41HR2之間的連續螺旋(圖2b)。

為了結晶Gen4HA-SS/CR6261複合物,通過與內切糖苷酶H(77U/μg Gen4HA-SS)溫育4小時來使Gen4HA-SS(在293GnTI-/-細胞中表達)去糖基化,隨後通過刀豆蛋白A柱(Sigma)除去具有未切割的N-連接聚糖的蛋白質。通過使1:1.25(Gen4HA-SS/CR6261摩爾比)混合物通過Superdex 200 10/300(GE Healthcare)凝膠過濾柱並收集在12.5mL處洗脫的峰來獲得與CR6261Fab的複合物。將複合物在150mM NaCl,10mM Tris HCl pH7.5中濃縮至11mg/ml,並通過在7%(w/v)聚乙二醇4000,4.5%(v/v)異丙醇,100mM咪唑pH6.5中的懸滴蒸汽擴散在20℃結晶。將晶體在包含另外的5%(v/v)2R,3R丁二醇(Sigma)的貯存溶液中在室溫下浸泡6小時,然後簡短30秒轉移至含有15%2R,3R丁二醇的貯存溶液,之後快速冷卻。

在APS的SER-CAT BM-22束線處使用的波長在100K的溫度下收集X射線數據到解析度。用空間群H3中的HKL2000(參考文獻37)處理數據,並通過使用三個單獨的搜索模型的分子置換來確定複合物的結構。使用PHASER來與來自1934PR8結構的HA莖單體,HIV-1gp41單體(與上述相同模型)以及CR6261(PDB ID 3GBM)的可變和恆定結構域一起搜索。分別使用COOT和PHENIX進行模型建立和精製。除了HA切割環(殘基48-52),連接gp41螺旋的富含甘氨酸的環(殘基137-145)和C末端摺疊物(殘基256-259),摺疊結構域C端的凝血酶切割位點和His標籤(殘基286-302)外,將Gen4HA-SS的所有殘基建模為電子密度。儘管在Gen3HA-SS結構中觀察到的相同區域中的HA莖內部可見密度,但是它不足以唯一放置或穩定精製摺疊物結構域。CR6261Fab結構包括重鏈殘基1-213和輕鏈殘基3-107和113-215。如由PHENIX測定的Ramachandran統計學揭示有利區域中93.19%的殘基,允許區域中的6.09%和作為異常值的1.06%。

對於低溫電子顯微術分析,使用Vitrobot Mark IV(FEI Company,Hillsboro,OR)在多孔碳膜(Quantfoil,Germany)上將顆粒玻璃化。在Titan Krios電子顯微鏡(FEI公司,Hillsboro,OR)上收集顆粒的冷凍圖像,在液氮溫度下操作並在300kV下操作。在像素大小以範圍為約2.8至約6μm的散焦值,並且以範圍為約10至的劑量在4,096×4,096電荷耦合器件(CCD)照相機(Gatan Inc.,Warrendale,PA)上收集圖像。使用ctffind3(Mindell,J.A.&Grigorieff,N.J Struct Biol 142,334-347(2003))擬合觀察到的散焦值,並且將展示漂移或散光的圖像從進一步分析中排除。從圖像中手動挑選顆粒(13,464)。無參考2D分類指示在3D精修期間施加的八面體對稱。使用平滑,無刺突的低通濾過的鐵蛋白(PDB ID 2JD6)作為起始模型。在精化過程中除去重疊顆粒之後,從6,540個顆粒計算重建(3D圖)。用Relion包(Scheres,S.H.W.J.Mol.Biol.415,406-418(2012))進行所有圖像分析(2D和3D)。用Chimera進行模型坐標的可視化和分子停靠。

與C179複合的Gen3HA-SS和與CR6261複合的Gen4HA-SS複合物的原子坐標和結構因子分別保存在PDB代碼4MKD和4MKE下。H1-SS-np的冷凍電子顯微術圖已經以EMDB代碼EMD-6332保存。

與bNAb CR6261的Fab複合的Gen4HA-SS的解析度的共晶體結構(參見Ekiert,D.C.,et al.Science 324,246-251(2009))揭示了相對於gp41的展開仍然存在,額外旋轉約19°(圖2b,中間圖)。然而,在Gen4HA-SS中三聚化水平(83%),莖表位構象的保持和HA莖bNAb結合(對四種bNAb為nM)接近最佳(圖1a和2b)。

發明人關注免疫原性HIV-1gp41區域的牽連,因此尋求用短的富含甘氨酸的接頭替換gp41(圖1a),因為這還將增加HA莖在免疫原表面上的百分比(圖1b)。在兩種情況,Gen5HA-SS(其保留Gen4穩定化莖區)和Gen6HA-SS(其中包含Lys51-Glu103(HA2,H3編號)的內部鹽橋被替換為幾乎等排的Met-Leu疏水對)(Gen6HA-SS,圖1c)下進行gp41替換。

通過完全除去gp41三聚化結構域,將HA2殘基58-93與GSGGSG環連接並引入HA2突變Y94D和N95L來創建Gen5HA-SS。

為了設計Gen6HA-SS,最初創建了五個突變以穩定化HA莖HA2的內部核心:K51M,E103L,E105Q,R106W和D109L(稱為Gen6』HA-SS)。對所有三種免疫原保留通過HA莖抗體的三聚化和識別(圖1a)。包含三個另外的內部穩定化突變的Gen6HA-SS的中間形式(稱為Gen6』HA-SS)展示相似的抗原性(圖1d),但是最終觀察到突變E105Q,R106W和D109L不是穩定化Gen6HA-SS和與鐵蛋白融合需要的,並且不用於最終的H1-SS-np構建體(圖1c)。

實施例2:自組裝鐵蛋白納米顆粒的創建

該實施例描述了Gen4,Gen5,Gen6』和Gen6HA-SS通過它們各自的HA C末端與自組裝鐵蛋白納米顆粒的融合。

在自組裝納米顆粒(HA-np)的背景下,HA的免疫原性顯著增加(參見Kanekiyo,M.,et al.,Nature 499,102-106(2013))。此外,本發明人推測與納米顆粒的C-末端融合可以降低莖的近膜區域的張開。因此,本發明人將Gen4,Gen5,Gen6』和Gen6HA-SS通過它們各自的HA C-末端(替換摺疊物)遺傳融合到幽門螺桿菌的自組裝鐵蛋白納米顆粒以創建HA-SS-納米顆粒(HA-SS-np)。

用SGG接頭將Gen4-6HA-SS與幽門螺桿菌鐵蛋白N-末端(殘基5-167)融合以產生HA-SS鐵蛋白納米顆粒(Gen4HA-SS-np,H1-SS-np和H1-SS-np』),如描述的(Kanekiyo,M.,et al.Nature 499,102-106(2013))。

使用fortéBio Octet Red384儀器測量HA和HA-SS分子對mAb CR6261,CR9114,F10scFv和70-5B03的結合動力學。所有測定法在30℃下進行,在補充有1%BSA的PBS中設定為1,000rpm的攪拌,以使非特異性相互作用最小化。所有溶液的最終體積為100μl/孔。在固體黑色96孔板(Geiger Bio-One)中在30℃進行測定。使用在10mM乙酸鹽pH 5.0緩衝液中具有C-末端生物素化的Avi-Tag(25μg/ml)和HA-np或HA-SS-np的HA或HA-SS分別加載鏈黴抗生物素蛋白和胺反應性生物傳感器探針達300s。典型的捕獲水平在0.8和1nm之間,並且一排八個尖端內的變異性不超過0.1nm。將生物傳感器尖端在PBS/1%BSA緩衝液中平衡300s,之後進行溶液中的Fab或F10scFv(0.01至0.5μM)的結合測量。加入抗體後,使結合進行300s;然後使結合解離300s。僅使用解離孔一次以防止汙染。通過減去對於在PBS/1%BSA中溫育的裝載有HA或HA-SS分子的傳感器記錄的測量,進行平行校正以減去系統基線漂移。為了除去非特異性結合應答,將生物素化的gp120表面重修核心分子加載到鏈黴抗生物素蛋白探針上,並與抗莖抗體一起溫育,並從HA和HA-SS響應數據中減去非特異性應答。使用Octet軟體7.0版進行數據分析和曲線擬合。實驗數據用描述1:1相互作用的結合方程擬合。假設結合是可逆的(完全解離),使用非線性最小二乘法擬合進行完整數據集的全局分析,所述非線性最小二乘法擬合允許對於每個實驗中使用的所有濃度同時獲得單一組的結合參數。

如之前所述(Wei,C.J.,et al.Science 329:1060-1064(2010))進行ELISA,血凝抑制(HAI)測定法和假型中和測定法。如描述(Wei,C.J.,et al.Sci.Transl.Med.2,24ra21(2010)),產生表達螢光素酶報告基因的重組HA/NA慢病毒載體。所有流感病毒均獲自疾病控制和預防中心(CDC;Atlanta,GA)。

Gen4,Gen6和Gen6』HA-SS-np各自表示為納米顆粒,如通過透射電子顯微鏡分析和凝膠過濾證實的(圖2)。然而,Gen5HA-SS-np未能表達。選擇Gen6和Gen6』HA-SS-np進行進一步評估,並且在下文中在這些實施例中分別稱為H1-SS-np和H1-SS-np』。以解析度為實施的H1-SS-np的冷凍電子顯微術(EM)分析揭示了對稱的球形顆粒,每個顆粒具有從表面突出的八個刺突(圖2c)。值得注意的是,Gen6HA-SS莖的膜近端區域比Gen4HA-SS更好地適合於電子密度,這表明擴展是減輕的或不再存在(圖2c,左圖)。此外,H1-SS-np和H1-SS-np』都具有期望的抗原性,在ELISA和生物層幹涉測量法測量中被CR6261,CR9114,F10和70-5B03識別(參見Ekiert,D.C.,et al.Science 324,246-251(2009);Sui,J.,et al.Nat.Struct.Mol.Biol.16,265-273(2009);Dreyfus,C.,et al.Science 337,1343-1348(2012);Wrammert,J.,et al.J.Exp.Med.208,181-193(2011)),表明在與鐵蛋白融合後保留了真正的HA-SS結構(圖1a,1e和1f)。

實施例3:評估疫苗功效

該實施例證明了與HA構建體融合的鐵蛋白納米顆粒的疫苗功效的各種測量的表徵。

本發明人使用鈣流量測定法評估了與全長HA-np相比H1-SS-np通過膜錨定的種系恢復的CR6261B細胞受體(BCR)觸發信號傳導的能力(Novak,et.al.Cytometry 17,135-141(1994))。

對於BCR活化測定法,通過輕鏈和膜錨定的IgM重鏈對Ramos B細胞系的表面IgM陰性克隆的慢病毒轉染(FEEKW載體;Luo,X.M.,et al.Blood 113,1422-1431(2009))穩定表達種系CR6261BCR(野生型和雙重I53A/F54A突變體)。然後通過流式細胞術(BD FACSAria;BD Biosciences)分選種系CR6261BCR陽性細胞並擴增。評估對於種系CR6261BCR(野生型或I53A/F54A突變體)表達>95%陽性的細胞的表面表達和正確的HA抗原性。對於信號傳導,向表達種系CR6261BCR的1×106個Ramos B細胞呈現2500nM的H1-SS-np,HA np(HA含有Y98F突變以消除與唾液酸的非特異性結合)或空np。通過流式細胞術測量響應於BCR刺激的鈣流量的動力學,作為染料Fura Red的Ca2+結合/未結合狀態的比率。Ca2+流量的此比率在暴露於配體後10秒呈現。在刺激之前獲取30秒基線。對單個細胞的參比測量取平均值並通過動力學分析,FlowJo軟體變平滑。在暴露於0.5μg/μl抗人IgM F(ab』)2(Southern Biotech)後,通過Ca2+流量比較種系CR6261BCR對具有I53A/F54A突變的種系CR6261BCR之間的功能性。

與空鐵蛋白顆粒相反,H1-SS-np通過野生型BCR誘導有效的信號傳導,全長HA-np在較小程度上亦然,並且通過在第二個重鏈互補決定區(CDR H2)中的兩個關鍵接觸殘基中突變的BCR沒有觀察到信號傳導(圖1g)。這一發現證實了H1-SS-np銜接CR6261的IGHV1-69種系前體並通過CDR H2依賴性識別刺激未免疫的B細胞的能力,在人中發現的廣泛中和性莖定向抗體的特徵。

為了評估H1-SS-np疫苗功效,本發明人使用Sigma佐劑系統(SAS)免疫小鼠和雪貂,這是因為已報導類似於MF59(另一種被批准用於人的基於角鯊烯的佐劑),SAS誘導HA響應。

對於免疫研究,對於該研究進行總共三個動物實驗,兩個在小鼠中,一個在雪貂中。在第一次小鼠實驗中,在第0周和第2周時用2μg H1-SS-np,2μg空白鐵蛋白np,0.2μg H5 2005IND HA-np或TIV(HA摩爾當量)肌肉內免疫雌性BALB/c小鼠(6-8周齡,Jackson Laboratories)。在每次免疫後14天收集血液,並且分離血清。對於第二次小鼠免疫實驗,在第0周、第8周和第12周用3μg的H1-SS-np或空鐵蛋白np免疫雌性BALB/c小鼠三次。對於雪貂免疫,飼養使用6月齡雄性Fitch雪貂(Triple F Farms,Sayre,PA)(對於暴露於目前循環的大流行H1N1,季節性H1N1,H3N2和B流感毒株呈血清陰性),並在BIOQUAL,Inc.(Rockville,MD)護理。這些設施由美國實驗動物保護國際認可協會(American Association for the Accreditation of Laboratory Animal Care International)認可,並滿足NIH標準,如「實驗動物護理和使用指南(Guide for the Care and Use of Laboratory Animals)」中所述。在第0周和第4周,用在500μl PBS中的20μg H1-SS-np』或空鐵蛋白np或TIV(相當於2.5μg H1HA)肌內免疫雪貂。用250μg表達H5 2005IND的質粒DNA,隨後在第0周和第4周用H5N1 2005IND MIV的2.5μg HA免疫陽性對照組中的雪貂。通過肌內注射將疫苗施用到大腿上部肌肉中。Sigma佐劑系統(SAS,Sigma)用於所有蛋白質或基於np的免疫。每次免疫後14天收集血液,並且分離血清。動物實驗完全符合所有相關聯邦規定和NIH指南進行。

對於被動轉移研究,在第0周和第4周首先用H1-SS-np蛋白(2μg/劑量,具有SAS)接種150隻小鼠,以產生HA-SS免疫Ig,並在加強後第1周,第2周和第3周(末端)收集血清。使用製造商方案用蛋白G(Life Technologies)純化來自免疫血清的Ig。攻擊前24小時,兩組BALB/c小鼠(n=10/組,Taconic inc。)通過腹膜內途徑接受未免疫的(Molecular innovations)或免疫的Ig。在被動轉移後24小時從輸注的動物收集血清用於血清學分析。

對於病毒攻擊研究,從疾病控制和預防中心(Atlanta,GA)(CDC#2004706280,E1/E3(1/19/07)獲得H5N1毒株A/越南/1203/04,並且在BIOQUAL Inc.在10天齡的胚胎雞蛋(Charles River,North Franklin,CT)中擴充。攻擊原液具有1010TCID50/ml的感染滴度。對於血液收集,放血和攻擊程序,用配製為對每隻動物提供25mg/kg氯胺酮和0.001mg/kg右美託咪定劑量的氯胺酮/右美託咪定溶液麻醉動物。將小鼠用50μl病毒鼻內接種,每個鼻孔大約25μl,並且對雪貂鼻內接種500μl病毒,每個鼻孔約250μl。攻擊劑量為小鼠中的25LD50和雪貂中的1000TCID50。根據以前的研究,這些攻擊劑量預期分別導致未免疫的對照小鼠和雪貂中的100%致死率。對於雪貂,每天記錄感染的臨床體徵,體重和溫度兩次。如下分配活動得分:0,警惕和嬉戲;1,警醒但只在受刺激時嬉戲(playful);2,警惕,但刺激時不嬉戲;和3,既不警惕,在刺激時也不嬉戲。對顯示嚴重疾病體徵(延長的發燒,腹瀉,幹擾飲食,飲水或呼吸的流涕;嚴重嗜睡;或神經學體徵)或體重減輕>20%的雪貂立即實施安樂死。

H1-SS-np和H1-SS-np』分別引發在小鼠和雪貂兩者中針對組1HA亞型(季節性和大流行H1,H2,H5和H9)的廣泛抗體響應(圖3a,3b和3C)。此外,H1-SS-np在半數的小鼠中誘導出與H2和H5相當的實質性組2(H3和H7)應答(圖3a,左圖)。在小鼠和雪貂兩者中,由H1-SS-np引發的對HA莖的抗體應答顯著高於三價滅活的流感疫苗(TIV)的抗體應答(圖3b,右圖)。雖然也觀察到對鐵蛋白的相當大的應答(圖3a和3b,左圖),但先前的研究已顯示用細菌鐵蛋白免疫不誘導小鼠中自體鐵蛋白的免疫,它也不減輕對隨後免疫的HA特異性抗體應答。使用高度靈敏的HA-NA慢病毒報告物測定法(Wei,C.J.,et al.Sci.Transl.Med.2,24ra21(2010))測量血清中和活性(NT)揭示了在小鼠和雪貂兩者中針對趨異的H1N1毒株A/加利福尼亞/04/2009(2009CA)和A/新加坡/6/1986(1986SG)和同源1999NC株的看得出的活性。然而,針對異亞型H5N1A/越南/1203/2004(H5N1 2004VN),人起源H2N2A/加拿大/720/2005(H2N2 2005CA),H7N9A/安徽(Anhui)/1/2013(H7N9 2013AN)和H9N2A/香港/1074/1999(H9N2 1999HK)在小鼠和雪貂兩者中都是低的或不可檢測的(圖3a和3c)。儘管強的異亞型抗體反應性,但觀察到的最小異亞型中和可能是由於莖中和所需要的單個表位區域的精確靶向,使得其比在表面積上大20倍的HA莖的其它部分對次要結構差異更敏感。TIV免疫的動物在小鼠和雪貂兩者中具有針對同源1999NC的最高NT,針對異源H1N1株的可檢測NT,以及沒有針對異亞型H5N1的NT(圖3b)。如預期的,TIV免疫的動物具有顯著的血凝抑制(HAI)滴度,並且由H1-SS-np和H1-SS-np』引發的NT活性與HAI無關。

為了評估保護,用高致死劑量的高致病性H5N1 2004VN病毒攻擊經免疫的小鼠和雪貂。所有未免疫的小鼠和用空np免疫的小鼠死亡,並且顯著地,所有用H1-SS-np免疫的小鼠存活(圖4a)。用空鐵蛋白納米粒免疫的所有雪貂死於感染,並且用H5N1HA DNA/單價滅活疫苗(MIV)初免-加強免疫的所有雪貂存活(圖4b)。與小鼠研究一致,六個基於H1N1的H1-SS-np』免疫的雪貂中的四個倖免於H5N1攻擊。儘管六個TIV免疫的雪貂中的兩個存活,但是兩個存活者中的一個經歷嚴重的體重減輕(圖4a),並且在具有最小體重減輕的另一隻存活者中沒有H5血清學應答的證據,提示沒有發生感染。除了一個血清陰性動物之外,與空的鐵蛋白-np對照相比,TIV免疫的組在體重減輕或發燒方面沒有差異,並且如通過攻擊後活動評分證明,比H1-SS-np』-免疫的雪貂顯示更大的疾病。與空鐵蛋白免疫的對照相比,基於H1-SS-np』-免疫的雪貂中的活動評分,第6天體重減輕,發熱和疾病顯著減少(圖4)。在存活的雪貂中攻擊後第14天存在的針對H5N1 2004VN的HAI滴度指示雖然H1-SS-np』能夠預防疾病,但它不能防止感染。表3和4提供了小鼠和雪貂中的這些免疫研究的總結。

表3:在用H1-SS-np免疫的小鼠中針對H1N1 1999NC和H5N1 2004VN的攻擊後血清HAI抗體滴度。

*此小鼠在攻擊前1天死亡。

表4:用指定方案免疫的雪貂中針對同源H1N1 1999NC的攻擊前HAI抗體滴度和針對攻擊毒株H5N1 2004VN的攻擊後HAI抗體滴度。

由H1-SS-np』引發的可忽略的H5N1NT活性(圖3c)沒有解釋觀察到的異亞型保護。然而,在HA-SS-np』免疫的白鼬中,HA抗體滴度和存活之間以及抗體滴度和體重之間存在相關性。為了進一步研究這種相關性,在用高致死劑量的H5N1 2004VN病毒攻擊前24小時,發明人被動轉移H1-SS-np免疫Ig至未免疫的小鼠(10mg/動物)。轉移的Ig具有與組1HA亞型(H1,H2,H5和H9)的強反應性,與組2亞型(H3和H7)的較弱的結合和最小的NT活性(圖4d和4e)。在表5中顯示H1-SS-np免疫Ig對多種流感假病毒的IC50中和滴度。

表5:H1-SS-np免疫Ig的IC50假病毒中和滴度。

雖然所有接受未免疫的Ig的小鼠都死於感染,但接受免疫Ig的10隻小鼠中的8隻完全被保護而免於致命的H5N1異亞型攻擊。在免疫Ig組中死亡的兩隻小鼠中對同源H1 1999NC HA的低血清反應性指示它們可能尚未接受適當的Ig施用(圖4c)。

這些數據一起顯示,基於除中和之外的功能機制(如抗體依賴性細胞介導的細胞毒性(ADCC)或抗體依賴性補體介導的裂解)的抗體介導的的保護負責由H1-SS-np和H1-SS-np』免疫引發的保護。報告了通過廣泛中和性HA莖抗體在小鼠中的流感保護依賴於Fc相互作用(DiLillo,et.al.Nat Med 20,143-151(2014)),並且已經在人和獼猴血漿兩者中報告了在不存在中和的情況下針對流感HA的交叉反應性ADCC(Jegaskanda,S.,et al.J Immunol 190,1837-1848(2013);Jegaskanda,et al.J.Virol.87,5512-5522(2013);Jegaskanda,et al.J Immunol 193,469-475(2014))。與這些報告一致,本文中呈現的結果提示基於HA莖的流感疫苗不需要必然聚焦於中和性表位以誘導廣泛的保護。

使用基於結構的設計並避免對HA頭部結構域的免疫顯性應答,與納米顆粒抗原展示平臺組合,本發明人成功地產生了僅HA莖的納米顆粒疫苗免疫原,其在雪貂中引發針對H5N1疾病的抗體介導的異亞型保護性免疫。這些結果證明,通過僅HA莖的納米顆粒疫苗引發非中和性抗體可以提供針對嚴重疾病的廣泛保護,並且應該用於開發通用流感疫苗。

序列表

美利堅合眾國, 由健康及人類服務部部長代表

Mascola, John R.

Boyington, Jeffrey C.

Yassine, Hadi M.

Kwong, Peter D.

Graham, Barney S.

Kanekiyo, Masaru

穩定化的流感血凝素莖區三聚體及其用途

6137NIAID-36-PCT

尚未分配

2015-05-27

62/003,471

2014-05-27

401

PatentIn version 3.5

1

504

DNA

幽門螺桿菌

1

atgctgtccg acatcatcaa gctgctgaac gaacaggtga acaaggagat gcagagctcc 60

aacctgtaca tgagtatgtc tagttggtgt tatacacact cactggacgg cgctgggctg 120

ttcctgtttg atcacgcagc cgaggaatac gaacatgcaa agaaactgat cattttcctg 180

aatgagaaca atgtgcccgt ccagctgact tcaatcagcg cccctgaaca taagttcgag 240

ggcctgaccc agatctttca gaaagcttac gaacacgagc agcatatttc cgaatctatc 300

aacaatattg tggaccacgc cattaagagc aaagatcatg ctaccttcaa ctttctgcag 360

tggtacgtgg ccgagcagca cgaggaggag gtcctgttta aggacatcct ggataaaatc 420

gaactgattg gaaacgagaa tcatggcctg tacctggcag atcagtatgt gaagggcatt 480

gccaagtcca gaaaaagtgg gtca 504

2

168

PRT

幽門螺桿菌

2

Met Leu Ser Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu

1 5 10 15

Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr

20 25 30

His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu

35 40 45

Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn

50 55 60

Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu

65 70 75 80

Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile

85 90 95

Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp

100 105 110

His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu

115 120 125

Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly

130 135 140

Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile

145 150 155 160

Ala Lys Ser Arg Lys Ser Gly Ser

165

3

504

DNA

幽門螺桿菌

3

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcggaca gcat 504

4

492

DNA

A流感病毒

4

atcatcaagc tgctgaacga acaggtgaac aaggagatgc agagctccaa cctgtacatg 60

agtatgtcta gttggtgtta tacacactca ctggacggcg ctgggctgtt cctgtttgat 120

cacgcagccg aggaatacga acatgcaaag aaactgatca ttttcctgaa tgagaacaat 180

gtgcccgtcc agctgacttc aatcagcgcc cctgaacata agttcgaggg cctgacccag 240

atctttcaga aagcttacga acacgagcag catatttccg aatctatcaa caatattgtg 300

gaccacgcca ttaagagcaa agatcatgct accttcaact ttctgcagtg gtacgtggcc 360

gagcagcacg aggaggaggt cctgtttaag gacatcctgg ataaaatcga actgattgga 420

aacgagaatc atggcctgta cctggcagat cagtatgtga agggcattgc caagtccaga 480

aaaagtgggt ca 492

5

165

PRT

A流感病毒

5

Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser

1 5 10 15

Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu

20 25 30

Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu

35 40 45

His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val

50 55 60

Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr

65 70 75 80

Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser

85 90 95

Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr

100 105 110

Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val

115 120 125

Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn

130 135 140

His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser

145 150 155 160

Arg Lys Ser Gly Ser

165

6

492

DNA

A流感病毒

6

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg at 492

7

1695

DNA

A流感病毒

7

atgaaggcca aactgctggt gctgctgtgt acctttaccg ccacctacgc cgacacaatc 60

tgtatcggct accacgccaa caatagcacc gacaccgtgg atacagtgct ggagaagaac 120

gtgaccgtga cccactctgt gaacctgctg gaggacagcc acaatggcaa gctgtgtctg 180

ctgaaaggca ttgcccctct gcagctgggc aattgttctg tggccggatg gattctgggc 240

aaccccgagt gtgagctgct gatttctaag gagagctgga gctacatcgt ggagaccccc 300

aatcctgaga atggcacctg ctaccctggc tacttcgccg attacgagga gctgcgcgag 360

cagctgtcta gcgtgtccag cttcgagaga ttcgagatct tccccaagga gtccagctgg 420

cctaatcaca cagtgacagg cgtgtctgcc agctgtagcc acaacggcaa aagcagcttc 480

taccggaacc tgctgtggct gacaggcaag aatggcctgt accccaacct gagcaagagc 540

tacgtgaaca acaaggaaaa ggaagtgctg gtgctgtggg gagtgcacca ccctcccaac 600

atcggaaatc agcgggccct gtaccacaca gagaacgcct atgtgagcgt ggtgtccagc 660

cactacagca gaagattcac ccccgagatc gccaagagac ccaaagtgag agaccaggag 720

ggccggatca attactactg gaccctgctg gagcctggcg ataccatcat cttcgaggcc 780

aacggcaatc tgatcgcccc ttggtatgcc tttgccctga gcagaggctt tggcagcggc 840

atcatcacaa gcaacgcccc catggatgag tgtgatgcca agtgccagac acctcagggc 900

gccatcaata gcagcctgcc cttccagaat gtgcaccctg tgaccatcgg cgagtgcccc 960

aagtatgtga gaagcgccaa gctgagaatg gtgaccggcc tgagaaacat ccctagcatc 1020

cagagcagag gactgtttgg agccatcgcc ggattcatcg agggaggatg gacaggcatg 1080

gtggatggct ggtacggcta ccaccaccag aatgagcagg gctctggata tgccgccgat 1140

cagaagtcta cccagaacgc catcaacggc atcaccaaca aggtgaacag cgtgatcgag 1200

aagatgaaca cccagtttac cgctgtgggc aaggagttca acaagctgga gcggaggatg 1260

gagaacctga acaagaaggt ggacgacggc tttctggaca tctggaccta caatgccgaa 1320

ctcctggtcc tcctcgagaa tgagaggacc ctggacttcc acgacagcaa cgtgaagaac 1380

ctgtatgaga aggtgaagag ccagctgaag aacaacgcca aggagatcgg caacggctgc 1440

ttcgagttct accacaagtg taacaacgag tgtatggaga gcgtgaagaa cggcacctac 1500

gactacccta agtacagcga ggagagcaag ctgaaccggg agaagatcga tggcgtgaag 1560

ctggagagca tgggcgtgta tcagatcctg gccatctaca gcacagtggc ctcttctctg 1620

gtgctgctgg tgtctctggg cgccatctcc ttttggatgt gctccaacgg cagcctgcag 1680

tgcaggatct gtatc 1695

8

565

PRT

A流感病毒

8

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Leu Glu Asp Ser His Asn Gly Lys Leu Cys Leu Leu Lys Gly Ile

50 55 60

Ala Pro Leu Gln Leu Gly Asn Cys Ser Val Ala Gly Trp Ile Leu Gly

65 70 75 80

Asn Pro Glu Cys Glu Leu Leu Ile Ser Lys Glu Ser Trp Ser Tyr Ile

85 90 95

Val Glu Thr Pro Asn Pro Glu Asn Gly Thr Cys Tyr Pro Gly Tyr Phe

100 105 110

Ala Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe

115 120 125

Glu Arg Phe Glu Ile Phe Pro Lys Glu Ser Ser Trp Pro Asn His Thr

130 135 140

Val Thr Gly Val Ser Ala Ser Cys Ser His Asn Gly Lys Ser Ser Phe

145 150 155 160

Tyr Arg Asn Leu Leu Trp Leu Thr Gly Lys Asn Gly Leu Tyr Pro Asn

165 170 175

Leu Ser Lys Ser Tyr Val Asn Asn Lys Glu Lys Glu Val Leu Val Leu

180 185 190

Trp Gly Val His His Pro Pro Asn Ile Gly Asn Gln Arg Ala Leu Tyr

195 200 205

His Thr Glu Asn Ala Tyr Val Ser Val Val Ser Ser His Tyr Ser Arg

210 215 220

Arg Phe Thr Pro Glu Ile Ala Lys Arg Pro Lys Val Arg Asp Gln Glu

225 230 235 240

Gly Arg Ile Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr Ile

245 250 255

Ile Phe Glu Ala Asn Gly Asn Leu Ile Ala Pro Trp Tyr Ala Phe Ala

260 265 270

Leu Ser Arg Gly Phe Gly Ser Gly Ile Ile Thr Ser Asn Ala Pro Met

275 280 285

Asp Glu Cys Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser

290 295 300

Ser Leu Pro Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro

305 310 315 320

Lys Tyr Val Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn

325 330 335

Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

340 345 350

Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His

355 360 365

His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr

370 375 380

Gln Asn Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu

385 390 395 400

Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu

405 410 415

Glu Arg Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu

420 425 430

Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu

435 440 445

Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys

450 455 460

Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys

465 470 475 480

Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys

485 490 495

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn

500 505 510

Arg Glu Lys Ile Asp Gly Val Lys Leu Glu Ser Met Gly Val Tyr Gln

515 520 525

Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val

530 535 540

Ser Leu Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln

545 550 555 560

Cys Arg Ile Cys Ile

565

9

1695

DNA

A流感病毒

9

gatacagatc ctgcactgca ggctgccgtt ggagcacatc caaaaggaga tggcgcccag 60

agacaccagc agcaccagag aagaggccac tgtgctgtag atggccagga tctgatacac 120

gcccatgctc tccagcttca cgccatcgat cttctcccgg ttcagcttgc tctcctcgct 180

gtacttaggg tagtcgtagg tgccgttctt cacgctctcc atacactcgt tgttacactt 240

gtggtagaac tcgaagcagc cgttgccgat ctccttggcg ttgttcttca gctggctctt 300

caccttctca tacaggttct tcacgttgct gtcgtggaag tccagggtcc tctcattctc 360

gaggaggacc aggagttcgg cattgtaggt ccagatgtcc agaaagccgt cgtccacctt 420

cttgttcagg ttctccatcc tccgctccag cttgttgaac tccttgccca cagcggtaaa 480

ctgggtgttc atcttctcga tcacgctgtt caccttgttg gtgatgccgt tgatggcgtt 540

ctgggtagac ttctgatcgg cggcatatcc agagccctgc tcattctggt ggtggtagcc 600

gtaccagcca tccaccatgc ctgtccatcc tccctcgatg aatccggcga tggctccaaa 660

cagtcctctg ctctggatgc tagggatgtt tctcaggccg gtcaccattc tcagcttggc 720

gcttctcaca tacttggggc actcgccgat ggtcacaggg tgcacattct ggaagggcag 780

gctgctattg atggcgccct gaggtgtctg gcacttggca tcacactcat ccatgggggc 840

gttgcttgtg atgatgccgc tgccaaagcc tctgctcagg gcaaaggcat accaaggggc 900

gatcagattg ccgttggcct cgaagatgat ggtatcgcca ggctccagca gggtccagta 960

gtaattgatc cggccctcct ggtctctcac tttgggtctc ttggcgatct cgggggtgaa 1020

tcttctgctg tagtggctgg acaccacgct cacataggcg ttctctgtgt ggtacagggc 1080

ccgctgattt ccgatgttgg gagggtggtg cactccccac agcaccagca cttccttttc 1140

cttgttgttc acgtagctct tgctcaggtt ggggtacagg ccattcttgc ctgtcagcca 1200

cagcaggttc cggtagaagc tgcttttgcc gttgtggcta cagctggcag acacgcctgt 1260

cactgtgtga ttaggccagc tggactcctt ggggaagatc tcgaatctct cgaagctgga 1320

cacgctagac agctgctcgc gcagctcctc gtaatcggcg aagtagccag ggtagcaggt 1380

gccattctca ggattggggg tctccacgat gtagctccag ctctccttag aaatcagcag 1440

ctcacactcg gggttgccca gaatccatcc ggccacagaa caattgccca gctgcagagg 1500

ggcaatgcct ttcagcagac acagcttgcc attgtggctg tcctccagca ggttcacaga 1560

gtgggtcacg gtcacgttct tctccagcac tgtatccacg gtgtcggtgc tattgttggc 1620

gtggtagccg atacagattg tgtcggcgta ggtggcggta aaggtacaca gcagcaccag 1680

cagtttggcc ttcat 1695

10

1698

DNA

A流感病毒

10

atgaaggcta ttttggtcgt gctcctgtac acctttgcca cagccaatgc cgataccctt 60

tgtattggct accatgcaaa caactctacc gatacggtcg acacggtgct cgaaaagaat 120

gttactgtca cccactctgt gaacttgctg gaggataaac acaatggcaa gctctgcaaa 180

ctgcgagggg tggctcccct gcatctggga aaatgtaata ttgccggctg gatactgggt 240

aatccagaat gcgaatcctt gagtacggca tccagttggt cctatatcgt cgagaccccg 300

tcaagtgaca atgggacctg ctacccaggc gacttcattg attatgaaga gctgagggag 360

cagttgtcat ccgtaagcag cttcgaaagg tttgagattt tcccgaaaac tagctcctgg 420

cccaatcatg actctaacaa aggagttact gcagcctgtc ctcatgcggg cgcgaaaagc 480

ttctacaaga acctgatatg gctcgtgaag aaaggcaatt catacccaaa actgtctaag 540

agctacataa acgataaagg gaaagaggtt ctggtgcttt ggggcataca ccacccatct 600

acctcagccg accagcagtc tctgtatcag aacgccgaca catacgtgtt tgtgggcagc 660

tcccgctatt ctaagaagtt caaacccgag atcgccatca gaccaaaggt gagagaccag 720

gaaggaagga tgaattatta ctggaccttg gtcgaacctg gcgataagat aacgtttgag 780

gctacgggca acctggtcgt gccgagatat gcttttgcca tggagaggaa tgcggggagc 840

ggaattatca tcagcgacac tccagttcat gactgtaata ccacatgtca gacaccgaag 900

ggcgccatca acacgagctt gccctttcag aatatacatc caatcacaat cggaaaatgc 960

cccaagtacg tgaaaagcac taaactgaga ctcgccaccg gactcaggaa tatcccaagc 1020

atccagtcac ggggtctgtt cggcgctatc gccggattta ttgaaggcgg ctggacgggg 1080

atggtggacg gttggtacgg ctaccatcat caaaatgagc agggctccgg atacgccgct 1140

gacctgaaat ctacgcagaa tgccatagat gagatcacaa acaaggtcaa tagtgtgata 1200

gaaaaaatga atactcagtt cacagctgtt ggaaaggagt ttaaccacct cgagaagcga 1260

attgagaacc tgaacaagaa ggtggacgat ggctttttgg atatctggac gtataacgct 1320

gagctgcttg ttctgctgga gaacgaaaga acccttgact accacgattc caacgtgaag 1380

aatctgtatg agaaagtgcg aagccagttg aaaaacaacg caaaagaaat aggcaacggc 1440

tgtttcgagt tctaccacaa atgcgataac acctgcatgg agagtgtgaa gaacggaacg 1500

tacgattatc caaaatactc cgaggaggcc aaactcaata gggaggagat agacggtgtt 1560

aagctggagt ccacacgcat ctatcagatt ctggcgatct actctactgt ggcttccagc 1620

ctggtgctgg tcgtttccct tggggcgatc agcttctgga tgtgcagcaa tggctccctg 1680

caatgccgca tctgcatc 1698

11

566

PRT

A流感病毒

11

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val

50 55 60

Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly

65 70 75 80

Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile

85 90 95

Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe

100 105 110

Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe

115 120 125

Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp

130 135 140

Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser

145 150 155 160

Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro

165 170 175

Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val

180 185 190

Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu

195 200 205

Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser

210 215 220

Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln

225 230 235 240

Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys

245 250 255

Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe

260 265 270

Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro

275 280 285

Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn

290 295 300

Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys

305 310 315 320

Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg

325 330 335

Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly

340 345 350

Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr

355 360 365

His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser

370 375 380

Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile

385 390 395 400

Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His

405 410 415

Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe

420 425 430

Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn

435 440 445

Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu

450 455 460

Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

465 470 475 480

Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val

485 490 495

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu

500 505 510

Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr

515 520 525

Gln Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Val

530 535 540

Val Ser Leu Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu

545 550 555 560

Gln Cys Arg Ile Cys Ile

565

12

1698

DNA

A流感病毒

12

gatgcagatg cggcattgca gggagccatt gctgcacatc cagaagctga tcgccccaag 60

ggaaacgacc agcaccaggc tggaagccac agtagagtag atcgccagaa tctgatagat 120

gcgtgtggac tccagcttaa caccgtctat ctcctcccta ttgagtttgg cctcctcgga 180

gtattttgga taatcgtacg ttccgttctt cacactctcc atgcaggtgt tatcgcattt 240

gtggtagaac tcgaaacagc cgttgcctat ttcttttgcg ttgtttttca actggcttcg 300

cactttctca tacagattct tcacgttgga atcgtggtag tcaagggttc tttcgttctc 360

cagcagaaca agcagctcag cgttatacgt ccagatatcc aaaaagccat cgtccacctt 420

cttgttcagg ttctcaattc gcttctcgag gtggttaaac tcctttccaa cagctgtgaa 480

ctgagtattc attttttcta tcacactatt gaccttgttt gtgatctcat ctatggcatt 540

ctgcgtagat ttcaggtcag cggcgtatcc ggagccctgc tcattttgat gatggtagcc 600

gtaccaaccg tccaccatcc ccgtccagcc gccttcaata aatccggcga tagcgccgaa 660

cagaccccgt gactggatgc ttgggatatt cctgagtccg gtggcgagtc tcagtttagt 720

gcttttcacg tacttggggc attttccgat tgtgattgga tgtatattct gaaagggcaa 780

gctcgtgttg atggcgccct tcggtgtctg acatgtggta ttacagtcat gaactggagt 840

gtcgctgatg ataattccgc tccccgcatt cctctccatg gcaaaagcat atctcggcac 900

gaccaggttg cccgtagcct caaacgttat cttatcgcca ggttcgacca aggtccagta 960

ataattcatc cttccttcct ggtctctcac ctttggtctg atggcgatct cgggtttgaa 1020

cttcttagaa tagcgggagc tgcccacaaa cacgtatgtg tcggcgttct gatacagaga 1080

ctgctggtcg gctgaggtag atgggtggtg tatgccccaa agcaccagaa cctctttccc 1140

tttatcgttt atgtagctct tagacagttt tgggtatgaa ttgcctttct tcacgagcca 1200

tatcaggttc ttgtagaagc ttttcgcgcc cgcatgagga caggctgcag taactccttt 1260

gttagagtca tgattgggcc aggagctagt tttcgggaaa atctcaaacc tttcgaagct 1320

gcttacggat gacaactgct ccctcagctc ttcataatca atgaagtcgc ctgggtagca 1380

ggtcccattg tcacttgacg gggtctcgac gatataggac caactggatg ccgtactcaa 1440

ggattcgcat tctggattac ccagtatcca gccggcaata ttacattttc ccagatgcag 1500

gggagccacc cctcgcagtt tgcagagctt gccattgtgt ttatcctcca gcaagttcac 1560

agagtgggtg acagtaacat tcttttcgag caccgtgtcg accgtatcgg tagagttgtt 1620

tgcatggtag ccaatacaaa gggtatcggc attggctgtg gcaaaggtgt acaggagcac 1680

gaccaaaata gccttcat 1698

13

1683

DNA

A流感病毒

13

atggccatca tctacctgat cctgctgttt acagctgtga gaggcgacca gatctgtatc 60

ggctaccacg ccaacaatag caccgagaag gtggacacca tcctggagag aaacgtgaca 120

gtgacccacg ccaaggacat cctggaaaag acccacaacg gcaagctgtg taagctgaac 180

ggcatccctc ctctggaact gggcgattgt tctatcgccg gatggctgct gggaaacccc 240

gagtgtgata ggctgctgtc tgtgcctgag tggagctaca tcatggagaa ggagaaccct 300

agggacggcc tgtgttaccc tggcagcttc aacgattacg aggagctgaa gcacctgctg 360

tctagcgtga agcacttcga gaaggtgaag atcctgccca aggacagatg gacccagcac 420

acaacaacag gaggaagcag agcctgcgcc gtgtctggca accccagctt cttccggaat 480

atggtgtggc tgaccaagaa gggcagcaat taccctgtgg cccagggcag ctacaataat 540

accagcggcg agcagatgct gatcatctgg ggagtgcacc accctaatga cgagaccgag 600

cagagaaccc tgtaccagaa tgtgggcacc tacgtgtctg tgggcaccag caccctgaat 660

aagagaagca cccccgagat tgccacaaga cccaaggtga acggccaggg aggaagaatg 720

gagttcagct ggaccctgct ggatatgtgg gacaccatca actttgagag caccggcaat 780

ctgatcgccc ctgagtacgg cttcaagatc agcaagagag gcagcagcgg catcatgaaa 840

accgagggca ccctggagaa ttgtgagacc aagtgccaga cacctctggg cgccatcaat 900

accaccctgc ccttccacaa tgtgcaccct ctgaccatcg gcgagtgccc taagtatgtg 960

aagagcgaga agctggtgct ggccacagga ctgagaaacg tgccccagat cgagagcaga 1020

ggcctgtttg gagccatcgc cggattcatc gagggaggat ggcagggaat ggtcgatggc 1080

tggtacggct accaccacag caatgatcag ggctctggct atgccgccga taaggagtct 1140

acccagaagg cctttgacgg catcaccaac aaggtgaaca gcgtgatcga gaagatgaac 1200

acccagtttg aggctgtggg caaggagttt agcaacctgg agcggagact ggagaacctg 1260

aacaagaaga tggaggacgg cttcctggat gtgtggacct acaatgccga actgctggtg 1320

ctgatggaga atgagcggac cctggacttc cacgacagca acgtgaagaa cctgtacgac 1380

aaagtgagga tgcagctgag ggacaacgtg aaggaactgg gcaatggctg cttcgagttc 1440

taccacaagt gtgacgacga gtgtatgaac tccgtgaaga acggcaccta cgactaccct 1500

aagtacgagg aggagagcaa gctgaaccgg aacgagatca agggcgtgaa gctgtctagc 1560

atgggcgtgt atcagatcct ggccatctat gccacagtgg ccggatctct gagcctggca 1620

attatgatgg ctggaatcag cttctggatg tgctccaatg gcagcctgca gtgccggatc 1680

tgt 1683

14

564

PRT

A流感病毒

14

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Leu

35 40 45

Glu Lys Thr His Asn Gly Lys Leu Cys Lys Leu Asn Gly Ile Pro Pro

50 55 60

Leu Glu Leu Gly Asp Cys Ser Ile Ala Gly Trp Leu Leu Gly Asn Pro

65 70 75 80

Glu Cys Asp Arg Leu Leu Ser Val Pro Glu Trp Ser Tyr Ile Met Glu

85 90 95

Lys Glu Asn Pro Arg Asp Gly Leu Cys Tyr Pro Gly Ser Phe Asn Asp

100 105 110

Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Val Lys His Phe Glu Lys

115 120 125

Val Lys Ile Leu Pro Lys Asp Arg Trp Thr Gln His Thr Thr Thr Gly

130 135 140

Gly Ser Arg Ala Cys Ala Val Ser Gly Asn Pro Ser Phe Phe Arg Asn

145 150 155 160

Met Val Trp Leu Thr Lys Lys Gly Ser Asn Tyr Pro Val Ala Lys Gly

165 170 175

Ser Tyr Asn Asn Thr Ser Gly Glu Gln Met Leu Ile Ile Trp Gly Val

180 185 190

His His Pro Asn Asp Glu Thr Glu Gln Arg Thr Leu Tyr Gln Asn Val

195 200 205

Gly Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Lys Arg Ser Thr

210 215 220

Pro Asp Tyr His Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Gly Gly

225 230 235 240

Arg Met Glu Phe Ser Trp Thr Leu Leu Asp Met Trp Asp Thr Ile Asn

245 250 255

Phe Glu Ser Thr Gly Asn Leu Ile Ala Pro Glu Tyr Gly Phe Lys Ile

260 265 270

Ser Lys Arg Gly Ser Ser Gly Ile Met Lys Thr Glu Gly Thr Leu Glu

275 280 285

Asn Cys Glu Thr Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr

290 295 300

Leu Pro Phe His Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys

305 310 315 320

Tyr Val Lys Ser Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val

325 330 335

Pro Gln Ile Glu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile

340 345 350

Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His

355 360 365

Ser Asn Asp Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln

370 375 380

Lys Ala Phe Asp Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys

385 390 395 400

Met Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu

405 410 415

Arg Arg Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp

420 425 430

Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg

435 440 445

Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val

450 455 460

Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe

465 470 475 480

Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn

485 490 495

Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg

500 505 510

Asn Glu Ile Lys Gly Val Lys Leu Ser Ser Met Gly Val Tyr Gln Ile

515 520 525

Leu Ala Ile Tyr Ala Thr Val Ala Gly Ser Leu Ser Leu Ala Ile Met

530 535 540

Met Ala Gly Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys

545 550 555 560

Arg Ile Cys Ile

15

1683

DNA

A流感病毒

15

acagatccgg cactgcaggc tgccattgga gcacatccag aagctgattc cagccatcat 60

aattgccagg ctcagagatc cggccactgt ggcatagatg gccaggatct gatacacgcc 120

catgctagac agcttcacgc ccttgatctc gttccggttc agcttgctct cctcctcgta 180

cttagggtag tcgtaggtgc cgttcttcac ggagttcata cactcgtcgt cacacttgtg 240

gtagaactcg aagcagccat tgcccagttc cttcacgttg tccctcagct gcatcctcac 300

tttgtcgtac aggttcttca cgttgctgtc gtggaagtcc agggtccgct cattctccat 360

cagcaccagc agttcggcat tgtaggtcca cacatccagg aagccgtcct ccatcttctt 420

gttcaggttc tccagtctcc gctccaggtt gctaaactcc ttgcccacag cctcaaactg 480

ggtgttcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 540

ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 600

ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 660

gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca gcttctcgct 720

cttcacatac ttagggcact cgccgatggt cagagggtgc acattgtgga agggcagggt 780

ggtattgatg gcgcccagag gtgtctggca cttggtctca caattctcca gggtgccctc 840

ggttttcatg atgccgctgc tgcctctctt gctgatcttg aagccgtact caggggcgat 900

cagattgccg gtgctctcaa agttgatggt gtcccacata tccagcaggg tccagctgaa 960

ctccattctt cctccctggc cgttcacctt gggtcttgtg gcaatctcgg gggtgcttct 1020

cttattcagg gtgctggtgc ccacagacac gtaggtgccc acattctggt acagggttct 1080

ctgctcggtc tcgtcattag ggtggtgcac tccccagatg atcagcatct gctcgccgct 1140

ggtattattg tagctgccct gggccacagg gtaattgctg cccttcttgg tcagccacac 1200

catattccgg aagaagctgg ggttgccaga cacggcgcag gctctgcttc ctcctgttgt 1260

tgtgtgctgg gtccatctgt ccttgggcag gatcttcacc ttctcgaagt gcttcacgct 1320

agacagcagg tgcttcagct cctcgtaatc gttgaagctg ccagggtaac acaggccgtc 1380

cctagggttc tccttctcca tgatgtagct ccactcaggc acagacagca gcctatcaca 1440

ctcggggttt cccagcagcc atccggcgat agaacaatcg cccagttcca gaggagggat 1500

gccgttcagc ttacacagct tgccgttgtg ggtcttttcc aggatgtcct tggcgtgggt 1560

cactgtcacg tttctctcca ggatggtgtc caccttctcg gtgctattgt tggcgtggta 1620

gccgatacag atctggtcgc ctctcacagc tgtaaacagc aggatcaggt agatgatggc 1680

cat 1683

16

1704

DNA

A流感病毒

16

atggaaaaga tcgtgctgct gctggccatt gtgagcctgg tgaagagcga ccagatctgc 60

attggctacc acgccaacaa tagcacagag caggtggaca ccatcatgga aaaaaacgtg 120

accgtgaccc acgctcagga catcctggaa aagacccaca acggcaagct gtgtgatctg 180

gacggcgtga agcctctgat cctgagagat tgtagcgtgg ctggatggct gctgggcaac 240

cctatgtgcg acgagttcat caacgtgccc gagtggagct atatcgtgga gaaggccaac 300

cccaccaacg atctgtgtta ccccggcagc ttcaacgatt acgaggaact gaagcacctg 360

ctgtcccgga tcaaccactt cgagaagatc cagatcatcc ccaagtcctc ttggagcgat 420

cacgaagcct ctagcggagt gtctagcgcc tgtccttacc tgggcagccc cagcttcttc 480

agaaacgtgg tgtggctgat caagaagaac agcacctacc ccaccatcaa gaagagctac 540

aacaacacca accaggaaga tctgctggtc ctgtggggaa tccaccaccc taatgatgcc 600

gccgagcaga ccagactgta ccagaacccc accacctata tcagcatcgg caccagcacc 660

ctgaatcaga gactggtgcc caagatcgcc accagatcca aggtgaacgg ccagagcggc 720

aggatggaat tcttctggac catcctgaag cccaacgacg ccatcaactt cgagagcaac 780

ggcaacttta tcgcccctga gtacgcctac aagatcgtga agaagggcga cagcgccatc 840

atgaagagcg agctggaata cggcaactgc aacaccaagt gccagacacc tatgggcgcc 900

atcaacagca gcatgccctt ccacaacatc caccctctga ccatcggcga gtgccctaag 960

tacgtgaaga gcaacagact ggtgctggcc acaggcctga gaaatagccc ccagcgggag 1020

agcagaagaa agaagagggg cctgtttgga gccatcgccg gctttattga aggcggctgg 1080

cagggaatgg tggatggctg gtacggctac caccacagca atgagcaggg ctctggatat 1140

gccgccgaca aagagtctac ccagaaggcc atcgacggcg tcaccaacaa ggtgaacagc 1200

atcatcgaca agatgaacac ccagttcgag gctgtgggca gagagttcaa caacctggaa 1260

cggcggatcg agaacctgaa caagaaaatg gaagatggct tcctggatgt gtggacctac 1320

aatgccgaac tgctggtgct gatggaaaac gagcggaccc tggacttcca cgacagcaac 1380

gtgaagaacc tgtacgacaa agtgcggctg cagctgagag acaacgccaa agagctgggc 1440

aacggctgct tcgagttcta ccacaagtgc gacaacgagt gcatggaaag catccggaac 1500

ggcacctaca actaccctca gtacagcgag gaagccaggc tgaagaggga agagatcagc 1560

ggcgtgaaac tggaatccat cggcacctac cagatcctga gcatctacag cacagtggcc 1620

tcttctctgg ccctggccat tatgatggcc ggactgagcc tgtggatgtg cagcaatggc 1680

agcctgcagt gcaggatctg catc 1704

17

568

PRT

A流感病毒

17

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

Leu Glu Lys Thr His Asn Gly Lys Leu Cys Asp Leu Asp Gly Val Lys

50 55 60

Pro Leu Ile Leu Arg Asp Cys Ser Val Ala Gly Trp Leu Leu Gly Asn

65 70 75 80

Pro Met Cys Asp Glu Phe Ile Asn Val Pro Glu Trp Ser Tyr Ile Val

85 90 95

Glu Lys Ala Asn Pro Thr Asn Asp Leu Cys Tyr Pro Gly Ser Phe Asn

100 105 110

Asp Tyr Glu Glu Leu Lys His Leu Leu Ser Arg Ile Asn His Phe Glu

115 120 125

Lys Ile Gln Ile Ile Pro Lys Ser Ser Trp Ser Asp His Glu Ala Ser

130 135 140

Ser Gly Val Ser Ser Ala Cys Pro Tyr Leu Gly Ser Pro Ser Phe Phe

145 150 155 160

Arg Asn Val Val Trp Leu Ile Lys Lys Asn Ser Thr Tyr Pro Thr Ile

165 170 175

Lys Lys Ser Tyr Asn Asn Thr Asn Gln Glu Asp Leu Leu Val Leu Trp

180 185 190

Gly Ile His His Pro Asn Asp Ala Ala Glu Gln Thr Arg Leu Tyr Gln

195 200 205

Asn Pro Thr Thr Tyr Ile Ser Ile Gly Thr Ser Thr Leu Asn Gln Arg

210 215 220

Leu Val Pro Lys Ile Ala Thr Arg Ser Lys Val Asn Gly Gln Ser Gly

225 230 235 240

Arg Met Glu Phe Phe Trp Thr Ile Leu Lys Pro Asn Asp Ala Ile Asn

245 250 255

Phe Glu Ser Asn Gly Asn Phe Ile Ala Pro Glu Tyr Ala Tyr Lys Ile

260 265 270

Val Lys Lys Gly Asp Ser Ala Ile Met Lys Ser Glu Leu Glu Tyr Gly

275 280 285

Asn Cys Asn Thr Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser

290 295 300

Met Pro Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys

305 310 315 320

Tyr Val Lys Ser Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser

325 330 335

Pro Gln Arg Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile

340 345 350

Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr

355 360 365

Gly Tyr His His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys

370 375 380

Glu Ser Thr Gln Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser

385 390 395 400

Ile Ile Asp Lys Met Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe

405 410 415

Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp

420 425 430

Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met

435 440 445

Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu

450 455 460

Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly

465 470 475 480

Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu

485 490 495

Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala

500 505 510

Arg Leu Lys Arg Glu Glu Ile Ser Gly Val Lys Leu Glu Ser Ile Gly

515 520 525

Thr Tyr Gln Ile Leu Ser Ile Tyr Ser Thr Val Ala Ser Ser Leu Ala

530 535 540

Leu Ala Ile Met Met Ala Gly Leu Ser Leu Trp Met Cys Ser Asn Gly

545 550 555 560

Ser Leu Gln Cys Arg Ile Cys Ile

565

18

1704

DNA

A流感病毒

18

gatgcagatc ctgcactgca ggctgccatt gctgcacatc cacaggctca gtccggccat 60

cataatggcc agggccagag aagaggccac tgtgctgtag atgctcagga tctggtaggt 120

gccgatggat tccagtttca cgccgctgat ctcttccctc ttcagcctgg cttcctcgct 180

gtactgaggg tagttgtagg tgccgttccg gatgctttcc atgcactcgt tgtcgcactt 240

gtggtagaac tcgaagcagc cgttgcccag ctctttggcg ttgtctctca gctgcagccg 300

cactttgtcg tacaggttct tcacgttgct gtcgtggaag tccagggtcc gctcgttttc 360

catcagcacc agcagttcgg cattgtaggt ccacacatcc aggaagccat cttccatttt 420

cttgttcagg ttctcgatcc gccgttccag gttgttgaac tctctgccca cagcctcgaa 480

ctgggtgttc atcttgtcga tgatgctgtt caccttgttg gtgacgccgt cgatggcctt 540

ctgggtagac tctttgtcgg cggcatatcc agagccctgc tcattgctgt ggtggtagcc 600

gtaccagcca tccaccattc cctgccagcc gccttcaata aagccggcga tggctccaaa 660

caggcccctc ttctttcttc tgctctcccg ctgggggcta tttctcaggc ctgtggccag 720

caccagtctg ttgctcttca cgtacttagg gcactcgccg atggtcagag ggtggatgtt 780

gtggaagggc atgctgctgt tgatggcgcc cataggtgtc tggcacttgg tgttgcagtt 840

gccgtattcc agctcgctct tcatgatggc gctgtcgccc ttcttcacga tcttgtaggc 900

gtactcaggg gcgataaagt tgccgttgct ctcgaagttg atggcgtcgt tgggcttcag 960

gatggtccag aagaattcca tcctgccgct ctggccgttc accttggatc tggtggcgat 1020

cttgggcacc agtctctgat tcagggtgct ggtgccgatg ctgatatagg tggtggggtt 1080

ctggtacagt ctggtctgct cggcggcatc attagggtgg tggattcccc acaggaccag 1140

cagatcttcc tggttggtgt tgttgtagct cttcttgatg gtggggtagg tgctgttctt 1200

cttgatcagc cacaccacgt ttctgaagaa gctggggctg cccaggtaag gacaggcgct 1260

agacactccg ctagaggctt cgtgatcgct ccaagaggac ttggggatga tctggatctt 1320

ctcgaagtgg ttgatccggg acagcaggtg cttcagttcc tcgtaatcgt tgaagctgcc 1380

ggggtaacac agatcgttgg tggggttggc cttctccacg atatagctcc actcgggcac 1440

gttgatgaac tcgtcgcaca tagggttgcc cagcagccat ccagccacgc tacaatctct 1500

caggatcaga ggcttcacgc cgtccagatc acacagcttg ccgttgtggg tcttttccag 1560

gatgtcctga gcgtgggtca cggtcacgtt tttttccatg atggtgtcca cctgctctgt 1620

gctattgttg gcgtggtagc caatgcagat ctggtcgctc ttcaccaggc tcacaatggc 1680

cagcagcagc acgatctttt ccat 1704

19

147

DNA

A流感病毒

19

atgaaggcca aactgctggt gctgctgtgt acctttaccg ccacctacgc cgacacaatc 60

tgtatcggct accacgccaa caatagcacc gacaccgtgg atacagtgct ggagaagaac 120

gtgaccgtga cccactctgt gaacctg 147

20

49

PRT

A流感病毒

20

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu

21

147

DNA

A流感病毒

21

caggttcaca gagtgggtca cggtcacgtt cttctccagc actgtatcca cggtgtcggt 60

gctattgttg gcgtggtagc cgatacagat tgtgtcggcg taggtggcgg taaaggtaca 120

cagcagcacc agcagtttgg ccttcat 147

22

678

DNA

A流感病毒

22

gatgccaagt gccagacacc tcagggcgcc atcaatagca gcctgccctt ccagaatgtg 60

caccctgtga ccatcggcga gtgccccaag tatgtgagaa gcgccaagct gagaatggtg 120

accggcctga gaaacatccc tagcatccag agcagaggac tgtttggagc catcgccgga 180

ttcatcgagg gaggatggac aggcatggtg gatggctggt acggctacca ccaccagaat 240

gagcagggct ctggatatgc cgccgatcag aagtctaccc agaacgccat caacggcatc 300

accaacaagg tgaacagcgt gatcgagaag atgaacaccc agtttaccgc tgtgggcaag 360

gagttcaaca agctggagcg gaggatggag aacctgaaca agaaggtgga cgacggcttt 420

ctggacatct ggacctacaa tgccgaactc ctggtcctcc tcgagaatga gaggaccctg 480

gacttccacg acagcaacgt gaagaacctg tatgagaagg tgaagagcca gctgaagaac 540

aacgccaagg agatcggcaa cggctgcttc gagttctacc acaagtgtaa caacgagtgt 600

atggagagcg tgaagaacgg cacctacgac taccctaagt acagcgagga gagcaagctg 660

aaccgggaga agatcgat 678

23

226

PRT

A流感病毒

23

Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser Ser Leu Pro

1 5 10 15

Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro Lys Tyr Val

20 25 30

Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

35 40 45

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

50 55 60

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

65 70 75 80

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

85 90 95

Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn

100 105 110

Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg

115 120 125

Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp

130 135 140

Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu

145 150 155 160

Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser

165 170 175

Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe

180 185 190

Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr

195 200 205

Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys

210 215 220

Ile Asp

225

24

678

DNA

A流感病毒

24

atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60

gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120

gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180

gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240

gtaggtccag atgtccagaa agccgtcgtc caccttcttg ttcaggttct ccatcctccg 300

ctccagcttg ttgaactcct tgcccacagc ggtaaactgg gtgttcatct tctcgatcac 360

gctgttcacc ttgttggtga tgccgttgat ggcgttctgg gtagacttct gatcggcggc 420

atatccagag ccctgctcat tctggtggtg gtagccgtac cagccatcca ccatgcctgt 480

ccatcctccc tcgatgaatc cggcgatggc tccaaacagt cctctgctct ggatgctagg 540

gatgtttctc aggccggtca ccattctcag cttggcgctt ctcacatact tggggcactc 600

gccgatggtc acagggtgca cattctggaa gggcaggctg ctattgatgg cgccctgagg 660

tgtctggcac ttggcatc 678

25

576

DNA

A流感病毒

25

gatgccaagt gccagacacc tcagggcgcc atcaatagca gcctgccctt ccagaatgtg 60

caccctgtga ccatcggcga gtgccccaag tatgtgagaa gcgccaagct gagaatggtg 120

accggcctga gaaacatccc tagcatccag agcagaggac tgtttggagc catcgccgga 180

ttcatcgagg gaggatggac aggcatggtg gatggctggt acggctacca ccaccagaat 240

gagcagggct ctggatatgc cgccgatcag aagtctaccc agaacgccat caacggcatc 300

accaacaagg tgaacagcgt gatcgagaag atgtacaatg ccgaactcct ggtcctcctc 360

gagaatgaga ggaccctgga cttccacgac agcaacgtga agaacctgta tgagaaggtg 420

aagagccagc tgaagaacaa cgccaaggag atcggcaacg gctgcttcga gttctaccac 480

aagtgtaaca acgagtgtat ggagagcgtg aagaacggca cctacgacta ccctaagtac 540

agcgaggaga gcaagctgaa ccgggagaag atcgat 576

26

193

PRT

A流感病毒

26

Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser Ser Leu Pro

1 5 10 15

Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro Lys Tyr Val

20 25 30

Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

35 40 45

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

50 55 60

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

65 70 75 80

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

85 90 95

Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr

100 105 110

Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp

115 120 125

Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln

130 135 140

Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr

145 150 155 160

His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr Tyr

165 170 175

Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Ile

180 185 190

Asp

27

576

DNA

A流感病毒

27

atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60

gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120

gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180

gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240

gtacatcttc tcgatcacgc tgttcacctt gttggtgatg ccgttgatgg cgttctgggt 300

agacttctga tcggcggcat atccagagcc ctgctcattc tggtggtggt agccgtacca 360

gccatccacc atgcctgtcc atcctccctc gatgaatccg gcgatggctc caaacagtcc 420

tctgctctgg atgctaggga tgtttctcag gccggtcacc attctcagct tggcgcttct 480

cacatacttg gggcactcgc cgatggtcac agggtgcaca ttctggaagg gcaggctgct 540

attgatggcg ccctgaggtg tctggcactt ggcatc 576

28

570

DNA

A流感病毒

28

ctgagaatgg tgaccggcct gagaaacatc cctagcatcc agagcagagg actgtttgga 60

gccatcgccg gattcatcga gggaggatgg acaggcatgg tggatggctg gtacggctac 120

caccaccaga atgagcaggg ctctggatat gccgccgatc agaagtctac ccagaacgcc 180

atcaacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaacac ccagtttacc 240

gctgtgggca aggagttcaa caagctggag cggaggatgg agaacctgaa caagaaggtg 300

gacgacggct ttctggacat ctggacctac aatgccgaac tcctggtcct cctcgagaat 360

gagaggaccc tggacttcca cgacagcaac gtgaagaacc tgtatgagaa ggtgaagagc 420

cagctgaaga acaacgccaa ggagatcggc aacggctgct tcgagttcta ccacaagtgt 480

aacaacgagt gtatggagag cgtgaagaac ggcacctacg actaccctaa gtacagcgag 540

gagagcaagc tgaaccggga gaagatcgat 570

29

194

PRT

A流感病毒

29

Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln Arg Glu Thr Ser

1 5 10 15

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

20 25 30

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

35 40 45

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

50 55 60

Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn

65 70 75 80

Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg

85 90 95

Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp

100 105 110

Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu

115 120 125

Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser

130 135 140

Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe

145 150 155 160

Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr

165 170 175

Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys

180 185 190

Ile Asp

30

570

DNA

A流感病毒

30

atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60

gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120

gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180

gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240

gtaggtccag atgtccagaa agccgtcgtc caccttcttg ttcaggttct ccatcctccg 300

ctccagcttg ttgaactcct tgcccacagc ggtaaactgg gtgttcatct tctcgatcac 360

gctgttcacc ttgttggtga tgccgttgat ggcgttctgg gtagacttct gatcggcggc 420

atatccagag ccctgctcat tctggtggtg gtagccgtac cagccatcca ccatgcctgt 480

ccatcctccc tcgatgaatc cggcgatggc tccaaacagt cctctgctct ggatgctagg 540

gatgtttctc aggccggtca ccattctcag 570

31

468

DNA

A流感病毒

31

ctgagaatgg tgaccggcct gagaaacatc cctagcatcc agagcagagg actgtttgga 60

gccatcgccg gattcatcga gggaggatgg acaggcatgg tggatggctg gtacggctac 120

caccaccaga atgagcaggg ctctggatat gccgccgatc agaagtctac ccagaacgcc 180

atcaacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgtacaa tgccgaactc 240

ctggtcctcc tcgagaatga gaggaccctg gacttccacg acagcaacgt gaagaacctg 300

tatgagaagg tgaagagcca gctgaagaac aacgccaagg agatcggcaa cggctgcttc 360

gagttctacc acaagtgtaa caacgagtgt atggagagcg tgaagaacgg cacctacgac 420

taccctaagt acagcgagga gagcaagctg aaccgggaga agatcgat 468

32

157

PRT

A流感病毒

32

Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln Arg Glu Thr Arg

1 5 10 15

Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly

20 25 30

Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser

35 40 45

Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly Ile

50 55 60

Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu

65 70 75 80

Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser

85 90 95

Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn

100 105 110

Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asn

115 120 125

Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys

130 135 140

Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Ile Asp

145 150 155

33

468

DNA

A流感病毒

33

atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60

gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120

gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180

gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240

gtacatcttc tcgatcacgc tgttcacctt gttggtgatg ccgttgatgg cgttctgggt 300

agacttctga tcggcggcat atccagagcc ctgctcattc tggtggtggt agccgtacca 360

gccatccacc atgcctgtcc atcctccctc gatgaatccg gcgatggctc caaacagtcc 420

tctgctctgg atgctaggga tgtttctcag gccggtcacc attctcag 468

34

147

DNA

A流感病毒

34

atgaaggcta ttttggtcgt gctcctgtac acctttgcca cagccaatgc cgataccctt 60

tgtattggct accatgcaaa caactctacc gatacggtcg acacggtgct cgaaaagaat 120

gttactgtca cccactctgt gaacttg 147

35

49

PRT

A流感病毒

35

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu

36

147

DNA

A流感病毒

36

caagttcaca gagtgggtga cagtaacatt cttttcgagc accgtgtcga ccgtatcggt 60

agagttgttt gcatggtagc caatacaaag ggtatcggca ttggctgtgg caaaggtgta 120

caggagcacg accaaaatag ccttcat 147

37

672

DNA

A流感病毒

37

acatgtcaga caccgaaggg cgccatcaac acgagcttgc cctttcagaa tatacatcca 60

atcacaatcg gaaaatgccc caagtacgtg aaaagcacta aactgagact cgccaccgga 120

ctcaggaata tcccaagcat ccagtcacgg ggtctgttcg gcgctatcgc cggatttatt 180

gaaggcggct ggacggggat ggtggacggt tggtacggct accatcatca aaatgagcag 240

ggctccggat acgccgctga cctgaaatct acgcagaatg ccatagatga gatcacaaac 300

aaggtcaata gtgtgataga aaaaatgaat actcagttca cagctgttgg aaaggagttt 360

aaccacctcg agaagcgaat tgagaacctg aacaagaagg tggacgatgg ctttttggat 420

atctggacgt ataacgctga gctgcttgtt ctgctggaga acgaaagaac ccttgactac 480

cacgattcca acgtgaagaa tctgtatgag aaagtgcgaa gccagttgaa aaacaacgca 540

aaagaaatag gcaacggctg tttcgagttc taccacaaat gcgataacac ctgcatggag 600

agtgtgaaga acggaacgta cgattatcca aaatactccg aggaggccaa actcaatagg 660

gaggagatag ac 672

38

224

PRT

A流感病毒

38

Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn Thr Ser Leu Pro Phe Gln

1 5 10 15

Asn Ile His Pro Ile Thr Ile Gly Lys Cys Pro Lys Tyr Val Lys Ser

20 25 30

Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln

35 40 45

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

50 55 60

Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln

65 70 75 80

Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp

85 90 95

Glu Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln

100 105 110

Phe Thr Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu

115 120 125

Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr

130 135 140

Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr

145 150 155 160

His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu

165 170 175

Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His

180 185 190

Lys Cys Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp

195 200 205

Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp

210 215 220

39

672

DNA

A流感病毒

39

gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60

gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120

gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180

gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240

atacgtccag atatccaaaa agccatcgtc caccttcttg ttcaggttct caattcgctt 300

ctcgaggtgg ttaaactcct ttccaacagc tgtgaactga gtattcattt tttctatcac 360

actattgacc ttgtttgtga tctcatctat ggcattctgc gtagatttca ggtcagcggc 420

gtatccggag ccctgctcat tttgatgatg gtagccgtac caaccgtcca ccatccccgt 480

ccagccgcct tcaataaatc cggcgatagc gccgaacaga ccccgtgact ggatgcttgg 540

gatattcctg agtccggtgg cgagtctcag tttagtgctt ttcacgtact tggggcattt 600

tccgattgtg attggatgta tattctgaaa gggcaagctc gtgttgatgg cgcccttcgg 660

tgtctgacat gt 672

40

573

DNA

A流感病毒

40

acatgtcaga caccgaaggg cgccatcaac acgagcttgc cctttcagaa tatacatcca 60

atcacaatcg gaaaatgccc caagtacgtg aaaagcacta aactgagact cgccaccgga 120

ctcaggaata tcccaagcat ccagtcacgg ggtctgttcg gcgctatcgc cggatttatt 180

gaaggcggct ggacggggat ggtggacggt tggtacggct accatcatca aaatgagcag 240

ggctccggat acgccgctga cctgaaatct acgcagaatg ccatagatga gatcacaaac 300

aaggtcaata gtgtgataga aaaaatgacg tataacgctg agctgcttgt tctgctggag 360

aacgaaagaa cccttgacta ccacgattcc aacgtgaaga atctgtatga gaaagtgcga 420

agccagttga aaaacaacgc aaaagaaata ggcaacggct gtttcgagtt ctaccacaaa 480

tgcgataaca cctgcatgga gagtgtgaag aacggaacgt acgattatcc aaaatactcc 540

gaggaggcca aactcaatag ggaggagata gac 573

41

191

PRT

A流感病毒

41

Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn Thr Ser Leu Pro Phe Gln

1 5 10 15

Asn Ile His Pro Ile Thr Ile Gly Lys Cys Pro Lys Tyr Val Lys Ser

20 25 30

Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln

35 40 45

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

50 55 60

Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln

65 70 75 80

Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp

85 90 95

Glu Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn

100 105 110

Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His

115 120 125

Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys

130 135 140

Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys

145 150 155 160

Cys Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr

165 170 175

Pro Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp

180 185 190

42

573

DNA

A流感病毒

42

gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60

gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120

gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180

gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240

atacgtcatt ttttctatca cactattgac cttgtttgtg atctcatcta tggcattctg 300

cgtagatttc aggtcagcgg cgtatccgga gccctgctca ttttgatgat ggtagccgta 360

ccaaccgtcc accatccccg tccagccgcc ttcaataaat ccggcgatag cgccgaacag 420

accccgtgac tggatgcttg ggatattcct gagtccggtg gcgagtctca gtttagtgct 480

tttcacgtac ttggggcatt ttccgattgt gattggatgt atattctgaa agggcaagct 540

cgtgttgatg gcgcccttcg gtgtctgaca tgt 573

43

507

DNA

A流感病毒

43

ctgagactcg ccaccggact caggaatatc ccaagcatcc agtcacgggg tctgttcggc 60

gctatcgccg gatttattga aggcggctgg acggggatgg tggacggttg gtacggctac 120

catcatcaaa atgagcaggg ctccggatac gccgctgacc tgaaatctac gcagaatgcc 180

atagatgaga tcacaaacaa ggtcaatagt gtgatagaaa aaatgaatac tcagttcaca 240

gctgttggaa aggagtttaa ccacctcgag aagcgaattg agaacctgaa caagaaggtg 300

gacgatggct ttttggatat ctggacgtat aacgctgagc tgcttgttct gctggagaac 360

gaaagaaccc ttgactacca cgattccaac gtgaagaatc tgtatgagaa agtgcgaagc 420

cagttgaaaa acaacgcaaa agaaataggc aacggctgtt tcgagttcta ccacaaatgc 480

gataacacct gcatggagag tgtgaag 507

44

190

PRT

A流感病毒

44

Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln Ser Arg

1 5 10 15

Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly

20 25 30

Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser

35 40 45

Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp Glu Ile

50 55 60

Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr

65 70 75 80

Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu

85 90 95

Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala

100 105 110

Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His Asp

115 120 125

Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys Asn

130 135 140

Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys

145 150 155 160

Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro

165 170 175

Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp

180 185 190

45

507

DNA

A流感病毒

45

cttcacactc tccatgcagg tgttatcgca tttgtggtag aactcgaaac agccgttgcc 60

tatttctttt gcgttgtttt tcaactggct tcgcactttc tcatacagat tcttcacgtt 120

ggaatcgtgg tagtcaaggg ttctttcgtt ctccagcaga acaagcagct cagcgttata 180

cgtccagata tccaaaaagc catcgtccac cttcttgttc aggttctcaa ttcgcttctc 240

gaggtggtta aactcctttc caacagctgt gaactgagta ttcatttttt ctatcacact 300

attgaccttg tttgtgatct catctatggc attctgcgta gatttcaggt cagcggcgta 360

tccggagccc tgctcatttt gatgatggta gccgtaccaa ccgtccacca tccccgtcca 420

gccgccttca ataaatccgg cgatagcgcc gaacagaccc cgtgactgga tgcttgggat 480

attcctgagt ccggtggcga gtctcag 507

46

471

DNA

A流感病毒

46

ctgagactcg ccaccggact caggaatatc ccaagcatcc agtcacgggg tctgttcggc 60

gctatcgccg gatttattga aggcggctgg acggggatgg tggacggttg gtacggctac 120

catcatcaaa atgagcaggg ctccggatac gccgctgacc tgaaatctac gcagaatgcc 180

atagatgaga tcacaaacaa ggtcaatagt gtgatagaaa aaatgacgta taacgctgag 240

ctgcttgttc tgctggagaa cgaaagaacc cttgactacc acgattccaa cgtgaagaat 300

ctgtatgaga aagtgcgaag ccagttgaaa aacaacgcaa aagaaatagg caacggctgt 360

ttcgagttct accacaaatg cgataacacc tgcatggaga gtgtgaagaa cggaacgtac 420

gattatccaa aatactccga ggaggccaaa ctcaataggg aggagataga c 471

47

157

PRT

A流感病毒

47

Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln Ser Arg

1 5 10 15

Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly

20 25 30

Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser

35 40 45

Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp Glu Ile

50 55 60

Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu

65 70 75 80

Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His Asp Ser

85 90 95

Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys Asn Asn

100 105 110

Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp

115 120 125

Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys

130 135 140

Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp

145 150 155

48

471

DNA

A流感病毒

48

gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60

gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120

gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180

gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240

atacgtcatt ttttctatca cactattgac cttgtttgtg atctcatcta tggcattctg 300

cgtagatttc aggtcagcgg cgtatccgga gccctgctca ttttgatgat ggtagccgta 360

ccaaccgtcc accatccccg tccagccgcc ttcaataaat ccggcgatag cgccgaacag 420

accccgtgac tggatgcttg ggatattcct gagtccggtg gcgagtctca g 471

49

141

DNA

A流感病毒

49

atggccatca tctacctgat cctgctgttt acagctgtga gaggcgacca gatctgtatc 60

ggctaccacg ccaacaatag caccgagaag gtggacacca tcctggagag aaacgtgaca 120

gtgacccacg ccaaggacat c 141

50

47

PRT

A流感病毒

50

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile

35 40 45

51

141

DNA

A流感病毒

51

gatgtccttg gcgtgggtca ctgtcacgtt tctctccagg atggtgtcca ccttctcggt 60

gctattgttg gcgtggtagc cgatacagat ctggtcgcct ctcacagctg taaacagcag 120

gatcaggtag atgatggcca t 141

52

672

DNA

A流感病毒

52

aagtgccaga cacctctggg cgccatcaat accaccctgc ccttccacaa tgtgcaccct 60

ctgaccatcg gcgagtgccc taagtatgtg aagagcgaga agctggtgct ggccacagga 120

ctgagaaacg tgccccagat cgagagcaga ggcctgtttg gagccatcgc cggattcatc 180

gagggaggat ggcagggaat ggtcgatggc tggtacggct accaccacag caatgatcag 240

ggctctggct atgccgccga taaggagtct acccagaagg cctttgacgg catcaccaac 300

aaggtgaaca gcgtgatcga gaagatgaac acccagtttg aggctgtggg caaggagttt 360

agcaacctgg agcggagact ggagaacctg aacaagaaga tggaggacgg cttcctggat 420

gtgtggacct acaatgccga actgctggtg ctgatggaga atgagcggac cctggacttc 480

cacgacagca acgtgaagaa cctgtacgac aaagtgagga tgcagctgag ggacaacgtg 540

aaggaactgg gcaatggctg cttcgagttc taccacaagt gtgacgacga gtgtatgaac 600

tccgtgaaga acggcaccta cgactaccct aagtacgagg aggagagcaa gctgaaccgg 660

aacgagatca ag 672

53

224

PRT

A流感病毒

53

Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro Phe His

1 5 10 15

Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser

20 25 30

Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

35 40 45

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

50 55 60

Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

65 70 75 80

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

85 90 95

Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln

100 105 110

Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu

115 120 125

Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr

130 135 140

Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe

145 150 155 160

His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu

165 170 175

Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His

180 185 190

Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp

195 200 205

Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys

210 215 220

54

672

DNA

A流感病毒

54

cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60

gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120

gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180

gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240

gtaggtccac acatccagga agccgtcctc catcttcttg ttcaggttct ccagtctccg 300

ctccaggttg ctaaactcct tgcccacagc ctcaaactgg gtgttcatct tctcgatcac 360

gctgttcacc ttgttggtga tgccgtcaaa ggccttctgg gtagactcct tatcggcggc 420

atagccagag ccctgatcat tgctgtggtg gtagccgtac cagccatcga ccattccctg 480

ccatcctccc tcgatgaatc cggcgatggc tccaaacagg cctctgctct cgatctgggg 540

cacgtttctc agtcctgtgg ccagcaccag cttctcgctc ttcacatact tagggcactc 600

gccgatggtc agagggtgca cattgtggaa gggcagggtg gtattgatgg cgcccagagg 660

tgtctggcac tt 672

55

573

DNA

A流感病毒

55

aagtgccaga cacctctggg cgccatcaat accaccctgc ccttccacaa tgtgcaccct 60

ctgaccatcg gcgagtgccc taagtatgtg aagagcgaga agctggtgct ggccacagga 120

ctgagaaacg tgccccagat cgagagcaga ggcctgtttg gagccatcgc cggattcatc 180

gagggaggat ggcagggaat ggtcgatggc tggtacggct accaccacag caatgatcag 240

ggctctggct atgccgccga taaggagtct acccagaagg cctttgacgg catcaccaac 300

aaggtgaaca gcgtgatcga gaagatgacc tacaatgccg aactgctggt gctgatggag 360

aatgagcgga ccctggactt ccacgacagc aacgtgaaga acctgtacga caaagtgagg 420

atgcagctga gggacaacgt gaaggaactg ggcaatggct gcttcgagtt ctaccacaag 480

tgtgacgacg agtgtatgaa ctccgtgaag aacggcacct acgactaccc taagtacgag 540

gaggagagca agctgaaccg gaacgagatc aag 573

56

191

PRT

A流感病毒

56

Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro Phe His

1 5 10 15

Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser

20 25 30

Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

35 40 45

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

50 55 60

Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

65 70 75 80

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

85 90 95

Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn

100 105 110

Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His

115 120 125

Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg

130 135 140

Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys

145 150 155 160

Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr

165 170 175

Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys

180 185 190

57

573

DNA

A流感病毒

57

cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60

gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120

gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180

gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240

gtaggtcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 300

ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 360

ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 420

gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca gcttctcgct 480

cttcacatac ttagggcact cgccgatggt cagagggtgc acattgtgga agggcagggt 540

ggtattgatg gcgcccagag gtgtctggca ctt 573

58

570

DNA

A流感病毒

58

ctggtgctgg ccacaggact gagaaacgtg ccccagatcg agagcagagg cctgtttgga 60

gccatcgccg gattcatcga gggaggatgg cagggaatgg tcgatggctg gtacggctac 120

caccacagca atgatcaggg ctctggctat gccgccgata aggagtctac ccagaaggcc 180

tttgacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaacac ccagtttgag 240

gctgtgggca aggagtttag caacctggag cggagactgg agaacctgaa caagaagatg 300

gaggacggct tcctggatgt gtggacctac aatgccgaac tgctggtgct gatggagaat 360

gagcggaccc tggacttcca cgacagcaac gtgaagaacc tgtacgacaa agtgaggatg 420

cagctgaggg acaacgtgaa ggaactgggc aatggctgct tcgagttcta ccacaagtgt 480

gacgacgagt gtatgaactc cgtgaagaac ggcacctacg actaccctaa gtacgaggag 540

gagagcaagc tgaaccggaa cgagatcaag 570

59

190

PRT

A流感病毒

59

Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu Ser Arg

1 5 10 15

Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly

20 25 30

Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln Gly Ser

35 40 45

Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp Gly Ile

50 55 60

Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Glu

65 70 75 80

Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu

85 90 95

Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala

100 105 110

Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His Asp

115 120 125

Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg Asp

130 135 140

Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys

145 150 155 160

Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro

165 170 175

Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys

180 185 190

60

570

DNA

A流感病毒

60

cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60

gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120

gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180

gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240

gtaggtccac acatccagga agccgtcctc catcttcttg ttcaggttct ccagtctccg 300

ctccaggttg ctaaactcct tgcccacagc ctcaaactgg gtgttcatct tctcgatcac 360

gctgttcacc ttgttggtga tgccgtcaaa ggccttctgg gtagactcct tatcggcggc 420

atagccagag ccctgatcat tgctgtggtg gtagccgtac cagccatcga ccattccctg 480

ccatcctccc tcgatgaatc cggcgatggc tccaaacagg cctctgctct cgatctgggg 540

cacgtttctc agtcctgtgg ccagcaccag 570

61

471

DNA

A流感病毒

61

ctggtgctgg ccacaggact gagaaacgtg ccccagatcg agagcagagg cctgtttgga 60

gccatcgccg gattcatcga gggaggatgg cagggaatgg tcgatggctg gtacggctac 120

caccacagca atgatcaggg ctctggctat gccgccgata aggagtctac ccagaaggcc 180

tttgacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaccta caatgccgaa 240

ctgctggtgc tgatggagaa tgagcggacc ctggacttcc acgacagcaa cgtgaagaac 300

ctgtacgaca aagtgaggat gcagctgagg gacaacgtga aggaactggg caatggctgc 360

ttcgagttct accacaagtg tgacgacgag tgtatgaact ccgtgaagaa cggcacctac 420

gactacccta agtacgagga ggagagcaag ctgaaccgga acgagatcaa g 471

62

157

PRT

A流感病毒

62

Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu Ser Arg

1 5 10 15

Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly

20 25 30

Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln Gly Ser

35 40 45

Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp Gly Ile

50 55 60

Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu

65 70 75 80

Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser

85 90 95

Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg Asp Asn

100 105 110

Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp

115 120 125

Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys

130 135 140

Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys

145 150 155

63

471

DNA

A流感病毒

63

cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60

gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120

gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180

gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240

gtaggtcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 300

ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 360

ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 420

gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca g 471

64

150

DNA

A流感病毒

64

gccaccatgg aaaagatcgt gctgctgctg gccattgtga gcctggtgaa gagcgaccag 60

atctgcattg gctaccacgc caacaatagc acagagcagg tggacaccat catggaaaaa 120

aacgtgaccg tgacccacgc tcaggacatc 150

65

48

PRT

A流感病毒

65

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

66

150

DNA

A流感病毒

66

gatgtcctga gcgtgggtca cggtcacgtt tttttccatg atggtgtcca cctgctctgt 60

gctattgttg gcgtggtagc caatgcagat ctggtcgctc ttcaccaggc tcacaatggc 120

cagcagcagc acgatctttt ccatggtggc 150

67

681

DNA

A流感病毒

67

aagtgccaga cacctatggg cgccatcaac agcagcatgc ccttccacaa catccaccct 60

ctgaccatcg gcgagtgccc taagtacgtg aagagcaaca gactggtgct ggccacaggc 120

ctgagaaata gcccccagcg ggagagcaga agaaagaaga ggggcctgtt tggagccatc 180

gccggcttta ttgaaggcgg ctggcaggga atggtggatg gctggtacgg ctaccaccac 240

agcaatgagc agggctctgg atatgccgcc gacaaagagt ctacccagaa ggccatcgac 300

ggcgtcacca acaaggtgaa cagcatcatc gacaagatga acacccagtt cgaggctgtg 360

ggcagagagt tcaacaacct ggaacggcgg atcgagaacc tgaacaagaa aatggaagat 420

ggcttcctgg atgtgtggac ctacaatgcc gaactgctgg tgctgatgga aaacgagcgg 480

accctggact tccacgacag caacgtgaag aacctgtacg acaaagtgcg gctgcagctg 540

agagacaacg ccaaagagct gggcaacggc tgcttcgagt tctaccacaa gtgcgacaac 600

gagtgcatgg aaagcatccg gaacggcacc tacaactacc ctcagtacag cgaggaagcc 660

aggctgaaga gggaagagat c 681

68

227

PRT

A流感病毒

68

Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser Met Pro Phe His

1 5 10 15

Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser

20 25 30

Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu

35 40 45

Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile

50 55 60

Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His

65 70 75 80

Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln

85 90 95

Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys

100 105 110

Met Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu

115 120 125

Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp

130 135 140

Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg

145 150 155 160

Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val

165 170 175

Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe

180 185 190

Glu Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn

195 200 205

Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg

210 215 220

Glu Glu Ile

225

69

681

DNA

A流感病毒

69

gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60

ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120

cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180

gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240

ggtccacaca tccaggaagc catcttccat tttcttgttc aggttctcga tccgccgttc 300

caggttgttg aactctctgc ccacagcctc gaactgggtg ttcatcttgt cgatgatgct 360

gttcaccttg ttggtgacgc cgtcgatggc cttctgggta gactctttgt cggcggcata 420

tccagagccc tgctcattgc tgtggtggta gccgtaccag ccatccacca ttccctgcca 480

gccgccttca ataaagccgg cgatggctcc aaacaggccc ctcttctttc ttctgctctc 540

ccgctggggg ctatttctca ggcctgtggc cagcaccagt ctgttgctct tcacgtactt 600

agggcactcg ccgatggtca gagggtggat gttgtggaag ggcatgctgc tgttgatggc 660

gcccataggt gtctggcact t 681

70

582

DNA

A流感病毒

70

aagtgccaga cacctatggg cgccatcaac agcagcatgc ccttccacaa catccaccct 60

ctgaccatcg gcgagtgccc taagtacgtg aagagcaaca gactggtgct ggccacaggc 120

ctgagaaata gcccccagcg ggagagcaga agaaagaaga ggggcctgtt tggagccatc 180

gccggcttta ttgaaggcgg ctggcaggga atggtggatg gctggtacgg ctaccaccac 240

agcaatgagc agggctctgg atatgccgcc gacaaagagt ctacccagaa ggccatcgac 300

ggcgtcacca acaaggtgaa cagcatcatc gacaagatga cctacaatgc cgaactgctg 360

gtgctgatgg aaaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 420

gacaaagtgc ggctgcagct gagagacaac gccaaagagc tgggcaacgg ctgcttcgag 480

ttctaccaca agtgcgacaa cgagtgcatg gaaagcatcc ggaacggcac ctacaactac 540

cctcagtaca gcgaggaagc caggctgaag agggaagaga tc 582

71

194

PRT

A流感病毒

71

Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser Met Pro Phe His

1 5 10 15

Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser

20 25 30

Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu

35 40 45

Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile

50 55 60

Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His

65 70 75 80

Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln

85 90 95

Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys

100 105 110

Met Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr

115 120 125

Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg

130 135 140

Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu

145 150 155 160

Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly

165 170 175

Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu

180 185 190

Glu Ile

72

582

DNA

A流感病毒

72

gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60

ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120

cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180

gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240

ggtcatcttg tcgatgatgc tgttcacctt gttggtgacg ccgtcgatgg ccttctgggt 300

agactctttg tcggcggcat atccagagcc ctgctcattg ctgtggtggt agccgtacca 360

gccatccacc attccctgcc agccgccttc aataaagccg gcgatggctc caaacaggcc 420

cctcttcttt cttctgctct cccgctgggg gctatttctc aggcctgtgg ccagcaccag 480

tctgttgctc ttcacgtact tagggcactc gccgatggtc agagggtgga tgttgtggaa 540

gggcatgctg ctgttgatgg cgcccatagg tgtctggcac tt 582

73

579

DNA

A流感病毒

73

ctggtgctgg ccacaggcct gagaaatagc ccccagcggg agagcagaag aaagaagagg 60

ggcctgtttg gagccatcgc cggctttatt gaaggcggct ggcagggaat ggtggatggc 120

tggtacggct accaccacag caatgagcag ggctctggat atgccgccga caaagagtct 180

acccagaagg ccatcgacgg cgtcaccaac aaggtgaaca gcatcatcga caagatgaac 240

acccagttcg aggctgtggg cagagagttc aacaacctgg aacggcggat cgagaacctg 300

aacaagaaaa tggaagatgg cttcctggat gtgtggacct acaatgccga actgctggtg 360

ctgatggaaa acgagcggac cctggacttc cacgacagca acgtgaagaa cctgtacgac 420

aaagtgcggc tgcagctgag agacaacgcc aaagagctgg gcaacggctg cttcgagttc 480

taccacaagt gcgacaacga gtgcatggaa agcatccgga acggcaccta caactaccct 540

cagtacagcg aggaagccag gctgaagagg gaagagatc 579

74

193

PRT

A流感病毒

74

Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu Ser Arg

1 5 10 15

Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

20 25 30

Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn

35 40 45

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala

50 55 60

Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys Met Asn

65 70 75 80

Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu Arg Arg

85 90 95

Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp

100 105 110

Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu

115 120 125

Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Leu

130 135 140

Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe

145 150 155 160

Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly Thr

165 170 175

Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu Glu

180 185 190

Ile

75

579

DNA

A流感病毒

75

gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60

ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120

cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180

gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240

ggtccacaca tccaggaagc catcttccat tttcttgttc aggttctcga tccgccgttc 300

caggttgttg aactctctgc ccacagcctc gaactgggtg ttcatcttgt cgatgatgct 360

gttcaccttg ttggtgacgc cgtcgatggc cttctgggta gactctttgt cggcggcata 420

tccagagccc tgctcattgc tgtggtggta gccgtaccag ccatccacca ttccctgcca 480

gccgccttca ataaagccgg cgatggctcc aaacaggccc ctcttctttc ttctgctctc 540

ccgctggggg ctatttctca ggcctgtggc cagcaccag 579

76

480

DNA

A流感病毒

76

ctggtgctgg ccacaggcct gagaaatagc ccccagcggg agagcagaag aaagaagagg 60

ggcctgtttg gagccatcgc cggctttatt gaaggcggct ggcagggaat ggtggatggc 120

tggtacggct accaccacag caatgagcag ggctctggat atgccgccga caaagagtct 180

acccagaagg ccatcgacgg cgtcaccaac aaggtgaaca gcatcatcga caagatgacc 240

tacaatgccg aactgctggt gctgatggaa aacgagcgga ccctggactt ccacgacagc 300

aacgtgaaga acctgtacga caaagtgcgg ctgcagctga gagacaacgc caaagagctg 360

ggcaacggct gcttcgagtt ctaccacaag tgcgacaacg agtgcatgga aagcatccgg 420

aacggcacct acaactaccc tcagtacagc gaggaagcca ggctgaagag ggaagagatc 480

77

160

PRT

A流感病毒

77

Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu Ser Arg

1 5 10 15

Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

20 25 30

Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn

35 40 45

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala

50 55 60

Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys Met Thr

65 70 75 80

Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp

85 90 95

Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Leu Gln

100 105 110

Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr

115 120 125

His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly Thr Tyr

130 135 140

Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu Glu Ile

145 150 155 160

78

480

DNA

A流感病毒

78

gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60

ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120

cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180

gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240

ggtcatcttg tcgatgatgc tgttcacctt gttggtgacg ccgtcgatgg ccttctgggt 300

agactctttg tcggcggcat atccagagcc ctgctcattg ctgtggtggt agccgtacca 360

gccatccacc attccctgcc agccgccttc aataaagccg gcgatggctc caaacaggcc 420

cctcttcttt cttctgctct cccgctgggg gctatttctc aggcctgtgg ccagcaccag 480

79

645

DNA

人工序列

合成

79

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

80

215

PRT

人工序列

合成

80

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

81

645

DNA

人工序列

合成

81

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

82

645

DNA

人工序列

合成

82

atgaaggcaa tcctggtcgt cctgctgtat actttcgcta ccgctaacgc tgacaccctg 60

tgcatcggct atcacgctaa caactcaacc gacacagtgg atactgtcct ggagaagaac 120

gtgactgtca cccactctgt gaatctgggc agtggactga ggctggcaac tggactgcga 180

aacatcccac agcgggaaac cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaacga gcagggatca 300

ggctacgccg ctgacctgaa gagcacacag aatgcaatcg atgaaattac taacatggtg 360

aattccgtca tcgagaaaat gggcagcgga ggctccggaa ccgacctggc agaactgctg 420

gtgctgctgc tgaaccagtg gacactgctg taccacgata gtaacgtgaa gaatctgtat 480

gagaaagtcc gatcacagct gaagaacaat gctaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcgacaa cacctgtatg gagagcgtga aaaatggcac atacgattat 600

cccaagtatt ccgaggaagc caaactgaac agagaggaaa ttgac 645

83

215

PRT

人工序列

合成

83

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Glu Ile Asp

210 215

84

645

DNA

人工序列

合成

84

gtcaatttcc tctctgttca gtttggcttc ctcggaatac ttgggataat cgtatgtgcc 60

atttttcacg ctctccatac aggtgttgtc gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttagcattgt tcttcagctg tgatcggact ttctcataca gattcttcac 180

gttactatcg tggtacagca gtgtccactg gttcagcagc agcaccagca gttctgccag 240

gtcggttccg gagcctccgc tgcccatttt ctcgatgacg gaattcacca tgttagtaat 300

ttcatcgatt gcattctgtg tgctcttcag gtcagcggcg tagcctgatc cctgctcgtt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctctggtttc ccgctgtggg atgtttcgca gtccagttgc 480

cagcctcagt ccactgccca gattcacaga gtgggtgaca gtcacgttct tctccaggac 540

agtatccact gtgtcggttg agttgttagc gtgatagccg atgcacaggg tgtcagcgtt 600

agcggtagcg aaagtataca gcaggacgac caggattgcc ttcat 645

85

639

DNA

人工序列

合成

85

atggctatca tctacctgat cctgctgttc actgctgtgc ggggggacca gatttgcatc 60

ggctaccacg ctaataattc aactgagaag gtggatacta tcctggagcg gaacgtgacc 120

gtcacacacg ctaaagacat tggcagcgga ctggtgctgg caaccggact gaggaatgtc 180

ccacagatcg agtcccgcgg actgttcggc gctatcgcag ggtttattga aggcgggtgg 240

cagggaatga ttgatgggtg gtacggctac caccattcta acgaccaagg aagtggctac 300

gccgctgata aggagagtac tcagaaagcc ttcgatggca tcaccaacat ggtgaattca 360

gtcattgaga agatgggcag cggaggctcc ggaaccgacc tggcagaact gctggtgctg 420

ctgctgaatc agtggacact gctgtttcac gactctaacg tgaagaatct gtatgataaa 480

gtccggatgc agctgagaga caacgtgaag gagctgggga atggatgctt cgaattttac 540

cataagtgcg acgatgagtg tatgaacagt gtcaaaaatg gcacatacga ttatcccaag 600

tatgaggaag agtcaaaact gaaccgaaat gaaatcaag 639

86

213

PRT

人工序列

合成

86

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

Arg Asn Glu Ile Lys

210

87

639

DNA

人工序列

合成

87

cttgatttca tttcggttca gttttgactc ttcctcatac ttgggataat cgtatgtgcc 60

atttttgaca ctgttcatac actcatcgtc gcacttatgg taaaattcga agcatccatt 120

ccccagctcc ttcacgttgt ctctcagctg catccggact ttatcataca gattcttcac 180

gttagagtcg tgaaacagca gtgtccactg attcagcagc agcaccagca gttctgccag 240

gtcggttccg gagcctccgc tgcccatctt ctcaatgact gaattcacca tgttggtgat 300

gccatcgaag gctttctgag tactctcctt atcagcggcg tagccacttc cttggtcgtt 360

agaatggtgg tagccgtacc acccatcaat cattccctgc cacccgcctt caataaaccc 420

tgcgatagcg ccgaacagtc cgcgggactc gatctgtggg acattcctca gtccggttgc 480

cagcaccagt ccgctgccaa tgtctttagc gtgtgtgacg gtcacgttcc gctccaggat 540

agtatccacc ttctcagttg aattattagc gtggtagccg atgcaaatct ggtccccccg 600

cacagcagtg aacagcagga tcaggtagat gatagccat 639

88

651

DNA

人工序列

合成

88

atggaaaaaa tcgtgctgct gctggctatc gtgtccctgg tgaagtccga ccagatctgt 60

attgggtatc atgctaacaa ctccacagaa caggtggata ctatcatgga gaagaacgtg 120

accgtcacac acgctcagga cattggatgg ggactggtcc tggcaaccgg actgagaaat 180

tcaccacaga gggaaagccg gagaaagaaa cgcggactgt tcggcgctat cgcagggttt 240

attgagggcg ggtggcaggg aatggtggat gggtggtacg gctaccacca ttccaacgaa 300

cagggatctg gctacgccgc tgataaggag tctactcaga aagctatcga cggcgtgacc 360

aacatggtca atagtatcat tgataagatg ggctctggag gcagtggaac cgacctggca 420

gagctgctgg tgctgctgct gaaccagtgg acactgctgt tccacgactc taacgtgaag 480

aatctgtatg ataaagtccg actgcagctg cgggacaacg ccaaggaact ggggaatgga 540

tgcttcgagt tctaccataa gtgcgataac gaatgtatgg agagcatccg aaacggcaca 600

tacaattatc cccagtattc cgaggaagct aggctgaaac gcgaggaaat t 651

89

217

PRT

人工序列

合成

89

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

Glu Ala Arg Leu Lys Arg Glu Glu Ile

210 215

90

651

DNA

人工序列

合成

90

aatttcctcg cgtttcagcc tagcttcctc ggaatactgg ggataattgt atgtgccgtt 60

tcggatgctc tccatacatt cgttatcgca cttatggtag aactcgaagc atccattccc 120

cagttccttg gcgttgtccc gcagctgcag tcggacttta tcatacagat tcttcacgtt 180

agagtcgtgg aacagcagtg tccactggtt cagcagcagc accagcagct ctgccaggtc 240

ggttccactg cctccagagc ccatcttatc aatgatacta ttgaccatgt tggtcacgcc 300

gtcgatagct ttctgagtag actccttatc agcggcgtag ccagatccct gttcgttgga 360

atggtggtag ccgtaccacc catccaccat tccctgccac ccgccctcaa taaaccctgc 420

gatagcgccg aacagtccgc gtttctttct ccggctttcc ctctgtggtg aatttctcag 480

tccggttgcc aggaccagtc cccatccaat gtcctgagcg tgtgtgacgg tcacgttctt 540

ctccatgata gtatccacct gttctgtgga gttgttagca tgatacccaa tacagatctg 600

gtcggacttc accagggaca cgatagccag cagcagcacg attttttcca t 651

91

645

DNA

人工序列

合成

91

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

92

215

PRT

人工序列

合成

92

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

93

645

DNA

人工序列

合成

93

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

94

645

DNA

人工序列

合成

94

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccattct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgatgc tgaaccagtt cactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

95

215

PRT

人工序列

合成

95

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

96

645

DNA

人工序列

合成

96

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtgaactg gttcagcatc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccagaat 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

97

645

DNA

人工序列

合成

97

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcaatgga acaggcggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

98

215

PRT

人工序列

合成

98

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

99

645

DNA

人工序列

合成

99

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctccg cctgttccat tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

100

645

DNA

人工序列

合成

100

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

101

215

PRT

人工序列

合成

101

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

102

645

DNA

人工序列

合成

102

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

103

645

DNA

人工序列

合成

103

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggcaacggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

104

215

PRT

人工序列

合成

104

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

105

645

DNA

人工序列

合成

105

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg ttgcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

106

1149

DNA

人工序列

合成

106

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

107

383

PRT

人工序列

合成

107

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

108

1149

DNA

人工序列

合成

108

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

109

1149

DNA

人工序列

合成

109

atgaaggcaa tcctggtcgt cctgctgtat actttcgcta ccgctaacgc tgacaccctg 60

tgcatcggct atcacgctaa caactcaacc gacacagtgg atactgtcct ggagaagaac 120

gtgactgtca cccactctgt gaatctgggc agtggactga ggctggcaac tggactgcga 180

aacatcccac agcgggaaac cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaacga gcagggatca 300

ggctacgccg ctgacctgaa gagcacacag aatgcaatcg atgaaattac taacatggtg 360

aattccgtca tcgagaaaat gggcagcgga ggctccggaa ccgacctggc agaactgctg 420

gtgctgctgc tgaaccagtg gacactgctg taccacgata gtaacgtgaa gaatctgtat 480

gagaaagtcc gatcacagct gaagaacaat gctaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcgacaa cacctgtatg gagagcgtga aaaatggcac atacgattat 600

cccaagtatt ccgaggaagc caaactgaac agagaggaaa ttgactctgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtcaacaag gagatgcaga gctccaatct gtacatgtcc 720

atgtctagtt ggtgttatac ccactctctg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaacga gaacaatgtg 840

cccgtccagc tgacatcaat cagcgcccct gaacataagt tcgagggcct gactcagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagtaaaga tcatgctacc ttcaattttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gagccggaaa 1140

agtgggtca 1149

110

383

PRT

人工序列

合成

110

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

111

1149

DNA

人工序列

合成

111

tgacccactt ttccggctct tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaaattgaag gtagcatgat ctttactctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgagtca ggccctcgaa cttatgttca ggggcgctga ttgatgtcag 300

ctggacgggc acattgttct cgttcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agagagtggg tataacacca 420

actagacatg gacatgtaca gattggagct ctgcatctcc ttgttgacct gttcgttcag 480

cagcttgatg atgtcgcccc cagagtcaat ttcctctctg ttcagtttgg cttcctcgga 540

atacttggga taatcgtatg tgccattttt cacgctctcc atacaggtgt tgtcgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttagca ttgttcttca gctgtgatcg 660

gactttctca tacagattct tcacgttact atcgtggtac agcagtgtcc actggttcag 720

cagcagcacc agcagttctg ccaggtcggt tccggagcct ccgctgccca ttttctcgat 780

gacggaattc accatgttag taatttcatc gattgcattc tgtgtgctct tcaggtcagc 840

ggcgtagcct gatccctgct cgttctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctctgg tttcccgctg 960

tgggatgttt cgcagtccag ttgccagcct cagtccactg cccagattca cagagtgggt 1020

gacagtcacg ttcttctcca ggacagtatc cactgtgtcg gttgagttgt tagcgtgata 1080

gccgatgcac agggtgtcag cgttagcggt agcgaaagta tacagcagga cgaccaggat 1140

tgccttcat 1149

112

1143

DNA

人工序列

合成

112

atggctatca tctacctgat cctgctgttc actgctgtgc ggggggacca gatttgcatc 60

ggctaccacg ctaataattc aactgagaag gtggatacta tcctggagcg gaacgtgacc 120

gtcacacacg ctaaagacat tggcagcgga ctggtgctgg caaccggact gaggaatgtc 180

ccacagatcg agtcccgcgg actgttcggc gctatcgcag ggtttattga aggcgggtgg 240

cagggaatga ttgatgggtg gtacggctac caccattcta acgaccaagg aagtggctac 300

gccgctgata aggagagtac tcagaaagcc ttcgatggca tcaccaacat ggtgaattca 360

gtcattgaga agatgggcag cggaggctcc ggaaccgacc tggcagaact gctggtgctg 420

ctgctgaatc agtggacact gctgtttcac gactctaacg tgaagaatct gtatgataaa 480

gtccggatgc agctgagaga caacgtgaag gagctgggga atggatgctt cgaattttac 540

cataagtgcg acgatgagtg tatgaacagt gtcaaaaatg gcacatacga ttatcccaag 600

tatgaggaag agtcaaaact gaaccgaaat gaaatcaaga gcgggggcga catcatcaag 660

ctgctgaacg agcaagtgaa taaggaaatg cagagctcca acctgtacat gtccatgtct 720

agttggtgtt atactcactc tctggatggc gccgggctgt tcctgtttga ccacgcagcc 780

gaagagtacg agcatgctaa gaaactgatc attttcctga acgaaaacaa cgtgcccgtc 840

cagctgacat caatcagcgc acctgagcat aagttcgaag gcctgactca gatctttcag 900

aaagcttacg agcacgaaca gcatatttcc gagtctatca acaatattgt ggaccacgcc 960

atcaagagca aagatcatgc taccttcaac tttctgcagt ggtacgtggc cgagcagcac 1020

gaagaggaag tcctgtttaa ggacatcctg gataaaatcg agctgattgg aaacgaaaat 1080

catggcctgt acctggcaga ccagtatgtg aagggcattg ccaagtccag aaaaagtggg 1140

tca 1143

113

381

PRT

人工序列

合成

113

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu

210 215 220

Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser

225 230 235 240

Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe

245 250 255

Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe

260 265 270

Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro

275 280 285

Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu

290 295 300

His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala

305 310 315 320

Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val

325 330 335

Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys

340 345 350

Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln

355 360 365

Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

114

1143

DNA

人工序列

合成

114

tgacccactt tttctggact tggcaatgcc cttcacatac tggtctgcca ggtacaggcc 60

atgattttcg tttccaatca gctcgatttt atccaggatg tccttaaaca ggacttcctc 120

ttcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

gatggcgtgg tccacaatat tgttgataga ctcggaaata tgctgttcgt gctcgtaagc 240

tttctgaaag atctgagtca ggccttcgaa cttatgctca ggtgcgctga ttgatgtcag 300

ctggacgggc acgttgtttt cgttcaggaa aatgatcagt ttcttagcat gctcgtactc 360

ttcggctgcg tggtcaaaca ggaacagccc ggcgccatcc agagagtgag tataacacca 420

actagacatg gacatgtaca ggttggagct ctgcatttcc ttattcactt gctcgttcag 480

cagcttgatg atgtcgcccc cgctcttgat ttcatttcgg ttcagttttg actcttcctc 540

atacttggga taatcgtatg tgccattttt gacactgttc atacactcat cgtcgcactt 600

atggtaaaat tcgaagcatc cattccccag ctccttcacg ttgtctctca gctgcatccg 660

gactttatca tacagattct tcacgttaga gtcgtgaaac agcagtgtcc actgattcag 720

cagcagcacc agcagttctg ccaggtcggt tccggagcct ccgctgccca tcttctcaat 780

gactgaattc accatgttgg tgatgccatc gaaggctttc tgagtactct ccttatcagc 840

ggcgtagcca cttccttggt cgttagaatg gtggtagccg taccacccat caatcattcc 900

ctgccacccg ccttcaataa accctgcgat agcgccgaac agtccgcggg actcgatctg 960

tgggacattc ctcagtccgg ttgccagcac cagtccgctg ccaatgtctt tagcgtgtgt 1020

gacggtcacg ttccgctcca ggatagtatc caccttctca gttgaattat tagcgtggta 1080

gccgatgcaa atctggtccc cccgcacagc agtgaacagc aggatcaggt agatgatagc 1140

cat 1143

115

1158

DNA

人工序列

合成

115

atggaaaaaa tcgtgctgct gctggctatc gtgtccctgg tgaagtccga ccagatctgt 60

attgggtatc atgctaacaa ctccacagaa caggtggata ctatcatgga gaagaacgtg 120

accgtcacac acgctcagga cattggatgg ggactggtcc tggcaaccgg actgagaaat 180

tcaccacaga gggaaagccg gagaaagaaa cgcggactgt tcggcgctat cgcagggttt 240

attgagggcg ggtggcaggg aatggtggat gggtggtacg gctaccacca ttccaacgaa 300

cagggatctg gctacgccgc tgataaggag tctactcaga aagctatcga cggcgtgacc 360

aacatggtca atagtatcat tgataagatg ggctctggag gcagtggaac cgacctggca 420

gagctgctgg tgctgctgct gaaccagtgg acactgctgt tccacgactc taacgtgaag 480

aatctgtatg ataaagtccg actgcagctg cgggacaacg ccaaggaact ggggaatgga 540

tgcttcgagt tctaccataa gtgcgataac gaatgtatgg agagcatccg aaacggcaca 600

tacaattatc cccagtattc cgaggaagct aggctgaaac gcgaggaaat tagctccggg 660

ggagacatca ttaagctgct gaacgaacag gtgaacaagg agatgcagtc tagtaacctg 720

tacatgagta tgtcaagctg gtgttatact cactcactgg atggcgccgg gctgttcctg 780

tttgaccacg cagccgagga atacgaacat gctaagaaac tgatcatttt cctgaatgag 840

aacaatgtgc ccgtccagct gacatccatc tctgcacctg aacataagtt cgagggcctg 900

actcagatct ttcagaaagc ctacgaacac gagcagcata ttagtgagtc aatcaacaat 960

attgtggacc acgccatcaa gagcaaagat catgctacct tcaattttct gcagtggtac 1020

gtggccgagc agcacgagga agaggtcctg tttaaggaca tcctggataa aatcgaactg 1080

attggaaacg agaatcatgg cctgtacctg gcagaccagt atgtgaaggg cattgccaag 1140

tccaggaaaa gcgggtcc 1158

116

386

PRT

人工序列

合成

116

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile

210 215 220

Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu

225 230 235 240

Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala

245 250 255

Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys

260 265 270

Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr

275 280 285

Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe

290 295 300

Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn

305 310 315 320

Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe

325 330 335

Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys

340 345 350

Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu

355 360 365

Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser

370 375 380

Gly Ser

385

117

1158

DNA

人工序列

合成

117

ggacccgctt ttcctggact tggcaatgcc cttcacatac tggtctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcttc 120

ctcgtgctgc tcggccacgt accactgcag aaaattgaag gtagcatgat ctttgctctt 180

gatggcgtgg tccacaatat tgttgattga ctcactaata tgctgctcgt gttcgtaggc 240

tttctgaaag atctgagtca ggccctcgaa cttatgttca ggtgcagaga tggatgtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttcttagcat gttcgtattc 360

ctcggctgcg tggtcaaaca ggaacagccc ggcgccatcc agtgagtgag tataacacca 420

gcttgacata ctcatgtaca ggttactaga ctgcatctcc ttgttcacct gttcgttcag 480

cagcttaatg atgtctcccc cggagctaat ttcctcgcgt ttcagcctag cttcctcgga 540

atactgggga taattgtatg tgccgtttcg gatgctctcc atacattcgt tatcgcactt 600

atggtagaac tcgaagcatc cattccccag ttccttggcg ttgtcccgca gctgcagtcg 660

gactttatca tacagattct tcacgttaga gtcgtggaac agcagtgtcc actggttcag 720

cagcagcacc agcagctctg ccaggtcggt tccactgcct ccagagccca tcttatcaat 780

gatactattg accatgttgg tcacgccgtc gatagctttc tgagtagact ccttatcagc 840

ggcgtagcca gatccctgtt cgttggaatg gtggtagccg taccacccat ccaccattcc 900

ctgccacccg ccctcaataa accctgcgat agcgccgaac agtccgcgtt tctttctccg 960

gctttccctc tgtggtgaat ttctcagtcc ggttgccagg accagtcccc atccaatgtc 1020

ctgagcgtgt gtgacggtca cgttcttctc catgatagta tccacctgtt ctgtggagtt 1080

gttagcatga tacccaatac agatctggtc ggacttcacc agggacacga tagccagcag 1140

cagcacgatt ttttccat 1158

118

1149

DNA

人工序列

合成

118

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

119

383

PRT

人工序列

合成

119

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

120

1149

DNA

人工序列

合成

120

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

121

1149

DNA

人工序列

合成

121

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccattct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgatgc tgaaccagtt cactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

122

383

PRT

人工序列

合成

122

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

123

1149

DNA

人工序列

合成

123

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtga actggttcag 720

catcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca gaatggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

124

1149

DNA

人工序列

合成

124

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcaatgga acaggcggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

125

383

PRT

人工序列

合成

125

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

126

1149

DNA

人工序列

合成

126

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtcagc tccgcctgtt ccattgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

127

1149

DNA

人工序列

合成

127

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

128

383

PRT

人工序列

合成

128

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

129

1149

DNA

人工序列

合成

129

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

130

1149

DNA

人工序列

合成

130

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggcaacggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

131

383

PRT

人工序列

合成

131

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

132

1149

DNA

人工序列

合成

132

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtctgt tccgttgcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

133

33

PRT

A流感病毒

133

Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg

1 5 10 15

Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile

20 25 30

Trp

134

12

PRT

A流感病毒

134

Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn

1 5 10

135

12

PRT

A流感病毒

135

Asn Lys Leu Glu Arg Arg Met Glu Asn Leu Asn Lys

1 5 10

136

11

PRT

A流感病毒

136

Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp

1 5 10

137

33

PRT

A流感病毒

137

Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys

1 5 10 15

Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile

20 25 30

Trp

138

11

PRT

A流感病毒

138

Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe

1 5 10

139

11

PRT

A流感病毒

139

Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu

1 5 10

140

13

PRT

A流感病毒

140

Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp

1 5 10

141

33

PRT

A流感病毒

141

Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg

1 5 10 15

Arg Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val

20 25 30

Trp

142

11

PRT

A流感病毒

142

Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe

1 5 10

143

12

PRT

A流感病毒

143

Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu Asn

1 5 10

144

12

PRT

A流感病毒

144

Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp

1 5 10

145

33

PRT

A流感病毒

145

Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu Arg

1 5 10 15

Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val

20 25 30

Trp

146

11

PRT

A流感病毒

146

Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe

1 5 10

147

12

PRT

A流感病毒

147

Phe Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn

1 5 10

148

12

PRT

A流感病毒

148

Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp

1 5 10

149

53

PRT

A流感病毒

149

Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr Ala Val

1 5 10 15

Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg Met Glu Asn Leu Asn Lys

20 25 30

Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu

35 40 45

Leu Val Leu Leu Glu

50

150

20

PRT

A流感病毒

150

Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu

1 5 10 15

Val Leu Leu Glu

20

151

53

PRT

A流感病毒

151

Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr Ala Val

1 5 10 15

Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys

20 25 30

Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu

35 40 45

Leu Val Leu Leu Glu

50

152

20

PRT

A流感病毒

152

Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu

1 5 10 15

Val Leu Leu Glu

20

153

53

PRT

A流感病毒

153

Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Glu Ala Val

1 5 10 15

Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu Asn Lys

20 25 30

Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu

35 40 45

Leu Val Leu Met Glu

50

154

20

PRT

A流感病毒

154

Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu

1 5 10 15

Val Leu Met Glu

20

155

53

PRT

A流感病毒

155

Lys Val Asn Ser Ile Ile Asp Lys Met Asn Thr Gln Phe Glu Ala Val

1 5 10 15

Gly Arg Glu Phe Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys

20 25 30

Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu

35 40 45

Leu Val Leu Met Glu

50

156

20

PRT

A流感病毒

156

Lys Val Asn Ser Ile Ile Asp Lys Met Thr Tyr Asn Ala Glu Leu Leu

1 5 10 15

Val Leu Met Glu

20

157

645

DNA

人工序列

合成

157

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

158

215

PRT

人工序列

合成

158

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

159

645

DNA

人工序列

合成

159

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

160

1149

DNA

人工序列

合成

160

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

161

383

PRT

人工序列

合成

161

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

162

1149

DNA

人工序列

合成

162

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcat 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

163

645

DNA

人工序列

合成

163

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatcgtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

164

215

PRT

人工序列

合成

164

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

165

645

DNA

人工序列

合成

165

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacga tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

166

1149

DNA

人工序列

合成

166

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatcgtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

167

383

PRT

人工序列

合成

167

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

168

1149

DNA

人工序列

合成

168

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttgat 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc acgatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

169

645

DNA

人工序列

合成

169

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

170

215

PRT

人工序列

合成

170

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

171

645

DNA

人工序列

合成

171

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

172

1149

DNA

人工序列

合成

172

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

173

383

PRT

人工序列

合成

173

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

174

1149

DNA

人工序列

合成

174

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttgat 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accaggttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

175

645

DNA

人工序列

合成

175

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

176

215

PRT

人工序列

合成

176

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

177

645

DNA

人工序列

合成

177

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

178

1149

DNA

人工序列

合成

178

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

179

383

PRT

人工序列

合成

179

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

180

1149

DNA

人工序列

合成

180

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accaggttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

181

645

DNA

人工序列

合成

181

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

182

215

PRT

人工序列

合成

182

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

183

645

DNA

人工序列

合成

183

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360

gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

184

1149

DNA

人工序列

合成

184

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgaaca gcaccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

185

383

PRT

人工序列

合成

185

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

186

1149

DNA

人工序列

合成

186

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggtgct gttcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctggg tattgttatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

187

645

DNA

人工序列

合成

187

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

188

215

PRT

人工序列

合成

188

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

189

645

DNA

人工序列

合成

189

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360

gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

190

1149

DNA

人工序列

合成

190

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgaaca gcaccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtcaacc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

191

383

PRT

人工序列

合成

191

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

192

1149

DNA

人工序列

合成

192

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

gttgacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggtgct gttcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctggg tattgttatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

193

465

DNA

Aquifex aeolicus

193

atgcaaattt acgaagggaa actaaccgct gaagggctga ggttcggtat agtggcttcc 60

aggttcaacc acgcactcgt ggatagacta gttgagggag ctatagactg catagtaaga 120

cacgggggaa gggaagaaga cataacgctc gttagagtgc cgggctcctg ggaaattccc 180

gtggctgcgg gagagcttgc gagaaaagag gacatagacg ctgtgatagc gataggagtt 240

ctaataaggg gggctactcc ccactttgat tacatagcct ctgaagtgtc aaaagggctt 300

gcgaaccttt ccttagaact gagaaaaccc ataaccttcg gtgttataac tgcggacacc 360

ttggagcagg cgatagaaag ggcgggaaca aagcacggga ataagggctg ggaagctgca 420

ctttccgcaa tagaaatggc aaacttattt aagagtctga gatga 465

194

154

PRT

Aquifex aeolicus

194

Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly

1 5 10 15

Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg Leu Val Glu

20 25 30

Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile

35 40 45

Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val Ala Ala Gly

50 55 60

Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val

65 70 75 80

Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala Ser Glu Val

85 90 95

Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr

100 105 110

Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Arg Ala

115 120 125

Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Ser Ala Ile

130 135 140

Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

145 150

195

465

DNA

Aquifex aeolicus

195

tcatctcaga ctcttaaata agtttgccat ttctattgcg gaaagtgcag cttcccagcc 60

cttattcccg tgctttgttc ccgccctttc tatcgcctgc tccaaggtgt ccgcagttat 120

aacaccgaag gttatgggtt ttctcagttc taaggaaagg ttcgcaagcc cttttgacac 180

ttcagaggct atgtaatcaa agtggggagt agcccccctt attagaactc ctatcgctat 240

cacagcgtct atgtcctctt ttctcgcaag ctctcccgca gccacgggaa tttcccagga 300

gcccggcact ctaacgagcg ttatgtcttc ttcccttccc ccgtgtctta ctatgcagtc 360

tatagctccc tcaactagtc tatccacgag tgcgtggttg aacctggaag ccactatacc 420

gaacctcagc ccttcagcgg ttagtttccc ttcgtaaatt tgcat 465

196

642

DNA

人工序列

合成

196

atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60

tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120

gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180

aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240

ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300

ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360

aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420

gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480

gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540

ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600

cccaagtaca gcgaggaaag caagctgaac cgcgagggag gc 642

197

214

PRT

人工序列

合成

197

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Gly

210

198

642

DNA

人工序列

合成

198

gcctccctcg cggttcagct tgctttcctc gctgtacttg gggtagtcgt aggtgccgtt 60

cttcacgctt tccatgcact cgttgttgca cttgtggtag aactcgaagc agccgttgcc 120

gatctctttg gcgttgttct tcagctggga cttcactttc tcgtacaggt tcttcacgtt 180

gctgtcgtgg aagtccaggg tccgctcgtt cagcagcagg accagcagtt cagccagatc 240

ggtgccgctg ccgccggagc ccatcttctc gatcacgctg ttcaccatgt tggtgatgcc 300

gttgatggcg ttctgggtgg acttctggtc ggcggcgtag ccgctgccct gctcgttctg 360

gtggtggtag ccgtaccacc cgtccaccat gccggtccag ccgccctcga taaagccggc 420

aatggcgccg aacaggcccc gtgtctctct ctgggggatg ttccgcaggc ctgtcaccat 480

ccgcaggccg ctgcccaggt tcacgctgtg ggtcacggtc acgttctttt ccagcacggt 540

atccacggtg tcggtgctgt tgttggcgtg gtagccgatg cagatggtgt cggcgtaggt 600

ggcggtgaag gtgcacagga gcaccagcag cttggccttc at 642

199

1104

DNA

人工序列

合成

199

atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60

tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120

gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180

aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240

ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300

ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360

aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420

gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480

gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540

ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600

cccaagtaca gcgaggaaag caagctgaac cgcgagggag gcatgcaaat ctacgagggc 660

aagctgacag ccgagggcct gagattcggc atcgtggcca gccggttcaa ccacgccctg 720

gtggacagac tggtggaagg cgccatcgac tgcatcgtgc ggcacggcgg cagagaagag 780

gacatcaccc tggtccgcgt gcccggcagc tgggaaattc ctgtggctgc cggcgagctg 840

gcccggaaag aggatatcga cgccgtcatc gccatcggcg tgctgatcag aggcgccacc 900

ccccacttcg actatatcgc cagcgaggtg tccaagggcc tggccaacct gagcctggaa 960

ctgcggaagc ccatcacctt cggagtgatc accgccgaca ccctggaaca ggccatcgag 1020

agagccggca ccaagcacgg caacaaggga tgggaagccg ccctgagcgc catcgagatg 1080

gccaatctgt tcaagagcct gcgc 1104

200

368

PRT

人工序列

合成

200

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala

210 215 220

Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu

225 230 235 240

Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly

245 250 255

Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu

260 265 270

Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala

275 280 285

Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp

290 295 300

Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu

305 310 315 320

Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu

325 330 335

Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu

340 345 350

Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360 365

201

1104

DNA

人工序列

合成

201

gcgcaggctc ttgaacagat tggccatctc gatggcgctc agggcggctt cccatccctt 60

gttgccgtgc ttggtgccgg ctctctcgat ggcctgttcc agggtgtcgg cggtgatcac 120

tccgaaggtg atgggcttcc gcagttccag gctcaggttg gccaggccct tggacacctc 180

gctggcgata tagtcgaagt ggggggtggc gcctctgatc agcacgccga tggcgatgac 240

ggcgtcgata tcctctttcc gggccagctc gccggcagcc acaggaattt cccagctgcc 300

gggcacgcgg accagggtga tgtcctcttc tctgccgccg tgccgcacga tgcagtcgat 360

ggcgccttcc accagtctgt ccaccagggc gtggttgaac cggctggcca cgatgccgaa 420

tctcaggccc tcggctgtca gcttgccctc gtagatttgc atgcctccct cgcggttcag 480

cttgctttcc tcgctgtact tggggtagtc gtaggtgccg ttcttcacgc tttccatgca 540

ctcgttgttg cacttgtggt agaactcgaa gcagccgttg ccgatctctt tggcgttgtt 600

cttcagctgg gacttcactt tctcgtacag gttcttcacg ttgctgtcgt ggaagtccag 660

ggtccgctcg ttcagcagca ggaccagcag ttcagccaga tcggtgccgc tgccgccgga 720

gcccatcttc tcgatcacgc tgttcaccat gttggtgatg ccgttgatgg cgttctgggt 780

ggacttctgg tcggcggcgt agccgctgcc ctgctcgttc tggtggtggt agccgtacca 840

cccgtccacc atgccggtcc agccgccctc gataaagccg gcaatggcgc cgaacaggcc 900

ccgtgtctct ctctggggga tgttccgcag gcctgtcacc atccgcaggc cgctgcccag 960

gttcacgctg tgggtcacgg tcacgttctt ttccagcacg gtatccacgg tgtcggtgct 1020

gttgttggcg tggtagccga tgcagatggt gtcggcgtag gtggcggtga aggtgcacag 1080

gagcaccagc agcttggcct tcat 1104

202

645

DNA

人工序列

合成

202

atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60

tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120

gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180

aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240

ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300

ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360

aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420

gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480

gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540

ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600

cccaagtaca gcgaggaaag caagctgaac cgcgagggaa gcggc 645

203

215

PRT

人工序列

合成

203

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Ser Gly

210 215

204

645

DNA

人工序列

合成

204

gccgcttccc tcgcggttca gcttgctttc ctcgctgtac ttggggtagt cgtaggtgcc 60

gttcttcacg ctttccatgc actcgttgtt gcacttgtgg tagaactcga agcagccgtt 120

gccgatctct ttggcgttgt tcttcagctg ggacttcact ttctcgtaca ggttcttcac 180

gttgctgtcg tggaagtcca gggtccgctc gttcagcagc aggaccagca gttcagccag 240

atcggtgccg ctgccgccgg agcccatctt ctcgatcacg ctgttcacca tgttggtgat 300

gccgttgatg gcgttctggg tggacttctg gtcggcggcg tagccgctgc cctgctcgtt 360

ctggtggtgg tagccgtacc acccgtccac catgccggtc cagccgccct cgataaagcc 420

ggcaatggcg ccgaacaggc cccgtgtctc tctctggggg atgttccgca ggcctgtcac 480

catccgcagg ccgctgccca ggttcacgct gtgggtcacg gtcacgttct tttccagcac 540

ggtatccacg gtgtcggtgc tgttgttggc gtggtagccg atgcagatgg tgtcggcgta 600

ggtggcggtg aaggtgcaca ggagcaccag cagcttggcc ttcat 645

205

1107

DNA

人工序列

合成

205

atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60

tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120

gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180

aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240

ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300

ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360

aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420

gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480

gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540

ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600

cccaagtaca gcgaggaaag caagctgaac cgcgagggaa gcggcatgca aatctacgag 660

ggcaagctga cagccgaggg cctgagattc ggcatcgtgg ccagccggtt caaccacgcc 720

ctggtggaca gactggtgga aggcgccatc gactgcatcg tgcggcacgg cggcagagaa 780

gaggacatca ccctggtccg cgtgcccggc agctgggaaa ttcctgtggc tgccggcgag 840

ctggcccgga aagaggatat cgacgccgtc atcgccatcg gcgtgctgat cagaggcgcc 900

accccccact tcgactatat cgccagcgag gtgtccaagg gcctggccaa cctgagcctg 960

gaactgcgga agcccatcac cttcggagtg atcaccgccg acaccctgga acaggccatc 1020

gagagagccg gcaccaagca cggcaacaag ggatgggaag ccgccctgag cgccatcgag 1080

atggccaatc tgttcaagag cctgcgc 1107

206

369

PRT

人工序列

合成

206

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Ser Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr

210 215 220

Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala

225 230 235 240

Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His

245 250 255

Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp

260 265 270

Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp

275 280 285

Ala Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe

290 295 300

Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu

305 310 315 320

Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu

325 330 335

Glu Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp

340 345 350

Glu Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu

355 360 365

Arg

207

1107

DNA

人工序列

合成

207

gcgcaggctc ttgaacagat tggccatctc gatggcgctc agggcggctt cccatccctt 60

gttgccgtgc ttggtgccgg ctctctcgat ggcctgttcc agggtgtcgg cggtgatcac 120

tccgaaggtg atgggcttcc gcagttccag gctcaggttg gccaggccct tggacacctc 180

gctggcgata tagtcgaagt ggggggtggc gcctctgatc agcacgccga tggcgatgac 240

ggcgtcgata tcctctttcc gggccagctc gccggcagcc acaggaattt cccagctgcc 300

gggcacgcgg accagggtga tgtcctcttc tctgccgccg tgccgcacga tgcagtcgat 360

ggcgccttcc accagtctgt ccaccagggc gtggttgaac cggctggcca cgatgccgaa 420

tctcaggccc tcggctgtca gcttgccctc gtagatttgc atgccgcttc cctcgcggtt 480

cagcttgctt tcctcgctgt acttggggta gtcgtaggtg ccgttcttca cgctttccat 540

gcactcgttg ttgcacttgt ggtagaactc gaagcagccg ttgccgatct ctttggcgtt 600

gttcttcagc tgggacttca ctttctcgta caggttcttc acgttgctgt cgtggaagtc 660

cagggtccgc tcgttcagca gcaggaccag cagttcagcc agatcggtgc cgctgccgcc 720

ggagcccatc ttctcgatca cgctgttcac catgttggtg atgccgttga tggcgttctg 780

ggtggacttc tggtcggcgg cgtagccgct gccctgctcg ttctggtggt ggtagccgta 840

ccacccgtcc accatgccgg tccagccgcc ctcgataaag ccggcaatgg cgccgaacag 900

gccccgtgtc tctctctggg ggatgttccg caggcctgtc accatccgca ggccgctgcc 960

caggttcacg ctgtgggtca cggtcacgtt cttttccagc acggtatcca cggtgtcggt 1020

gctgttgttg gcgtggtagc cgatgcagat ggtgtcggcg taggtggcgg tgaaggtgca 1080

caggagcacc agcagcttgg ccttcat 1107

208

645

DNA

人工序列

合成

208

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccaa gcatccagag cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

209

215

PRT

人工序列

合成

209

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

210

645

DNA

人工序列

合成

210

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctctgctctg gatgcttggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

211

1149

DNA

人工序列

合成

211

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccaa gcatccagag cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

212

383

PRT

人工序列

合成

212

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

213

1149

DNA

人工序列

合成

213

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctctgc tctggatgct 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

214

215

PRT

人工序列

合成

214

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

215

383

PRT

人工序列

合成

215

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

216

645

DNA

人工序列

合成

216

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa catacaacgc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

217

215

PRT

人工序列

合成

217

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

218

645

DNA

人工序列

合成

218

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagcgtt 240

gtatgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

219

1149

DNA

人工序列

合成

219

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa catacaacgc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

220

383

PRT

人工序列

合成

220

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

221

1151

DNA

人工序列

合成

221

rctgacccac tttttctgga cttggcaatg cccttcacat actgatctgc caggtacagg 60

ccatgattct cgtttccaat cagttcgatt ttatccagga tgtccttaaa caggacctcc 120

tcctcgtgct gctcggccac gtaccactgc agaaagttga aggtagcatg atctttgctc 180

ttaatggcgt ggtccacaat attgttgata gattcggaaa tatgctgctc gtgttcgtaa 240

gctttctgaa agatctgggt caggccctcg aacttatgtt caggggcgct gattgaagtc 300

agctggacgg gcacattgtt ctcattcagg aaaatgatca gtttctttgc atgttcgtat 360

tcctcggctg cgtgatcaaa caggaacagc ccagcgccgt ccagtgagtg tgtataacac 420

caactagaca tactcatgta caggttggag ctctgcatct ccttgttcac ctgttcgttc 480

agcagcttga tgatgtcgcc cccactgtca attttctctc gattcagctt actctcttca 540

gaatatttgg gatagtcgta agtgccgttc ttcacagact ccatacattc attgttgcac 600

ttatggtaaa actcgaagca tccattcccg atttctttgg cattgttctt cagctgggat 660

ttgaccttct catacagatt cttcacgttg ctatcgtgga aatccagagt ccgctcgttc 720

agcagcagca ccagcagctc agcgttgtat gttccggagc ctccgctgcc cattttttcg 780

atgacagaat tcaccatgtt agtaatgcca ttgattgcgt tctgtgtaga cttctgatca 840

gcggcgtagc cgctgccctg ctcattctga tggtggtagc cgtaccaccc gtccaccatt 900

cctgtccacc cgccctcaat aaaccctgcg atagcgccga acagtcctct tgtttcccgc 960

tgtgggatgt tgcgcagtcc ggtgaccatc ctcagtccgc tgcccagatt cactgagtgg 1020

gtgacagtca cgttcttctc caggacggta tccactgtgt cggtggagtt gtttgcgtga 1080

tagccgatgc agatagtgtc agcgtaggtt gcggtaaaag tacacagcag gaccagcagt 1140

ttggccttca t 1151

222

215

PRT

人工序列

合成

222

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

223

383

PRT

人工序列

合成

223

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

224

215

PRT

人工序列

合成

224

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

225

383

PRT

人工序列

合成

225

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

226

215

PRT

人工序列

合成

226

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

227

383

PRT

人工序列

合成

227

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

228

215

PRT

人工序列

合成

228

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

229

383

PRT

人工序列

合成

229

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

230

215

PRT

人工序列

合成

230

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

231

383

PRT

人工序列

合成

231

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

232

215

PRT

人工序列

合成

232

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

233

383

PRT

人工序列

合成

233

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

234

645

DNA

人工序列

合成

234

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca ccaactcaac taatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645

235

215

PRT

人工序列

合成

235

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

236

645

DNA

人工序列

合成

236

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattagttga gttggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

237

1149

DNA

人工序列

合成

237

atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60

tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120

gtgactgtca ccaactcaac taatctgggc agcggactga ggatggtcac cggactgcgc 180

aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240

gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300

ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360

aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420

gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480

gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540

ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600

cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660

atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720

atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780

gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840

cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900

tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960

cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020

cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080

gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140

agtgggtca 1149

238

383

PRT

人工序列

合成

238

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

239

1149

DNA

人工序列

合成

239

tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60

atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120

ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180

aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240

tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300

ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360

ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420

actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480

cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540

atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600

atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660

gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720

cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780

gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840

ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900

tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960

tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattag ttgagttggt 1020

gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080

gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140

ggccttcat 1149

240

215

PRT

人工序列

合成

240

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

241

383

PRT

人工序列

合成

241

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

242

215

PRT

人工序列

合成

242

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

243

383

PRT

人工序列

合成

243

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

244

215

PRT

人工序列

合成

244

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

245

383

PRT

人工序列

合成

245

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

246

215

PRT

人工序列

合成

246

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

247

383

PRT

人工序列

合成

247

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

248

215

PRT

人工序列

合成

248

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

249

383

PRT

人工序列

合成

249

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

250

215

PRT

人工序列

合成

250

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

251

383

PRT

人工序列

合成

251

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

252

214

PRT

人工序列

合成

252

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Gly

210

253

368

PRT

人工序列

合成

253

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala

210 215 220

Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu

225 230 235 240

Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly

245 250 255

Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu

260 265 270

Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala

275 280 285

Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp

290 295 300

Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu

305 310 315 320

Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu

325 330 335

Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu

340 345 350

Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360 365

254

215

PRT

人工序列

合成

254

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Ser Gly

210 215

255

369

PRT

人工序列

合成

255

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Ser Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr

210 215 220

Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala

225 230 235 240

Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His

245 250 255

Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp

260 265 270

Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp

275 280 285

Ala Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe

290 295 300

Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu

305 310 315 320

Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu

325 330 335

Glu Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp

340 345 350

Glu Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu

355 360 365

Arg

256

211

PRT

人工序列

合成

256

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Gly Gly

210

257

364

PRT

人工序列

合成

257

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Gly Gly Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg

210 215 220

Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg Leu

225 230 235 240

Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu Glu

245 250 255

Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val Ala

260 265 270

Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile

275 280 285

Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala Ser

290 295 300

Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys Pro

305 310 315 320

Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu

325 330 335

Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Ser

340 345 350

Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360

258

212

PRT

人工序列

合成

258

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Gly Ser Gly

210

259

365

PRT

人工序列

合成

259

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser

50 55 60

Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Gly Ser Gly Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu

210 215 220

Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg

225 230 235 240

Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu

245 250 255

Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val

260 265 270

Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala

275 280 285

Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala

290 295 300

Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys

305 310 315 320

Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile

325 330 335

Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu

340 345 350

Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360 365

260

645

DNA

人工序列

合成

CDS

(1)..(645)

260

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

261

215

PRT

人工序列

合成構建體

261

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

262

645

DNA

人工序列

合成

262

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

263

1155

DNA

人工序列

合成

CDS

(1)..(1155)

263

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

264

383

PRT

人工序列

合成構建體

264

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

265

1155

DNA

人工序列

合成

265

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

266

5579

DNA

人工序列

合成

266

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

267

645

DNA

人工序列

合成

CDS

(1)..(645)

267

atg aag gca atc ctg gtc gtc ctg ctg tat act ttc gct acc gct aac 48

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

gct gac acc ctg tgc atc ggc tat cac gct aac aac tca acc gac aca 96

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat act gtc ctg gag aag aac gtg act gtc acc cac tct gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agt gga ctg agg ctg gca act gga ctg cga aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa acc aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aac 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag gga tca ggc tac gcc gct gac ctg aag agc aca cag aat gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

atc gat gaa att act aac atg gtg aat tcc gtc atc gag aaa atg ggc 384

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg aca ctg ctg tac cac gat agt aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aaa gtc cga tca cag ctg aag aac aat gct aaa gaa atc ggg aat 528

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc gac aac acc tgt atg gag agc 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

gtg aaa aat ggc aca tac gat tat ccc aag tat tcc gag gaa gcc aaa 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

ctg aac aga gag gaa att gac 645

Leu Asn Arg Glu Glu Ile Asp

210 215

268

215

PRT

人工序列

合成構建體

268

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Glu Ile Asp

210 215

269

645

DNA

人工序列

合成

269

gtcaatttcc tctctgttca gtttggcttc ctcggaatac ttgggataat cgtatgtgcc 60

atttttcacg ctctccatac aggtgttgtc gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttagcattgt tcttcagctg tgatcggact ttctcataca gattcttcac 180

gttactatcg tggtacagca gtgtccactg gttcagcagc agcaccagca gttctgccag 240

gtcggttccg gagcctccgc tgcccatttt ctcgatgacg gaattcacca tgttagtaat 300

ttcatcgatt gcattctgtg tgctcttcag gtcagcggcg tagcctgatc cctgctcgtt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctctggtttc ccgctgtggg atgtttcgca gtccagttgc 480

cagcctcagt ccactgccca gattcacaga gtgggtgaca gtcacgttct tctccaggac 540

agtatccact gtgtcggttg agttgttagc gtgatagccg atgcacaggg tgtcagcgtt 600

agcggtagcg aaagtataca gcaggacgac caggattgcc ttcat 645

270

1155

DNA

人工序列

合成

CDS

(1)..(1155)

270

atg aag gca atc ctg gtc gtc ctg ctg tat act ttc gct acc gct aac 48

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

gct gac acc ctg tgc atc ggc tat cac gct aac aac tca acc gac aca 96

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat act gtc ctg gag aag aac gtg act gtc acc cac tct gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agt gga ctg agg ctg gca act gga ctg cga aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa acc aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aac 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag gga tca ggc tac gcc gct gac ctg aag agc aca cag aat gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

atc gat gaa att act aac atg gtg aat tcc gtc atc gag aaa atg ggc 384

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg aca ctg ctg tac cac gat agt aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aaa gtc cga tca cag ctg aag aac aat gct aaa gaa atc ggg aat 528

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc gac aac acc tgt atg gag agc 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

gtg aaa aat ggc aca tac gat tat ccc aag tat tcc gag gaa gcc aaa 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

ctg aac aga gag gaa att gac tct ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Glu Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtc aac aag gag atg cag agc tcc aat ctg tac atg tcc 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat acc cac tct ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aac gag aac aat gtg ccc gtc cag ctg aca tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg act cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agt aaa gat cat gct acc ttc aat ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag agc cgg aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

271

383

PRT

人工序列

合成構建體

271

Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn

1 5 10 15

Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys

195 200 205

Leu Asn Arg Glu Glu Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

272

1155

DNA

人工序列

合成

272

tcatcatgac ccacttttcc ggctcttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaaa ttgaaggtag catgatcttt 180

actcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gagtcaggcc ctcgaactta tgttcagggg cgctgattga 300

tgtcagctgg acgggcacat tgttctcgtt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagag agtgggtata 420

acaccaacta gacatggaca tgtacagatt ggagctctgc atctccttgt tgacctgttc 480

gttcagcagc ttgatgatgt cgcccccaga gtcaatttcc tctctgttca gtttggcttc 540

ctcggaatac ttgggataat cgtatgtgcc atttttcacg ctctccatac aggtgttgtc 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttagcattgt tcttcagctg 660

tgatcggact ttctcataca gattcttcac gttactatcg tggtacagca gtgtccactg 720

gttcagcagc agcaccagca gttctgccag gtcggttccg gagcctccgc tgcccatttt 780

ctcgatgacg gaattcacca tgttagtaat ttcatcgatt gcattctgtg tgctcttcag 840

gtcagcggcg tagcctgatc cctgctcgtt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctctggtttc 960

ccgctgtggg atgtttcgca gtccagttgc cagcctcagt ccactgccca gattcacaga 1020

gtgggtgaca gtcacgttct tctccaggac agtatccact gtgtcggttg agttgttagc 1080

gtgatagccg atgcacaggg tgtcagcgtt agcggtagcg aaagtataca gcaggacgac 1140

caggattgcc ttcat 1155

273

5579

DNA

人工序列

合成

273

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggca atcctggtcg tcctgctgta tactttcgct accgctaacg 1440

ctgacaccct gtgcatcggc tatcacgcta acaactcaac cgacacagtg gatactgtcc 1500

tggagaagaa cgtgactgtc acccactctg tgaatctggg cagtggactg aggctggcaa 1560

ctggactgcg aaacatccca cagcgggaaa ccagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaacg 1680

agcagggatc aggctacgcc gctgacctga agagcacaca gaatgcaatc gatgaaatta 1740

ctaacatggt gaattccgtc atcgagaaaa tgggcagcgg aggctccgga accgacctgg 1800

cagaactgct ggtgctgctg ctgaaccagt ggacactgct gtaccacgat agtaacgtga 1860

agaatctgta tgagaaagtc cgatcacagc tgaagaacaa tgctaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcgaca acacctgtat ggagagcgtg aaaaatggca 1980

catacgatta tcccaagtat tccgaggaag ccaaactgaa cagagaggaa attgactctg 2040

ggggcgacat catcaagctg ctgaacgaac aggtcaacaa ggagatgcag agctccaatc 2100

tgtacatgtc catgtctagt tggtgttata cccactctct ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaacg 2220

agaacaatgt gcccgtccag ctgacatcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgactcagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagtaaag atcatgctac cttcaatttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agagccggaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

274

639

DNA

人工序列

合成

CDS

(1)..(639)

274

atg gct atc atc tac ctg atc ctg ctg ttc act gct gtg cgg ggg gac 48

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

cag att tgc atc ggc tac cac gct aat aat tca act gag aag gtg gat 96

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

act atc ctg gag cgg aac gtg acc gtc aca cac gct aaa gac att ggc 144

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

agc gga ctg gtg ctg gca acc gga ctg agg aat gtc cca cag atc gag 192

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

tcc cgc gga ctg ttc ggc gct atc gca ggg ttt att gaa ggc ggg tgg 240

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

cag gga atg att gat ggg tgg tac ggc tac cac cat tct aac gac caa 288

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

gga agt ggc tac gcc gct gat aag gag agt act cag aaa gcc ttc gat 336

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

ggc atc acc aac atg gtg aat tca gtc att gag aag atg ggc agc gga 384

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg aat cag 432

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

tgg aca ctg ctg ttt cac gac tct aac gtg aag aat ctg tat gat aaa 480

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

gtc cgg atg cag ctg aga gac aac gtg aag gag ctg ggg aat gga tgc 528

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

ttc gaa ttt tac cat aag tgc gac gat gag tgt atg aac agt gtc aaa 576

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

aat ggc aca tac gat tat ccc aag tat gag gaa gag tca aaa ctg aac 624

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

cga aat gaa atc aag 639

Arg Asn Glu Ile Lys

210

275

213

PRT

人工序列

合成構建體

275

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

Arg Asn Glu Ile Lys

210

276

639

DNA

人工序列

合成

276

cttgatttca tttcggttca gttttgactc ttcctcatac ttgggataat cgtatgtgcc 60

atttttgaca ctgttcatac actcatcgtc gcacttatgg taaaattcga agcatccatt 120

ccccagctcc ttcacgttgt ctctcagctg catccggact ttatcataca gattcttcac 180

gttagagtcg tgaaacagca gtgtccactg attcagcagc agcaccagca gttctgccag 240

gtcggttccg gagcctccgc tgcccatctt ctcaatgact gaattcacca tgttggtgat 300

gccatcgaag gctttctgag tactctcctt atcagcggcg tagccacttc cttggtcgtt 360

agaatggtgg tagccgtacc acccatcaat cattccctgc cacccgcctt caataaaccc 420

tgcgatagcg ccgaacagtc cgcgggactc gatctgtggg acattcctca gtccggttgc 480

cagcaccagt ccgctgccaa tgtctttagc gtgtgtgacg gtcacgttcc gctccaggat 540

agtatccacc ttctcagttg aattattagc gtggtagccg atgcaaatct ggtccccccg 600

cacagcagtg aacagcagga tcaggtagat gatagccat 639

277

1149

DNA

人工序列

合成

CDS

(1)..(1149)

277

atg gct atc atc tac ctg atc ctg ctg ttc act gct gtg cgg ggg gac 48

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

cag att tgc atc ggc tac cac gct aat aat tca act gag aag gtg gat 96

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

act atc ctg gag cgg aac gtg acc gtc aca cac gct aaa gac att ggc 144

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

agc gga ctg gtg ctg gca acc gga ctg agg aat gtc cca cag atc gag 192

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

tcc cgc gga ctg ttc ggc gct atc gca ggg ttt att gaa ggc ggg tgg 240

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

cag gga atg att gat ggg tgg tac ggc tac cac cat tct aac gac caa 288

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

gga agt ggc tac gcc gct gat aag gag agt act cag aaa gcc ttc gat 336

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

ggc atc acc aac atg gtg aat tca gtc att gag aag atg ggc agc gga 384

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg aat cag 432

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

tgg aca ctg ctg ttt cac gac tct aac gtg aag aat ctg tat gat aaa 480

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

gtc cgg atg cag ctg aga gac aac gtg aag gag ctg ggg aat gga tgc 528

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

ttc gaa ttt tac cat aag tgc gac gat gag tgt atg aac agt gtc aaa 576

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

aat ggc aca tac gat tat ccc aag tat gag gaa gag tca aaa ctg aac 624

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

cga aat gaa atc aag agc ggg ggc gac atc atc aag ctg ctg aac gag 672

Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu

210 215 220

caa gtg aat aag gaa atg cag agc tcc aac ctg tac atg tcc atg tct 720

Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser

225 230 235 240

agt tgg tgt tat act cac tct ctg gat ggc gcc ggg ctg ttc ctg ttt 768

Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe

245 250 255

gac cac gca gcc gaa gag tac gag cat gct aag aaa ctg atc att ttc 816

Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe

260 265 270

ctg aac gaa aac aac gtg ccc gtc cag ctg aca tca atc agc gca cct 864

Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro

275 280 285

gag cat aag ttc gaa ggc ctg act cag atc ttt cag aaa gct tac gag 912

Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu

290 295 300

cac gaa cag cat att tcc gag tct atc aac aat att gtg gac cac gcc 960

His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala

305 310 315 320

atc aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg tac gtg 1008

Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val

325 330 335

gcc gag cag cac gaa gag gaa gtc ctg ttt aag gac atc ctg gat aaa 1056

Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys

340 345 350

atc gag ctg att gga aac gaa aat cat ggc ctg tac ctg gca gac cag 1104

Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln

355 360 365

tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga tga 1149

Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

278

381

PRT

人工序列

合成構建體

278

Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp

1 5 10 15

Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp

20 25 30

Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly

35 40 45

Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu

50 55 60

Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp

65 70 75 80

Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln

85 90 95

Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp

100 105 110

Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly

115 120 125

Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln

130 135 140

Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys

145 150 155 160

Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys

165 170 175

Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys

180 185 190

Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn

195 200 205

Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu

210 215 220

Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser

225 230 235 240

Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe

245 250 255

Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe

260 265 270

Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro

275 280 285

Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu

290 295 300

His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala

305 310 315 320

Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val

325 330 335

Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys

340 345 350

Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln

355 360 365

Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

279

1149

DNA

人工序列

合成

279

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactggt ctgccaggta 60

caggccatga ttttcgtttc caatcagctc gattttatcc aggatgtcct taaacaggac 120

ttcctcttcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttgatg gcgtggtcca caatattgtt gatagactcg gaaatatgct gttcgtgctc 240

gtaagctttc tgaaagatct gagtcaggcc ttcgaactta tgctcaggtg cgctgattga 300

tgtcagctgg acgggcacgt tgttttcgtt caggaaaatg atcagtttct tagcatgctc 360

gtactcttcg gctgcgtggt caaacaggaa cagcccggcg ccatccagag agtgagtata 420

acaccaacta gacatggaca tgtacaggtt ggagctctgc atttccttat tcacttgctc 480

gttcagcagc ttgatgatgt cgcccccgct cttgatttca tttcggttca gttttgactc 540

ttcctcatac ttgggataat cgtatgtgcc atttttgaca ctgttcatac actcatcgtc 600

gcacttatgg taaaattcga agcatccatt ccccagctcc ttcacgttgt ctctcagctg 660

catccggact ttatcataca gattcttcac gttagagtcg tgaaacagca gtgtccactg 720

attcagcagc agcaccagca gttctgccag gtcggttccg gagcctccgc tgcccatctt 780

ctcaatgact gaattcacca tgttggtgat gccatcgaag gctttctgag tactctcctt 840

atcagcggcg tagccacttc cttggtcgtt agaatggtgg tagccgtacc acccatcaat 900

cattccctgc cacccgcctt caataaaccc tgcgatagcg ccgaacagtc cgcgggactc 960

gatctgtggg acattcctca gtccggttgc cagcaccagt ccgctgccaa tgtctttagc 1020

gtgtgtgacg gtcacgttcc gctccaggat agtatccacc ttctcagttg aattattagc 1080

gtggtagccg atgcaaatct ggtccccccg cacagcagtg aacagcagga tcaggtagat 1140

gatagccat 1149

280

5573

DNA

人工序列

合成

280

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catggctatc atctacctga tcctgctgtt cactgctgtg cggggggacc 1440

agatttgcat cggctaccac gctaataatt caactgagaa ggtggatact atcctggagc 1500

ggaacgtgac cgtcacacac gctaaagaca ttggcagcgg actggtgctg gcaaccggac 1560

tgaggaatgt cccacagatc gagtcccgcg gactgttcgg cgctatcgca gggtttattg 1620

aaggcgggtg gcagggaatg attgatgggt ggtacggcta ccaccattct aacgaccaag 1680

gaagtggcta cgccgctgat aaggagagta ctcagaaagc cttcgatggc atcaccaaca 1740

tggtgaattc agtcattgag aagatgggca gcggaggctc cggaaccgac ctggcagaac 1800

tgctggtgct gctgctgaat cagtggacac tgctgtttca cgactctaac gtgaagaatc 1860

tgtatgataa agtccggatg cagctgagag acaacgtgaa ggagctgggg aatggatgct 1920

tcgaatttta ccataagtgc gacgatgagt gtatgaacag tgtcaaaaat ggcacatacg 1980

attatcccaa gtatgaggaa gagtcaaaac tgaaccgaaa tgaaatcaag agcgggggcg 2040

acatcatcaa gctgctgaac gagcaagtga ataaggaaat gcagagctcc aacctgtaca 2100

tgtccatgtc tagttggtgt tatactcact ctctggatgg cgccgggctg ttcctgtttg 2160

accacgcagc cgaagagtac gagcatgcta agaaactgat cattttcctg aacgaaaaca 2220

acgtgcccgt ccagctgaca tcaatcagcg cacctgagca taagttcgaa ggcctgactc 2280

agatctttca gaaagcttac gagcacgaac agcatatttc cgagtctatc aacaatattg 2340

tggaccacgc catcaagagc aaagatcatg ctaccttcaa ctttctgcag tggtacgtgg 2400

ccgagcagca cgaagaggaa gtcctgttta aggacatcct ggataaaatc gagctgattg 2460

gaaacgaaaa tcatggcctg tacctggcag accagtatgt gaagggcatt gccaagtcca 2520

gaaaaagtgg gtcatgatga acacgtggga tccagatctg ctgtgccttc tagttgccag 2580

ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact 2640

gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt 2700

ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat 2760

gctggggatg cggtgggctc tatgggtacc caggtgctga agaattgacc cggttcctcc 2820

tgggccagaa agaagcaggc acatcccctt ctctgtgaca caccctgtcc acgcccctgg 2880

ttcttagttc cagccccact cataggacac tcatagctca ggagggctcc gccttcaatc 2940

ccacccgcta aagtacttgg agcggtctct ccctccctca tcagcccacc aaaccaaacc 3000

tagcctccaa gagtgggaag aaattaaagc aagataggct attaagtgca gagggagaga 3060

aaatgcctcc aacatgtgag gaagtaatga gagaaatcat agaattttaa ggccatgatt 3120

taaggccatc atggccttaa tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 3180

ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 3240

caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 3300

aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 3360

atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 3420

cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 3480

ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 3540

gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 3600

accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 3660

cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 3720

cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct 3780

gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 3840

aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 3900

aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 3960

actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 4020

taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 4080

gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 4140

tagttgcctg actcgggggg ggggggcgct gaggtctgcc tcgtgaagaa ggtgttgctg 4200

actcatacca ggcctgaatc gccccatcat ccagccagaa agtgagggag ccacggttga 4260

tgagagcttt gttgtaggtg gaccagttgg tgattttgaa cttttgcttt gccacggaac 4320

ggtctgcgtt gtcgggaaga tgcgtgatct gatccttcaa ctcagcaaaa gttcgattta 4380

ttcaacaaag ccgccgtccc gtcaagtcag cgtaatgctc tgccagtgtt acaaccaatt 4440

aaccaattct gattagaaaa actcatcgag catcaaatga aactgcaatt tattcatatc 4500

aggattatca ataccatatt tttgaaaaag ccgtttctgt aatgaaggag aaaactcacc 4560

gaggcagttc cataggatgg caagatcctg gtatcggtct gcgattccga ctcgtccaac 4620

atcaatacaa cctattaatt tcccctcgtc aaaaataagg ttatcaagtg agaaatcacc 4680

atgagtgacg actgaatccg gtgagaatgg caaaagctta tgcatttctt tccagacttg 4740

ttcaacaggc cagccattac gctcgtcatc aaaatcactc gcatcaacca aaccgttatt 4800

cattcgtgat tgcgcctgag cgagacgaaa tacgcgatcg ctgttaaaag gacaattaca 4860

aacaggaatc gaatgcaacc ggcgcaggaa cactgccagc gcatcaacaa tattttcacc 4920

tgaatcagga tattcttcta atacctggaa tgctgttttc ccggggatcg cagtggtgag 4980

taaccatgca tcatcaggag tacggataaa atgcttgatg gtcggaagag gcataaattc 5040

cgtcagccag tttagtctga ccatctcatc tgtaacatca ttggcaacgc tacctttgcc 5100

atgtttcaga aacaactctg gcgcatcggg cttcccatac aatcgataga ttgtcgcacc 5160

tgattgcccg acattatcgc gagcccattt atacccatat aaatcagcat ccatgttgga 5220

atttaatcgc ggcctcgagc aagacgtttc ccgttgaata tggctcataa caccccttgt 5280

attactgttt atgtaagcag acagttttat tgttcatgat gatatatttt tatcttgtgc 5340

aatgtaacat cagagatttt gagacacaac gtggctttcc cccccccccc attattgaag 5400

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 5460

acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 5520

tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtc 5573

281

654

DNA

人工序列

合成

CDS

(1)..(654)

281

atg gaa aaa atc gtg ctg ctg ctg gct atc gtg tcc ctg gtg aag tcc 48

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

gac cag atc tgt att ggg tat cat gct aac aac tcc aca gaa cag gtg 96

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

gat act atc atg gag aag aac gtg acc gtc aca cac gct cag gac att 144

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

gga tgg gga ctg gtc ctg gca acc gga ctg aga aat tca cca cag agg 192

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

gaa agc cgg aga aag aaa cgc gga ctg ttc ggc gct atc gca ggg ttt 240

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

att gag ggc ggg tgg cag gga atg gtg gat ggg tgg tac ggc tac cac 288

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

cat tcc aac gaa cag gga tct ggc tac gcc gct gat aag gag tct act 336

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

cag aaa gct atc gac ggc gtg acc aac atg gtc aat agt atc att gat 384

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

aag atg ggc tct gga ggc agt gga acc gac ctg gca gag ctg ctg gtg 432

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

ctg ctg ctg aac cag tgg aca ctg ctg ttc cac gac tct aac gtg aag 480

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

aat ctg tat gat aaa gtc cga ctg cag ctg cgg gac aac gcc aag gaa 528

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

ctg ggg aat gga tgc ttc gag ttc tac cat aag tgc gat aac gaa tgt 576

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

atg gag agc atc cga aac ggc aca tac aat tat ccc cag tat tcc gag 624

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

gaa gct agg ctg aaa cgc gag gaa att agc 654

Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser

210 215

282

218

PRT

人工序列

合成構建體

282

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser

210 215

283

654

DNA

人工序列

合成

283

gctaatttcc tcgcgtttca gcctagcttc ctcggaatac tggggataat tgtatgtgcc 60

gtttcggatg ctctccatac attcgttatc gcacttatgg tagaactcga agcatccatt 120

ccccagttcc ttggcgttgt cccgcagctg cagtcggact ttatcataca gattcttcac 180

gttagagtcg tggaacagca gtgtccactg gttcagcagc agcaccagca gctctgccag 240

gtcggttcca ctgcctccag agcccatctt atcaatgata ctattgacca tgttggtcac 300

gccgtcgata gctttctgag tagactcctt atcagcggcg tagccagatc cctgttcgtt 360

ggaatggtgg tagccgtacc acccatccac cattccctgc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc cgcgtttctt tctccggctt tccctctgtg gtgaatttct 480

cagtccggtt gccaggacca gtccccatcc aatgtcctga gcgtgtgtga cggtcacgtt 540

cttctccatg atagtatcca cctgttctgt ggagttgtta gcatgatacc caatacagat 600

ctggtcggac ttcaccaggg acacgatagc cagcagcagc acgatttttt ccat 654

284

1164

DNA

人工序列

合成

CDS

(1)..(1164)

284

atg gaa aaa atc gtg ctg ctg ctg gct atc gtg tcc ctg gtg aag tcc 48

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

gac cag atc tgt att ggg tat cat gct aac aac tcc aca gaa cag gtg 96

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

gat act atc atg gag aag aac gtg acc gtc aca cac gct cag gac att 144

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

gga tgg gga ctg gtc ctg gca acc gga ctg aga aat tca cca cag agg 192

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

gaa agc cgg aga aag aaa cgc gga ctg ttc ggc gct atc gca ggg ttt 240

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

att gag ggc ggg tgg cag gga atg gtg gat ggg tgg tac ggc tac cac 288

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

cat tcc aac gaa cag gga tct ggc tac gcc gct gat aag gag tct act 336

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

cag aaa gct atc gac ggc gtg acc aac atg gtc aat agt atc att gat 384

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

aag atg ggc tct gga ggc agt gga acc gac ctg gca gag ctg ctg gtg 432

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

ctg ctg ctg aac cag tgg aca ctg ctg ttc cac gac tct aac gtg aag 480

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

aat ctg tat gat aaa gtc cga ctg cag ctg cgg gac aac gcc aag gaa 528

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

ctg ggg aat gga tgc ttc gag ttc tac cat aag tgc gat aac gaa tgt 576

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

atg gag agc atc cga aac ggc aca tac aat tat ccc cag tat tcc gag 624

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

gaa gct agg ctg aaa cgc gag gaa att agc tcc ggg gga gac atc att 672

Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile

210 215 220

aag ctg ctg aac gaa cag gtg aac aag gag atg cag tct agt aac ctg 720

Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu

225 230 235 240

tac atg agt atg tca agc tgg tgt tat act cac tca ctg gat ggc gcc 768

Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala

245 250 255

ggg ctg ttc ctg ttt gac cac gca gcc gag gaa tac gaa cat gct aag 816

Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys

260 265 270

aaa ctg atc att ttc ctg aat gag aac aat gtg ccc gtc cag ctg aca 864

Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr

275 280 285

tcc atc tct gca cct gaa cat aag ttc gag ggc ctg act cag atc ttt 912

Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe

290 295 300

cag aaa gcc tac gaa cac gag cag cat att agt gag tca atc aac aat 960

Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn

305 310 315 320

att gtg gac cac gcc atc aag agc aaa gat cat gct acc ttc aat ttt 1008

Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe

325 330 335

ctg cag tgg tac gtg gcc gag cag cac gag gaa gag gtc ctg ttt aag 1056

Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys

340 345 350

gac atc ctg gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg 1104

Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu

355 360 365

tac ctg gca gac cag tat gtg aag ggc att gcc aag tcc agg aaa agc 1152

Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser

370 375 380

ggg tcc tga tga 1164

Gly Ser

385

285

386

PRT

人工序列

合成構建體

285

Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser

1 5 10 15

Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val

20 25 30

Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile

35 40 45

Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg

50 55 60

Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe

65 70 75 80

Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His

85 90 95

His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr

100 105 110

Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp

115 120 125

Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val

130 135 140

Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys

145 150 155 160

Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu

165 170 175

Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys

180 185 190

Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu

195 200 205

Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile

210 215 220

Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu

225 230 235 240

Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala

245 250 255

Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys

260 265 270

Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr

275 280 285

Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe

290 295 300

Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn

305 310 315 320

Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe

325 330 335

Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys

340 345 350

Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu

355 360 365

Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser

370 375 380

Gly Ser

385

286

1164

DNA

人工序列

合成

286

tcatcaggac ccgcttttcc tggacttggc aatgcccttc acatactggt ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcttcctcg tgctgctcgg ccacgtacca ctgcagaaaa ttgaaggtag catgatcttt 180

gctcttgatg gcgtggtcca caatattgtt gattgactca ctaatatgct gctcgtgttc 240

gtaggctttc tgaaagatct gagtcaggcc ctcgaactta tgttcaggtg cagagatgga 300

tgtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct tagcatgttc 360

gtattcctcg gctgcgtggt caaacaggaa cagcccggcg ccatccagtg agtgagtata 420

acaccagctt gacatactca tgtacaggtt actagactgc atctccttgt tcacctgttc 480

gttcagcagc ttaatgatgt ctcccccgga gctaatttcc tcgcgtttca gcctagcttc 540

ctcggaatac tggggataat tgtatgtgcc gtttcggatg ctctccatac attcgttatc 600

gcacttatgg tagaactcga agcatccatt ccccagttcc ttggcgttgt cccgcagctg 660

cagtcggact ttatcataca gattcttcac gttagagtcg tggaacagca gtgtccactg 720

gttcagcagc agcaccagca gctctgccag gtcggttcca ctgcctccag agcccatctt 780

atcaatgata ctattgacca tgttggtcac gccgtcgata gctttctgag tagactcctt 840

atcagcggcg tagccagatc cctgttcgtt ggaatggtgg tagccgtacc acccatccac 900

cattccctgc cacccgccct caataaaccc tgcgatagcg ccgaacagtc cgcgtttctt 960

tctccggctt tccctctgtg gtgaatttct cagtccggtt gccaggacca gtccccatcc 1020

aatgtcctga gcgtgtgtga cggtcacgtt cttctccatg atagtatcca cctgttctgt 1080

ggagttgtta gcatgatacc caatacagat ctggtcggac ttcaccaggg acacgatagc 1140

cagcagcagc acgatttttt ccat 1164

287

5588

DNA

人工序列

合成

287

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catggaaaaa atcgtgctgc tgctggctat cgtgtccctg gtgaagtccg 1440

accagatctg tattgggtat catgctaaca actccacaga acaggtggat actatcatgg 1500

agaagaacgt gaccgtcaca cacgctcagg acattggatg gggactggtc ctggcaaccg 1560

gactgagaaa ttcaccacag agggaaagcc ggagaaagaa acgcggactg ttcggcgcta 1620

tcgcagggtt tattgagggc gggtggcagg gaatggtgga tgggtggtac ggctaccacc 1680

attccaacga acagggatct ggctacgccg ctgataagga gtctactcag aaagctatcg 1740

acggcgtgac caacatggtc aatagtatca ttgataagat gggctctgga ggcagtggaa 1800

ccgacctggc agagctgctg gtgctgctgc tgaaccagtg gacactgctg ttccacgact 1860

ctaacgtgaa gaatctgtat gataaagtcc gactgcagct gcgggacaac gccaaggaac 1920

tggggaatgg atgcttcgag ttctaccata agtgcgataa cgaatgtatg gagagcatcc 1980

gaaacggcac atacaattat ccccagtatt ccgaggaagc taggctgaaa cgcgaggaaa 2040

ttagctccgg gggagacatc attaagctgc tgaacgaaca ggtgaacaag gagatgcagt 2100

ctagtaacct gtacatgagt atgtcaagct ggtgttatac tcactcactg gatggcgccg 2160

ggctgttcct gtttgaccac gcagccgagg aatacgaaca tgctaagaaa ctgatcattt 2220

tcctgaatga gaacaatgtg cccgtccagc tgacatccat ctctgcacct gaacataagt 2280

tcgagggcct gactcagatc tttcagaaag cctacgaaca cgagcagcat attagtgagt 2340

caatcaacaa tattgtggac cacgccatca agagcaaaga tcatgctacc ttcaattttc 2400

tgcagtggta cgtggccgag cagcacgagg aagaggtcct gtttaaggac atcctggata 2460

aaatcgaact gattggaaac gagaatcatg gcctgtacct ggcagaccag tatgtgaagg 2520

gcattgccaa gtccaggaaa agcgggtcct gatgaacacg tgggatccag atctgctgtg 2580

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2640

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2700

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2760

gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2820

tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2880

tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2940

gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 3000

ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3060

gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3120

tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3180

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3240

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3300

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3360

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3420

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3480

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3540

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3600

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3660

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3720

atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3780

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3840

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3900

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3960

ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 4020

tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4080

aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4140

tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4200

aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4260

gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4320

gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4380

caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4440

gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4500

caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4560

aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4620

tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4680

aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4740

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4800

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4860

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4920

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4980

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 5040

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5100

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5160

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5220

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5280

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5340

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5400

cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5460

tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5520

cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5580

ctttcgtc 5588

288

645

DNA

人工序列

合成

CDS

(1)..(645)

288

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca tac aac gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

289

215

PRT

人工序列

合成構建體

289

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

290

645

DNA

人工序列

合成

290

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagcgtt 240

gtatgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

291

1155

DNA

人工序列

合成

CDS

(1)..(1155)

291

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca tac aac gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

292

383

PRT

人工序列

合成構建體

292

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

293

1155

DNA

人工序列

合成

293

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttcagcagc agcaccagca gctcagcgtt gtatgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

294

5579

DNA

人工序列

合成

294

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acatacaacg 1800

ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

295

645

DNA

人工序列

合成

CDS

(1)..(645)

295

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

296

215

PRT

人工序列

合成構建體

296

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

297

645

DNA

人工序列

合成

297

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

298

1155

DNA

人工序列

合成

CDS

(1)..(1155)

298

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

299

383

PRT

人工序列

合成構建體

299

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

300

1155

DNA

人工序列

合成

300

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

301

5579

DNA

人工序列

合成

301

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

302

645

DNA

人工序列

合成

CDS

(1)..(645)

302

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atc gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

303

215

PRT

人工序列

合成構建體

303

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

304

645

DNA

人工序列

合成

304

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacga tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

305

1155

DNA

人工序列

合成

CDS

(1)..(1155)

305

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atc gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

306

383

PRT

人工序列

合成構建體

306

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

307

1155

DNA

人工序列

合成

307

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttgatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacga tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

308

5579

DNA

人工序列

合成

308

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatcgt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg atcaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

309

645

DNA

人工序列

合成

CDS

(1)..(645)

309

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

310

215

PRT

人工序列

合成構建體

310

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

311

645

DNA

人工序列

合成

311

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

312

1155

DNA

人工序列

合成

CDS

(1)..(1155)

312

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

313

383

PRT

人工序列

合成構建體

313

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

314

1155

DNA

人工序列

合成

314

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttgatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca ggttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

315

5579

DNA

人工序列

合成

315

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacctggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg atcaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

316

645

DNA

人工序列

合成

CDS

(1)..(645)

316

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

317

215

PRT

人工序列

合成構建體

317

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

318

645

DNA

人工序列

合成

318

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

319

1155

DNA

人工序列

合成

CDS

(1)..(1155)

319

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

320

383

PRT

人工序列

合成構建體

320

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

321

1155

DNA

人工序列

合成

321

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca ggttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

322

5579

DNA

人工序列

合成

322

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacctggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

323

645

DNA

人工序列

合成

CDS

(1)..(645)

323

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

324

215

PRT

人工序列

合成構建體

324

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

325

645

DNA

人工序列

合成

325

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcatcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

326

1155

DNA

人工序列

合成

CDS

(1)..(1155)

326

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

327

383

PRT

人工序列

合成構建體

327

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

328

1155

DNA

人工序列

合成

328

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttcatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

329

5579

DNA

人工序列

合成

329

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg atgaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

330

645

DNA

人工序列

合成

CDS

(1)..(645)

330

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac cag gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg cag 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

331

215

PRT

人工序列

合成構建體

331

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

332

645

DNA

人工序列

合成

332

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttctgcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacct ggttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

333

1155

DNA

人工序列

合成

CDS

(1)..(1155)

333

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac cag gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg cag 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

334

383

PRT

人工序列

合成構建體

334

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

335

1155

DNA

人工序列

合成

335

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttctgcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacct ggttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

336

5579

DNA

人工序列

合成

336

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaaccaggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg cagaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

337

645

DNA

人工序列

合成

CDS

(1)..(645)

337

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc aac tca act aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

338

215

PRT

人工序列

合成構建體

338

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

339

645

DNA

人工序列

合成

339

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattagttga gttggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

340

1155

DNA

人工序列

合成

CDS

(1)..(1155)

340

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc aac tca act aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

341

383

PRT

人工序列

合成構建體

341

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

342

1155

DNA

人工序列

合成

342

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720

gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattagttga 1020

gttggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

343

5579

DNA

人工序列

合成

343

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc accaactcaa ctaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

344

645

DNA

人工序列

合成

CDS

(1)..(645)

344

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc att ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg atg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

aac cag ttc act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

345

215

PRT

人工序列

合成構建體

345

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

346

645

DNA

人工序列

合成

346

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtgaactg gttcagcatc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccagaat 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

347

1155

DNA

人工序列

合成

CDS

(1)..(1155)

347

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc att ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg atg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

aac cag ttc act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

348

383

PRT

人工序列

合成構建體

348

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu

130 135 140

Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

349

1155

DNA

人工序列

合成

349

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtgaactg 720

gttcagcatc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccagaat ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

350

5579

DNA

人工序列

合成

350

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccattc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800

ctgagctgct ggtgctgatg ctgaaccagt tcactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

351

645

DNA

人工序列

合成

CDS

(1)..(645)

351

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

aat gga aca ggc gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

352

215

PRT

人工序列

合成構建體

352

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

353

645

DNA

人工序列

合成

353

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctccg cctgttccat tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

354

1155

DNA

人工序列

合成

CDS

(1)..(1155)

354

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

aat gga aca ggc gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

355

383

PRT

人工序列

合成構建體

355

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

356

1155

DNA

人工序列

合成

356

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtcagctccg cctgttccat tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

357

5579

DNA

人工序列

合成

357

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcaatgg aacaggcgga gctgacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

358

645

DNA

人工序列

合成

CDS

(1)..(645)

358

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

359

215

PRT

人工序列

合成構建體

359

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

360

645

DNA

人工序列

合成

360

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

361

1155

DNA

人工序列

合成

CDS

(1)..(1155)

361

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

362

383

PRT

人工序列

合成構建體

362

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

363

1155

DNA

人工序列

合成

363

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

364

5579

DNA

人工序列

合成

364

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

365

645

DNA

人工序列

合成

CDS

(1)..(645)

365

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc aac gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

366

215

PRT

人工序列

合成構建體

366

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

367

645

DNA

人工序列

合成

367

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg ttgcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

368

1155

DNA

人工序列

合成

CDS

(1)..(1155)

368

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

agc gga ggc aac gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

369

383

PRT

人工序列

合成構建體

369

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

370

1155

DNA

人工序列

合成

370

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtctgttccg ttgcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

371

5579

DNA

人工序列

合成

371

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680

agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggcaacgga acagacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

372

645

DNA

人工序列

合成

CDS

(1)..(645)

372

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

373

215

PRT

人工序列

合成構建體

373

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

374

645

DNA

人工序列

合成

374

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360

gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

375

1155

DNA

人工序列

合成

CDS

(1)..(1155)

375

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg aac agc acc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

376

383

PRT

人工序列

合成構建體

376

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

377

1155

DNA

人工序列

合成

377

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggtgctgttc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgggtatt gttatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

378

5579

DNA

人工序列

合成

378

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac cataacaata 1680

cccagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgaac agcaccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

379

645

DNA

人工序列

合成

CDS

(1)..(645)

379

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac 645

Leu Asn Arg Glu Lys Ile Asp

210 215

380

215

PRT

人工序列

合成構建體

380

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp

210 215

381

645

DNA

人工序列

合成

381

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360

gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420

tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480

catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540

ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600

ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645

382

1155

DNA

人工序列

合成

CDS

(1)..(1155)

382

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

aac gaa cag gtg aac aag gag atg aac agc acc aac ctg tac atg agt 720

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

att ttc ctg aat gag aac aat gtg ccc gtc aac ctg act tca atc agc 864

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser

275 280 285

gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

tga 1155

383

383

PRT

人工序列

合成構建體

383

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn

85 90 95

Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu

210 215 220

Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser

225 230 235 240

Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe

245 250 255

Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile

260 265 270

Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser

275 280 285

Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala

290 295 300

Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp

305 310 315 320

His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp

325 330 335

Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu

340 345 350

Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala

355 360 365

Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

370 375 380

384

1155

DNA

人工序列

合成

384

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcaggttg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggtgctgttc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgggtatt gttatggtgg tagccgtacc acccgtccac 900

cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960

ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020

gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080

gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140

cagtttggcc ttcat 1155

385

5579

DNA

人工序列

合成

385

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620

ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac cataacaata 1680

cccagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740

ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800

ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860

agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920

gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980

cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040

ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgaac agcaccaacc 2100

tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160

tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220

agaacaatgt gcccgtcaac ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280

tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340

atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400

acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460

tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520

agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580

tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640

cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700

tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760

aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820

tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880

ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940

tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000

caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060

gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120

atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140

catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200

ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260

ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320

cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380

gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440

ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500

catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560

ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620

tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680

atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740

gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800

gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860

attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920

ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980

ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040

aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100

tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160

cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220

gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280

ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340

ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400

ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460

aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520

aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579

386

384

DNA

人工序列

合成

CDS

(1)..(384)

386

atg aag gcc aag ctg ctg gtg ctc ctg tgc acc ttc acc gcc acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gcc gac acc atc tgc atc ggc tac cac gcc aac aac agc acc gac acc 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtg ctg gaa aag aac gtg acc gtg acc cac agc gtg aac 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc ggc ctg cgg atg gtg aca ggc ctg cgg aac atc ccc cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

aga gag aca cgg ggc ctg ttc ggc gcc att gcc ggc ttt atc gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggc tgg acc ggc atg gtg gac ggg tgg tac ggc tac cac cac cag aac 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gcc gac cag aag tcc acc cag aac gcc 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aac ggc atc acc aac atg gtg aac agc gtg atc gag aag atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

387

128

PRT

人工序列

合成構建體

387

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

388

384

DNA

人工序列

合成

388

gcccatcttc tcgatcacgc tgttcaccat gttggtgatg ccgttgatgg cgttctgggt 60

ggacttctgg tcggcggcgt agccgctgcc ctgctcgttc tggtggtggt agccgtacca 120

cccgtccacc atgccggtcc agccgccctc gataaagccg gcaatggcgc cgaacaggcc 180

ccgtgtctct ctctggggga tgttccgcag gcctgtcacc atccgcaggc cgctgcccag 240

gttcacgctg tgggtcacgg tcacgttctt ttccagcacg gtatccacgg tgtcggtgct 300

gttgttggcg tggtagccga tgcagatggt gtcggcgtag gtggcggtga aggtgcacag 360

gagcaccagc agcttggcct tcat 384

389

1110

DNA

人工序列

合成

CDS

(1)..(1110)

389

atg aag gcc aag ctg ctg gtg ctc ctg tgc acc ttc acc gcc acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gcc gac acc atc tgc atc ggc tac cac gcc aac aac agc acc gac acc 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtg ctg gaa aag aac gtg acc gtg acc cac agc gtg aac 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc ggc ctg cgg atg gtg aca ggc ctg cgg aac atc ccc cag 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

aga gag aca cgg ggc ctg ttc ggc gcc att gcc ggc ttt atc gag ggc 240

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

ggc tgg acc ggc atg gtg gac ggg tgg tac ggc tac cac cac cag aac 288

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

gag cag ggc agc ggc tac gcc gcc gac cag aag tcc acc cag aac gcc 336

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

atc aac ggc atc acc aac atg gtg aac agc gtg atc gag aag atg ggc 384

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

tcc ggc ggc agc ggc acc gat ctg gct gaa ctg ctg gtc ctg ctg ctg 432

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

aac gag cgg acc ctg gac ttc cac gac agc aac gtg aag aac ctg tac 480

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

gag aaa gtg aag tcc cag ctg aag aac aac gcc aaa gag atc ggc aac 528

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

ggc tgc ttc gag ttc tac cac aag tgc aac aac gag tgc atg gaa agc 576

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

gtg aag aac ggc acc tac gac tac ccc aag tac agc gag gaa agc aag 624

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

ctg aac cgc gag gga ggc atg caa atc tac gag ggc aag ctg aca gcc 672

Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala

210 215 220

gag ggc ctg aga ttc ggc atc gtg gcc agc cgg ttc aac cac gcc ctg 720

Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu

225 230 235 240

gtg gac aga ctg gtg gaa ggc gcc atc gac tgc atc gtg cgg cac ggc 768

Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly

245 250 255

ggc aga gaa gag gac atc acc ctg gtc cgc gtg ccc ggc agc tgg gaa 816

Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu

260 265 270

att cct gtg gct gcc ggc gag ctg gcc cgg aaa gag gat atc gac gcc 864

Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala

275 280 285

gtc atc gcc atc ggc gtg ctg atc aga ggc gcc acc ccc cac ttc gac 912

Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp

290 295 300

tat atc gcc agc gag gtg tcc aag ggc ctg gcc aac ctg agc ctg gaa 960

Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu

305 310 315 320

ctg cgg aag ccc atc acc ttc gga gtg atc acc gcc gac acc ctg gaa 1008

Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu

325 330 335

cag gcc atc gag aga gcc ggc acc aag cac ggc aac aag gga tgg gaa 1056

Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu

340 345 350

gcc gcc ctg agc gcc atc gag atg gcc aat ctg ttc aag agc ctg cgc 1104

Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360 365

tga tga 1110

390

368

PRT

人工序列

合成構建體

390

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln

50 55 60

Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly

65 70 75 80

Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn

85 90 95

Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala

100 105 110

Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly

115 120 125

Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu

130 135 140

Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr

145 150 155 160

Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn

165 170 175

Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser

180 185 190

Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys

195 200 205

Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala

210 215 220

Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu

225 230 235 240

Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly

245 250 255

Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu

260 265 270

Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala

275 280 285

Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp

290 295 300

Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu

305 310 315 320

Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu

325 330 335

Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu

340 345 350

Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg

355 360 365

391

1110

DNA

人工序列

合成

391

tcatcagcgc aggctcttga acagattggc catctcgatg gcgctcaggg cggcttccca 60

tcccttgttg ccgtgcttgg tgccggctct ctcgatggcc tgttccaggg tgtcggcggt 120

gatcactccg aaggtgatgg gcttccgcag ttccaggctc aggttggcca ggcccttgga 180

cacctcgctg gcgatatagt cgaagtgggg ggtggcgcct ctgatcagca cgccgatggc 240

gatgacggcg tcgatatcct ctttccgggc cagctcgccg gcagccacag gaatttccca 300

gctgccgggc acgcggacca gggtgatgtc ctcttctctg ccgccgtgcc gcacgatgca 360

gtcgatggcg ccttccacca gtctgtccac cagggcgtgg ttgaaccggc tggccacgat 420

gccgaatctc aggccctcgg ctgtcagctt gccctcgtag atttgcatgc ctccctcgcg 480

gttcagcttg ctttcctcgc tgtacttggg gtagtcgtag gtgccgttct tcacgctttc 540

catgcactcg ttgttgcact tgtggtagaa ctcgaagcag ccgttgccga tctctttggc 600

gttgttcttc agctgggact tcactttctc gtacaggttc ttcacgttgc tgtcgtggaa 660

gtccagggtc cgctcgttca gcagcaggac cagcagttca gccagatcgg tgccgctgcc 720

gccggagccc atcttctcga tcacgctgtt caccatgttg gtgatgccgt tgatggcgtt 780

ctgggtggac ttctggtcgg cggcgtagcc gctgccctgc tcgttctggt ggtggtagcc 840

gtaccacccg tccaccatgc cggtccagcc gccctcgata aagccggcaa tggcgccgaa 900

caggccccgt gtctctctct gggggatgtt ccgcaggcct gtcaccatcc gcaggccgct 960

gcccaggttc acgctgtggg tcacggtcac gttcttttcc agcacggtat ccacggtgtc 1020

ggtgctgttg ttggcgtggt agccgatgca gatggtgtcg gcgtaggtgg cggtgaaggt 1080

gcacaggagc accagcagct tggccttcat 1110

392

5528

DNA

人工序列

合成

392

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

ccaccatgaa ggccaagctg ctggtgctcc tgtgcacctt caccgccacc tacgccgaca 1440

ccatctgcat cggctaccac gccaacaaca gcaccgacac cgtggatacc gtgctggaaa 1500

agaacgtgac cgtgacccac agcgtgaacc tgggcagcgg cctgcggatg gtgacaggcc 1560

tgcggaacat cccccagaga gagacacggg gcctgttcgg cgccattgcc ggctttatcg 1620

agggcggctg gaccggcatg gtggacgggt ggtacggcta ccaccaccag aacgagcagg 1680

gcagcggcta cgccgccgac cagaagtcca cccagaacgc catcaacggc atcaccaaca 1740

tggtgaacag cgtgatcgag aagatgggct ccggcggcag cggcaccgat ctggctgaac 1800

tgctggtcct gctgctgaac gagcggaccc tggacttcca cgacagcaac gtgaagaacc 1860

tgtacgagaa agtgaagtcc cagctgaaga acaacgccaa agagatcggc aacggctgct 1920

tcgagttcta ccacaagtgc aacaacgagt gcatggaaag cgtgaagaac ggcacctacg 1980

actaccccaa gtacagcgag gaaagcaagc tgaaccgcga gggaggcatg caaatctacg 2040

agggcaagct gacagccgag ggcctgagat tcggcatcgt ggccagccgg ttcaaccacg 2100

ccctggtgga cagactggtg gaaggcgcca tcgactgcat cgtgcggcac ggcggcagag 2160

aagaggacat caccctggtc cgcgtgcccg gcagctggga aattcctgtg gctgccggcg 2220

agctggcccg gaaagaggat atcgacgccg tcatcgccat cggcgtgctg atcagaggcg 2280

ccacccccca cttcgactat atcgccagcg aggtgtccaa gggcctggcc aacctgagcc 2340

tggaactgcg gaagcccatc accttcggag tgatcaccgc cgacaccctg gaacaggcca 2400

tcgagagagc cggcaccaag cacggcaaca agggatggga agccgccctg agcgccatcg 2460

agatggccaa tctgttcaag agcctgcgct gatgaacacg tgggatccag atctgctgtg 2520

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2580

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2640

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2700

gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2760

tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2820

tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2880

gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 2940

ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3000

gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3060

tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3120

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3180

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3240

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3300

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3360

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3420

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3480

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3540

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3600

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3660

atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3720

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3780

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3840

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3900

ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 3960

tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4020

aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4080

tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4140

aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4200

gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4260

gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4320

caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4380

gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4440

caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4500

aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4560

tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4620

aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4680

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4740

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4800

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4860

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4920

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 4980

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5040

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5100

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5160

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5220

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5280

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5340

cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5400

tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5460

cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5520

ctttcgtc 5528

393

594

DNA

人工序列

合成

CDS

(1)..(594)

393

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac ggg tca ggc 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat gag 240

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca atc 288

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc agc 336

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg aac 384

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat gag 432

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat gga 480

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct gtg 528

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag ctg 576

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

aat cga gag aaa att gac 594

Asn Arg Glu Lys Ile Asp

195

394

198

PRT

人工序列

合成構建體

394

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

Asn Arg Glu Lys Ile Asp

195

395

594

DNA

人工序列

合成

395

gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60

gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120

cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180

gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240

gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300

gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360

ctgatggtgg tagccgtacc acccgtccac cattcctgtc cagcctgacc cgttgcgcag 420

tccggtgacc atcctcagtc cgctgcccag attcactgag tgggtgacag tcacgttctt 480

ctccaggacg gtatccactg tgtcggtgga gttgtttgcg tgatagccga tgcagatagt 540

gtcagcgtag gttgcggtaa aagtacacag caggaccagc agtttggcct tcat 594

396

1104

DNA

人工序列

合成

CDS

(1)..(1104)

396

atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac ggg tca ggc 192

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat gag 240

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca atc 288

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc agc 336

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg aac 384

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat gag 432

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat gga 480

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct gtg 528

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag ctg 576

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg aac 624

Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn

195 200 205

gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt atg 672

Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met

210 215 220

tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc ctg 720

Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu

225 230 235 240

ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc att 768

Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile

245 250 255

ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc gcc 816

Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala

260 265 270

cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct tac 864

Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr

275 280 285

gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac cac 912

Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His

290 295 300

gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg tac 960

Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr

305 310 315 320

gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg gat 1008

Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp

325 330 335

aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca gat 1056

Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp

340 345 350

cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga tga 1104

Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

355 360 365

397

366

PRT

人工序列

合成構建體

397

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn

195 200 205

Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met

210 215 220

Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu

225 230 235 240

Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile

245 250 255

Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala

260 265 270

Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr

275 280 285

Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His

290 295 300

Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr

305 310 315 320

Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp

325 330 335

Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp

340 345 350

Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

355 360 365

398

1104

DNA

人工序列

合成

398

tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60

caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120

ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180

gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240

gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300

agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360

gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420

acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480

gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540

ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600

gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660

ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720

gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780

ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840

atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900

cattcctgtc cagcctgacc cgttgcgcag tccggtgacc atcctcagtc cgctgcccag 960

attcactgag tgggtgacag tcacgttctt ctccaggacg gtatccactg tgtcggtgga 1020

gttgtttgcg tgatagccga tgcagatagt gtcagcgtag gttgcggtaa aagtacacag 1080

caggaccagc agtttggcct tcat 1104

399

5528

DNA

人工序列

合成

399

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240

ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300

tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360

ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420

cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480

catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540

tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600

tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660

ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720

catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780

cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840

ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900

agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960

tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020

cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080

ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140

ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200

accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320

ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380

atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440

ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500

tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560

ccggactgcg caacgggtca ggctggacag gaatggtgga cgggtggtac ggctaccacc 1620

atcagaatga gcagggcagc ggctacgccg ctgatcagaa gtctacacag aacgcaatca 1680

atggcattac taacatggtg aattctgtca tcgaaaaaat gggcagcgga ggctccggaa 1740

cagacctggc tgagctgctg gtgctgctgc tgaaccagtg gactctgctg ttccacgata 1800

gcaacgtgaa gaatctgtat gagaaggtca aatcccagct gaagaacaat gccaaagaaa 1860

tcgggaatgg atgcttcgag ttttaccata agtgcaacaa tgaatgtatg gagtctgtga 1920

agaacggcac ttacgactat cccaaatatt ctgaagagag taagctgaat cgagagaaaa 1980

ttgacagtgg gggcgacatc atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga 2040

gctccaacct gtacatgagt atgtctagtt ggtgttatac acactcactg gacggcgctg 2100

ggctgttcct gtttgatcac gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt 2160

tcctgaatga gaacaatgtg cccgtccagc tgacttcaat cagcgcccct gaacataagt 2220

tcgagggcct gacccagatc tttcagaaag cttacgaaca cgagcagcat atttccgaat 2280

ctatcaacaa tattgtggac cacgccatta agagcaaaga tcatgctacc ttcaactttc 2340

tgcagtggta cgtggccgag cagcacgagg aggaggtcct gtttaaggac atcctggata 2400

aaatcgaact gattggaaac gagaatcatg gcctgtacct ggcagatcag tatgtgaagg 2460

gcattgccaa gtccagaaaa agtgggtcat gatgaacacg tgggatccag atctgctgtg 2520

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2580

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2640

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2700

gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2760

tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2820

tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2880

gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 2940

ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3000

gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3060

tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3120

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3180

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3240

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3300

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3360

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3420

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3480

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3540

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3600

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3660

atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3720

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3780

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3840

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3900

ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 3960

tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4020

aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4080

tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4140

aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4200

gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4260

gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4320

caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4380

gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4440

caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4500

aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4560

tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4620

aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4680

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4740

aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4800

aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4860

aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4920

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 4980

aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5040

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5100

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5160

agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5220

cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5280

atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5340

cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5400

tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5460

cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5520

ctttcgtc 5528

400

198

PRT

人工序列

合成

400

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

Asn Arg Glu Lys Ile Asp

195

401

366

PRT

人工序列

合成

401

Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr

1 5 10 15

Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr

20 25 30

Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn

35 40 45

Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly

50 55 60

Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu

65 70 75 80

Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile

85 90 95

Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser

100 105 110

Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn

115 120 125

Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu

130 135 140

Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly

145 150 155 160

Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val

165 170 175

Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu

180 185 190

Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn

195 200 205

Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met

210 215 220

Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu

225 230 235 240

Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile

245 250 255

Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala

260 265 270

Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr

275 280 285

Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His

290 295 300

Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr

305 310 315 320

Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp

325 330 335

Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp

340 345 350

Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser

355 360 365

同类文章

一種新型多功能組合攝影箱的製作方法

一種新型多功能組合攝影箱的製作方法【專利摘要】本實用新型公開了一種新型多功能組合攝影箱,包括敞開式箱體和前攝影蓋,在箱體頂部設有移動式光源盒,在箱體底部設有LED脫影板,LED脫影板放置在底板上;移動式光源盒包括上蓋,上蓋內設有光源,上蓋部設有磨沙透光片,磨沙透光片將光源封閉在上蓋內;所述LED脫影

壓縮模式圖樣重疊檢測方法與裝置與流程

本發明涉及通信領域,特別涉及一種壓縮模式圖樣重疊檢測方法與裝置。背景技術:在寬帶碼分多址(WCDMA,WidebandCodeDivisionMultipleAccess)系統頻分復用(FDD,FrequencyDivisionDuplex)模式下,為了進行異頻硬切換、FDD到時分復用(TDD,Ti

個性化檯曆的製作方法

專利名稱::個性化檯曆的製作方法技術領域::本實用新型涉及一種檯曆,尤其涉及一種既顯示月曆、又能插入照片的個性化檯曆,屬於生活文化藝術用品領域。背景技術::公知的立式檯曆每頁皆由月曆和畫面兩部分構成,這兩部分都是事先印刷好,固定而不能更換的。畫面或為風景,或為模特、明星。功能單一局限性較大。特別是畫

一種實現縮放的視頻解碼方法

專利名稱:一種實現縮放的視頻解碼方法技術領域:本發明涉及視頻信號處理領域,特別是一種實現縮放的視頻解碼方法。背景技術: Mpeg標準是由運動圖像專家組(Moving Picture Expert Group,MPEG)開發的用於視頻和音頻壓縮的一系列演進的標準。按照Mpeg標準,視頻圖像壓縮編碼後包

基於加熱模壓的纖維增強PBT複合材料成型工藝的製作方法

本發明涉及一種基於加熱模壓的纖維增強pbt複合材料成型工藝。背景技術:熱塑性複合材料與傳統熱固性複合材料相比其具有較好的韌性和抗衝擊性能,此外其還具有可回收利用等優點。熱塑性塑料在液態時流動能力差,使得其與纖維結合浸潤困難。環狀對苯二甲酸丁二醇酯(cbt)是一種環狀預聚物,該材料力學性能差不適合做纖

一種pe滾塑儲槽的製作方法

專利名稱:一種pe滾塑儲槽的製作方法技術領域:一種PE滾塑儲槽一、 技術領域 本實用新型涉及一種PE滾塑儲槽,主要用於化工、染料、醫藥、農藥、冶金、稀土、機械、電子、電力、環保、紡織、釀造、釀造、食品、給水、排水等行業儲存液體使用。二、 背景技術 目前,化工液體耐腐蝕貯運設備,普遍使用傳統的玻璃鋼容

釘的製作方法

專利名稱:釘的製作方法技術領域:本實用新型涉及一種釘,尤其涉及一種可提供方便拔除的鐵(鋼)釘。背景技術:考慮到廢木材回收後再加工利用作業的方便性與安全性,根據環保規定,廢木材的回收是必須將釘於廢木材上的鐵(鋼)釘拔除。如圖1、圖2所示,目前用以釘入木材的鐵(鋼)釘10主要是在一釘體11的一端形成一尖

直流氧噴裝置的製作方法

專利名稱:直流氧噴裝置的製作方法技術領域:本實用新型涉及ー種醫療器械,具體地說是ー種直流氧噴裝置。背景技術:臨床上的放療過程極易造成患者的局部皮膚損傷和炎症,被稱為「放射性皮炎」。目前對於放射性皮炎的主要治療措施是塗抹藥膏,而放射性皮炎患者多伴有局部疼痛,對於止痛,多是通過ロ服或靜脈注射進行止痛治療

新型熱網閥門操作手輪的製作方法

專利名稱:新型熱網閥門操作手輪的製作方法技術領域:新型熱網閥門操作手輪技術領域:本實用新型涉及一種新型熱網閥門操作手輪,屬於機械領域。背景技術::閥門作為流體控制裝置應用廣泛,手輪傳動的閥門使用比例佔90%以上。國家標準中提及手輪所起作用為傳動功能,不作為閥門的運輸、起吊裝置,不承受軸向力。現有閥門

用來自動讀取管狀容器所載識別碼的裝置的製作方法

專利名稱:用來自動讀取管狀容器所載識別碼的裝置的製作方法背景技術:1-本發明所屬領域本發明涉及一種用來自動讀取管狀容器所載識別碼的裝置,其中的管狀容器被放在循環於配送鏈上的文檔匣或託架裝置中。本發明特別適用於,然而並非僅僅專用於,對引入自動分析系統的血液樣本試管之類的自動識別。本發明還涉及專為實現讀