單倍體誘導系的製作方法
2023-10-23 21:56:57
發明領域
本發明涉及通過分子生物學方法和標記技術和遺傳工程修飾植物的領域。其涉及提供技術工具例如核酸和載體,以及涉及方法和用途,用於產生和鑑定非轉基因和轉基因植物單倍體誘導系(haploidinducer),以及改良已有的植物單倍體誘導系。
發明背景
一般而言,在雜交植物生產中,作為親本的兩個育種系相互雜交,由於已知的雜種優勢效應,其後代部分地產生相對於親本系而言強烈增加的產量。所述育種系可以通過多個自交步驟獲得,然而其需要進行多代並因此涉及巨大的時間成本。現代植物育種許多年以前已經越來越多地轉變為經由單倍體誘導以及隨後的染色體加倍在短得多的時間量內生成育種系。對此的一技術需求是有用的單倍體誘導系統,其同時也提供足夠的高效以經濟上可用。
例如,對於玉米(zeamays),已知母體體內誘導系統,其中待誘導的植物用誘導系的花粉授粉。然後由此生成的後代的多達10%僅僅含有種子親本的簡單(單倍體)染色體組。現在可獲得用於玉米雜交育種的少數幾個這樣的誘導系。然而,這些全部都歸因於coe,1959描述的單一系「stock6」。這樣的已知的誘導系的一個實例是rws(etal.,2005)系。在過去,在這些品系上進行了多項qtl研究以鑑別誘導系相關基因座。deimling等在1997年已經鑑定到玉米種染色體1的主要qtl(bin1.04)。barretetal.2008更精確的定位於染色體1上66.96mb和68.11mb之間的範圍,priggeetal.2012更精確的定位於62.9mb和70.8mb之間的範圍,和隨後dongetal.2013更精確的定位於68.18mb和68.43mb之間的範圍,其根據公開注釋含有三個基因。全部位置信息指的是b73參考基因組,agpv02版本。dongetal.2014實現5%的誘導率似乎已經證明該基因座其自身的功能性。然而,不能排除錯誤的精細定位,因為由於缺乏輪迴親本中側翼標記的信息,不可能明確地對所述qtl進行劃界。
此外,wo2012/030893公開了玉米中染色體1上誘導系相關基因座,然而,其與前述基因座顯著不同並且更具體地定位於端粒處。在所考慮的基因組區域中沒有重疊。
總的而言,從「stock6」獲得的玉米品系中體內單倍體誘導的分子和發育特異性機制至今很大程度是未知的。例如可以考慮,發生受精,但是隨後其導致染色體消除,然後允許產生單倍體後代。例如,ravi&chan(2010)已經描述了在具有組蛋白cenh3的系統中的此類機制。然而,在另一方面,受精也可失敗,在三倍體胚乳中出現單倍體卵細胞的發育。如果不了解來自「stock6」的誘導系基因型背後的母體體內單倍體誘導基因和所負責的基因的知識,該玉米誘導系的定向改良或所述誘導基因向非誘導系基因型的轉移,或在玉米非誘導系中體內單倍體誘導能力的定向處理實際上是不可能的。
此外,對於一些栽培植物,根本沒有已知的高效(且因此是經濟的)可用的系統用於產生單倍體和雙-單倍體植物,例如高粱、黑麥或向日葵。
有需要提供遺傳元件如基因或調控元件,其在轉基因和/或非轉基因方法中可用以允許通過體內誘導的單倍體開發,或改善單倍體開發的效率。
發明概述
針對上文所述的現有技術的背景獲得本發明,其中本發明的目的是提供工具和方法,其可用於產生體內單倍體誘導系和/或提供單倍體植物。
根據本發明,所述目的通過這樣的核酸實現,所述核酸在植物中轉錄或表達後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力。根據本發明的所述核酸可以作為轉基因使用。在另一方面,與本發明的所述核酸之一相同的植物基因組中或植物單倍體誘導系基因組中的內源dna序列也可以被修飾,使得在所述內源dna序列轉錄或表達後,單倍體誘導系的特性被介導或單倍體誘導系的誘導能力被增加。本發明的核酸優選是分離的核酸,其提取自其天然或原始環境(遺傳背景)。核酸可以是雙鏈或單鏈的,且可以是線性或環狀的。其因此可以是基因組dna、合成的dna、cdna或rna類型(如lncrna、sirna或mirna),其中rna中存在核鹼基尿嘧啶代替核鹼基胸腺嘧啶。
在本發明一優選實施方式中,本發明的核酸或所述核酸編碼的rna或所述核酸編碼的蛋白或多肽,對植物中花粉管的生長、植物花粉的能量代謝和/或優選在生殖細胞(例如其發育成花粉)中的著絲粒的活性有影響。
本發明的核酸的特徵可在於,所述核酸或所述核酸編碼的rna或所述核酸編碼的蛋白或多肽,適於或可用於與野生型植物的花粉相比加速或促進花粉管生長(例如,在植物花粉中),其中本發明的核酸或所述核酸編碼的rna或所述核酸編碼的蛋白或多肽如下文所描述使用。例如,本發明的核酸編碼蛋白,其在植物花粉的花粉管中參與大分子的運輸或影響該運輸。屬於這些的是例如snarev蛋白,其例如介導例如在花粉管頂端的果膠類或磷脂類的運輸(katoetal.,2010)。此外,磷脂酶類的酶——特別是磷脂酶a2或patatin磷脂酶——能夠促進花粉管的生長(kimetal.,2011),而肌醇聚磷酸酯-5-磷酸酶類如肌醇-1,4,5-三磷酸-5-磷酸酶,可以抑制花粉管生長(wangetal.,2012)。本發明的核酸可以用作轉基因,用於加速花粉管生長的目的,其中其然後在植物或其部分中例如通過過表達方法,與野生型植物或其相應部分相比,增加花粉管生長促進基因的表達率或增加正調節(激活)花粉管生長促進基因或負調節(抑制)花粉管生長抑制基因的rna如lncrna的轉錄率,和/或在植物或其部分中通過rnai方法或mirna方法(fireetal.,1998),與野生型植物或其相應部分相比,減少花粉管生長抑制基因的表達率。在另一方面,植物基因組中或植物單倍體誘導系基因組中與本發明的核酸相同的內源dna序列,或所述內源dna序列的調控序列也可以被修飾,例如,通過誘變或「基因組編輯」。與未誘變的野生型植物相比,該修飾可以增加或減少植物中所述內源dna序列的轉錄或表達率,或所述內源dna序列編碼的蛋白或多肽的活性或穩定性。例如,相比於未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物,植物中內源花粉管生長促進基因的表達率或正調節(激活)花粉管生長促進基因或負調節(抑制)花粉管生長抑制基因的內源rna如lncrna的轉錄率可以因此被增加,或者相比於未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物,植物中內源花粉管生長抑制基因的表達率或負調節(抑制)花粉管生長促進基因或正調節(激活)花粉管生長抑制基因的內源rna如lncrna的轉錄率可以因此被減少。此外,相比於未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物,植物中所述內源dna序列編碼的花粉管生長促進蛋白或多肽的活性或穩定性可以被增加,或者相比於未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物,植物中所述內源dna序列編碼的花粉管生長抑制蛋白或多肽的活性或穩定性可以被減少。
在進一步的實例中,本發明的核酸可以表徵為,通過使用所述核酸,或通過使用所述核酸編碼的rna,或通過使用所述核酸編碼的蛋白或多肽,植物中花粉的能量代謝與野生型植物相比可以被負影響。例如,這可以通過磷酸甘油酸變位酶或線粒體轉運蛋白或線粒體輸入受體(mitochondrialimportreceptor)實現。出於該目的,本發明的核酸可以在過表達方法中、或在rnai方法中、或在mirna方法中用作轉基因(fireetal.,1998)。在另一方面,植物基因組中或植物單倍體誘導系基因組中與本發明的核酸相同的內源dna序列,或所述內源dna序列的調控序列也可以被修飾,例如,通過誘變或「基因組編輯」。與未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物相比,該修飾可以增加或減少植物中所述內源dna序列的轉錄或表達率,或所述內源dna序列編碼的蛋白或多肽的活性或穩定性。
在另一實例中,本發明的核酸還可以表徵為,通過使用所述核酸或使用所述核酸編碼的rna或使用所述核酸編碼的蛋白或多肽,植物中的著絲粒的活性與野生型相比被修飾(尤其是在早期胚胎發生中和優選在所述植物中發育成例如花粉的生殖細胞中),其導致例如所述誘導系基因組的消除。著絲粒的活性可以通過dna的染色質修飾或在組蛋白水平修飾,此外,還可以通過轉錄、rna相互作用或rna結合。例如,著絲粒的活性的改變可以通過甲基轉移酶如rna甲基轉移酶實現。出於該目的,本發明的核酸用作轉基因,其中它然後通過過表達方法相對於野生型植物增加植物中染色質修飾基因或正調節(激活)染色質修飾基因的rna(如lncrna)的表達率。在另一方面,植物基因組中或植物單倍體誘導系基因組中與本發明的核酸相同的內源dna序列,或所述內源dna序列的調控序列也可以被修飾,例如,通過誘變或「基因組編輯」。與未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物相比,該修飾可以增加或減少植物中所述內源dna序列的轉錄或表達率,或所述內源dna序列編碼的蛋白或多肽的活性或穩定性。與未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物相比,植物中染色質修飾基因或正調節(激活)染色質修飾基因的rna(如lncrna)的表達率因此也可以被增加。此外,與未誘變的野生型植物或未經由「基因組編輯」修飾的野生型植物相比,植物中所述內源dna序列編碼的染色質修飾蛋白的活性或穩定性可以被增加。
前面所述的本發明的核酸或所述核酸編碼的rna或所述核酸編碼的蛋白或多肽的用途不是排他性的或限制性的,相反應該被理解為僅僅是示例。本領域技術人員從現有技術已知許多額外的技術手段和方法,通過所述技術手段和方法其可以實現上文所描述的本發明的核酸或相同的內源dna序列的表達或轉錄率的改變,或上文所描述的有本發明的核酸或所述內源dna序列編碼的蛋白或多肽的穩定性和活性的改變。
在本發明一特別優選的實施方案中,所述核酸,其在植物中轉錄後或翻譯後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力,所述核酸可以包含這樣的核苷酸序列,其
(i)選自seqidno:1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、42、43、46、47、49、50、52、53、55、56、57、58、59、60、61和/或62,或具有這些的功能性片段,或
(ii)與來自(i)的序列互補,或
(iii)與來自(i)的序列至少80%、82%、84%、86%、88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,或
(iv)編碼具有選自seqidno:16、17、18、19、20、21、22、23、24、25、44、45、48、51、54、63、64和/或65的胺基酸序列的蛋白或所述蛋白的功能性部分,或
(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其功能性部分,或
(vi)與來自(ii)的序列在嚴格條件下雜交。
該核酸可以編碼蛋白或其功能性部分,其中所述蛋白或其功能性部分具有snare蛋白(尤其snarev蛋白)、磷脂酶(尤其磷脂酶a2或patatin磷脂酶)、甲基轉移酶(尤其rna甲基轉移酶)或線粒體輸入受體的功能(見表1)。可以如上所述實現所述核酸的用途,即為了在植物中介導單倍體誘導系的特性或增加單倍體誘導系的誘導能力,例如,所述核酸的表達率或所編碼的蛋白或所編碼的蛋白部分的活性或穩定性被轉基因地或內源地增加。由於該核酸或該核酸編碼的rna或該核酸編碼的蛋白或多肽對植物的單倍體誘導能力有正作用,在下文中,此處所定義的核酸命名為誘導促進核酸。下文進一步公開所述誘導促進核酸以及包含所述誘導促進核酸的物質的額外的方法和用途。
在本發明另一特別優選的實施方案中,所述核酸,其在植物中轉錄後或翻譯後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力,可以是包含這樣的核苷酸序列的核酸,所述核苷酸序列
(i)具有選自seqidno:26、27、28、29、30和/或31的序列或其功能性片段,或
(ii)與來自(i)的序列互補,或
(iii)與來自(i)的序列至少80%、82%、84%、86%、88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,
(iv)編碼具有選自seqidno:32、33和/或34的胺基酸序列的蛋白或所述蛋白的功能性部分,或
(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其功能性部分,或
(vi)與來自(ii)的序列在嚴格條件下雜交。
這樣的核酸可以編碼蛋白或其功能性部分,其中所述蛋白或其功能性部分具有肌醇聚磷酸酯-5-磷酸酶(特別是肌醇-1,4,5-三磷酸-5-磷酸酶)或磷酸甘油酸變位酶的功能(見表1)。可以如上所述實現所述核酸的用途,即為了在植物中介導單倍體誘導系的特性或增加單倍體誘導系的誘導能力,例如,所述核酸的表達率或所編碼的蛋白或所編碼的蛋白部分的活性或穩定性被轉基因地或內源地減少。由於該核酸或該核酸編碼的rna或該核酸編碼的蛋白或多肽對植物的單倍體誘導能力有負作用,在下文中,此處所定義的核酸命名為誘導抑制核酸。下文進一步公開所述誘導抑制核酸以及包含所述誘導抑制核酸的物質的額外的方法和用途。
在本發明另一特別優選的實施方案中,所述核酸,其在植物中轉錄後或翻譯後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力,其可以是這樣的核酸,所述核酸編碼具有雙鏈部分的rna,其中所述雙鏈部分的至少一條鏈具有與以下核酸的編碼序列中至少14、15、16、17、18、19、20、21、22、23、24、或25個,優選至少30、35、40、45、50、60、70、80、90、100、120、或140個,和特別優選至少160、180、200、250、300、350、400、450、500、600、700、800、900、或1000個連續核苷酸同源或相同的核苷酸序列:
(i)具有有義或反義方向的選自seqidno:26、27、28、29、30和/或31的序列或其功能性片段的核酸,或
(ii)與來自(i)的序列互補的核酸,或
(iii)與來自(i)的序列至少80%、82%、84%、86%、或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同的核酸,或
(iv)編碼具有選自seqidno:32、33和/或34的胺基酸序列的蛋白或所述蛋白的功能性部分的核酸,或
(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其部分的核酸,或
(vi)與來自(ii)的序列在嚴格條件下雜交的核酸。在轉錄後基因沉默中,例如在rnai方法和mirna方法中所描述的(fireetal.,1998),這樣的核酸可以用來抑制上文所述誘導抑制核酸的表達。dsrna編碼核酸也可以是編碼長非編碼rna(lncrna)的核酸。所述lncrna核酸然後優選包含核苷酸序列,其
(a)具有選自seqidno:35、36、37和/或38的序列或其片段,或
(b)與來自(a)的序列互補,或
(c)與來自(a)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,或
(d)編碼具有seqidno:40或41的胺基酸序列的多肽或所述多肽的部分,或
(e)與來自(b)的序列在嚴格條件下雜交。該lncrna,以下命名為lncrna1,可用於肌醇聚磷酸酯-5-磷酸酶例如肌醇-1,4,5-三磷酸-5-磷酸酶的表達或翻譯調控。此外,所述lncrna編碼核酸可以優選包含核苷酸序列,其
(w)具有選自seqidno:39的序列或其片段,或
(x)與來自(w)的序列互補,或
(y)與來自(w)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,或
(z)與來自(x)的序列在嚴格條件下雜交。該lncrna,以下命名為lncrna2,可用於磷脂酶尤其是磷脂酶a2或patatin磷脂酶的表達或翻譯調控。
在本發明另一特別優選的實施方案中,所述核酸,其在植物中轉錄後或翻譯後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力,其可以是這樣的核酸,所述核酸編碼具有雙鏈部分的rna,其中所述雙鏈部分的至少一條鏈具有與以下核酸的內含子序列中至少14、15、16、17、18、19、20、21、22、23、24、或25個,優選至少30、35、40、45、50、60、70、80、90、100、120、或140個,和特別優選至少160、180、200、250、300、350、400、450、500、600、700、800、900、或1000個連續核苷酸同源或相同的核苷酸序列:
(i)具有有義或反義方向的選自seqidno:1、6、8、9、12、13、26、30、42、43、46、55、58和/或60的序列或其功能性片段的核酸,或
(ii)與來自(i)的序列互補的核酸,或
(iii)與來自(i)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同的核酸,或
(iv)編碼具有選自seqidno:16、17、18、19、20、21、22、23、24、25、44、45、48、63、64和/或65,或選自seqidno:32、33和/或34的胺基酸序列的蛋白或所述蛋白的部分的核酸,或
(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其部分的核酸,或
(vi)與來自(ii)的序列在嚴格條件下雜交的核酸。在轉錄基因沉默中,例如在rddm方法中(shibuyaetal.,2009),這樣的核酸可以用來激活上文所述的誘導促進核酸的表達,或用於抑制上文所述的誘導抑制核酸的表達。dsrna編碼核酸也可以是編碼長非編碼rna(lncrna)的核酸。所述lncrna核酸然後優選包含核苷酸序列,其
(a)具有選自seqidno:35、36、37和/或38的序列或其片段,或
(b)與來自(a)的序列互補,或
(c)與來自(a)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,或
(d)編碼具有seqidno:40或41的胺基酸序列的多肽或所述多肽的部分,或
(e)與來自(b)的序列在嚴格條件下雜交。該lncrna,以下命名為lncrna1,可用於肌醇聚磷酸酯-5-磷酸酶例如肌醇-1,4,5-三磷酸-5-磷酸酶的表達或翻譯調控。此外,所述lncrna編碼核酸可以優選包含核苷酸序列,其
(w)具有選自seqidno:39的序列或其片段,或
(x)與來自(w)的序列互補,或
(y)與來自(w)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同,或
(z)與來自(x)的序列在嚴格條件下雜交。該lncrna,以下命名為lncrna2,可用於磷脂酶尤其是磷脂酶a2或patatin磷脂酶的表達或翻譯調控。
表1:序列表和核苷酸以及胺基酸序列的編號。基因家族/蛋白家族的名字對應於公開模型。誘導系中結構的變化可以導致不同的功能。
在另一方面,本發明涉及包含本發明的核酸的載體。所述載體可以是質粒、粘粒、噬菌體或表達載體、轉化載體、穿梭載體或克隆載體;其可以是雙鏈的或單鏈的,線性的或環狀的;或其可以轉化原核或真核宿主,通過整合進其基因組或在染色體外。本發明的核酸在載體中優選與一或多個調控序列可操作體連接,所述調控序列允許在原核或真核宿主細胞中的轉錄以及任選地允許表達。調控序列(優選dna)可以與本發明的核酸同源或異源。例如,所述核酸在合適的啟動子或終止子的控制下。合適的啟動子可以是組成型誘導的啟動子(實例:35s啟動子,來自「花椰菜花葉病毒」(odelletal.,1985);那些組織特異性啟動子是特別合適的(實例:花粉特異性啟動子,chenetal.(2010),zhaoetal.(2006),或twelletal.(1991)),或者是發育特異性的(實例:開花特異性啟動子)。合適的啟動子也可以是合成的或嵌合的啟動子,其在自然界中不存在,包含多個元件,且含有最小啟動子,以及在所述最小啟動子上遊的至少一個順式調控元件,其是特定轉錄因子的結合位點。嵌合啟動子可以根據期望的特徵設計,且由不同的因子誘導或抑制。這樣的啟動子的實例在gurr&rushton(2005)或venter(2007)找到。例如,合適的終止子是nos終止子(depickeretal.,1982)。
除了上文描述的載體外,本發明還提供一種方法,包括將所描述的載體插入進宿主細胞。例如,可以通過接合、轉移(mobilization)、基因槍轉化、農桿菌介導的轉化、轉染、轉導、真空滲濾或電穿孔導入所述載體。這樣的方法,如同製備所描述的載體的方法,是本領域技術人員的常識(sambrooketal.,2001)。
在另一方面,本發明涉及包含本發明的核酸或本發明的載體的宿主細胞。本發明含義的宿主細胞可以是原核細胞(如細菌)或真核細胞(如植物細胞或酵母細胞)。所述宿主細胞優選是農桿菌,例如根癌農桿菌(agrobacteriumtumefaciens)或毛根農桿菌(agrobacteriumrhizogenes),或植物細胞,其包含本發明的所述核酸或本發明的載體。對於本領域技術人員,已知可以用來將本發明的核酸或本發明的載體導入農桿菌的許多方法(如接合或電穿孔),以及可以用來將本發明的核酸或本發明的載體導入植物細胞的方法如不同的轉化方法(基因槍轉化、農桿菌介導的轉化)(sambrooketal.,2001)。
在另一方面,本發明涉及轉基因植物細胞,其包含作為轉基因的本發明的核酸或包含本發明的載體,並涉及轉基因植物或其部分,其包含所述轉基因植物細胞。例如,這樣的植物細胞或植物是用本發明的核酸或本發明的載體轉化(優選穩定轉化)的植物細胞或植物。本發明的轉基因植物優選適於用作單倍體誘導系。在所述轉基因植物的一優選實施方案中,所述核酸與一或多個調控序列可操作地連接,所述調控序列允許在植物細胞中的轉錄以及任選的表達。調控序列(優選dna)可以與本發明的核酸同源或異源。由本發明的核酸和所述調控序列組成的總結構然後可以代表所述轉基因。植物的部分可以是受精的或未受精的種子、胚、花粉、組織、器官、或植物細胞,其中所述受精的或未受精的種子、所述胚或所述花粉在所述轉基因植物中產生,且本發明的核酸或所述載體整合進其基因組作為轉基因。本發明而且還包括所述轉基因植物的後代,其基因組中整合有本發明的核酸或載體作為轉基因,且其適於用作單倍體誘導系。
在另一方面,本發明涉及由本發明的核酸編碼的蛋白或多肽。所述蛋白或多肽優選適於在植物中介導單倍體誘導系的特性,或適於增加單倍體誘導系的誘導能力。特別優選由所述誘導促進核酸編碼的蛋白或多肽。本發明的蛋白或多肽優選包括選自seqidno:16、17、18、19、20、21、22、23、24、25、44、45、48、51、54、63、64和/或65,或選自seqidno:32、33和/或34,或選自seqidno:40和/或41的胺基酸序列。
在另一方面,本發明描述了用於產生適於用作單倍體誘導系的植物的方法。所述方法可以包括以下步驟:
a)誘變植物細胞且隨後從所誘變的植物細胞再生植物或誘變植物,和
b)鑑定植物a),其在與本發明的核酸相同的內源dna序列中具有至少一個突變,或在所述內源dna序列的調控序列(例如,啟動子、增強子、終止子或內含子)中具有至少一個突變,所述突變導致與未誘變野生型植物相比所述內源dna序列在所鑑定的植物中轉錄或表達率的變化,或者與未誘變野生型植物相比所述內源dna序列編碼的蛋白或多肽在所鑑定的植物中活性或穩定性的變化,其中所述至少一個突變在所鑑定的植物中導致要介導的單倍體誘導系特性或要增加的單倍體誘導系的誘導能力。所述轉錄率或表達率的改變,或所述活性或穩定性的改變,優選地至少在所鑑定的植物的花粉中或在所鑑定的植物的花粉組織中出現。
來自步驟b)的內源dna序列,或所述內源dna序列編碼的rna或所述dna序列編碼的蛋白或多肽,優選對植物中花粉管的生長、植物花粉的能量代謝和/或優選在生殖細胞(例如其發育成花粉)中的著絲粒的活性有影響。
所述用於產生適於用作單倍體誘導系的植物的方法的步驟b)的內源dna序列特別優選地編碼snarev蛋白;磷脂酶類的酶,特別是磷脂酶a2或patatin磷脂酶;肌醇聚磷酸酯-5-磷酸酶類的酶,例如肌醇-1,4,5-三磷酸-5-磷酸酶;磷酸甘油酸變位酶或甲基轉移酶,特別是rna甲基轉移酶,其中,在snare蛋白、磷脂酶和甲基轉移酶的情況下,所述轉錄率或表達率或所述活性或穩定性優選被改變至其被增加的程度,且其中,在肌醇聚磷酸酯-5-磷酸酶和磷酸甘油酸變位酶的情況下,所述轉錄率或表達率或所述活性或穩定性優選被改變至其被減少的程度。
所述用於產生植物的方法的步驟b)非常優選地是從a)鑑定植物,其a)在與所述誘導促進核酸或編碼所述lncrna1的核酸相同的內源dna序列中,或在所述內源dna序列的調控序列(如啟動子、增強子、終止子或內含子)中具有至少一個突變,其中所述至少一個突變實現所述內源dna序列的轉錄或表達的增加或由所述內源dna序列編碼的蛋白或多肽的活性或穩定性的增加;和/或b)在與所述誘導抑制核酸或編碼所述lncrna2的核酸相同的內源dna序列中,或在所述內源dna序列的調控序列(如啟動子、增強子、終止子或內含子)中具有至少一個突變,其中所述至少一個突變實現所述內源dna序列的轉錄或表達的減少或由所述內源dna序列編碼的蛋白或多肽的活性或穩定性的減少,其中a)和b)的所述至少一個突變在所鑑定的植物中導致要介導的單倍體誘導系特性或要增加的單倍體誘導系的誘導能力。所述轉錄率或表達率的改變,或所述活性或穩定性的改變,優選地至少在所鑑定的植物的花粉中或在所鑑定的植物的花粉組織中出現。
突變意指dna水平的修飾,並因此是遺傳學和/或表觀遺傳學的改變。例如,遺傳學的改變可以是所述內源dna序列中或所述內源dna序列的調控序列中的至少一個核鹼基的取代。如果這樣的核鹼基取代發生在例如啟動子,這可以導致所述啟動子改變的活性,由於例如順式調控元件被修飾使得轉錄因子與所突變的順式調控元件的親和力相對於野生型啟動子被改變,由此具有所述突變的順數調控元件的啟動子的活性被增加或減少,取決於所述轉錄因子是阻遏物還是誘導系,或所述轉錄因子與突變的順式調控元件的親和力被增強還是被弱化。如果這樣的核鹼基取代在例如所述內源dna序列的編碼區中發生,這可以導致所編碼的蛋白的胺基酸取代,其可以導致所述蛋白的活性或穩定性與野生型蛋白相比的改變。遺傳學改變的另一實例是在所述調控序列和/或所述內源dna序列中的核苷酸缺失,以及在所述調控序列和/或所述內源dna序列中的核苷酸添加。das&martienssen(1995)顯示在玉米中通過轉座子誘變插入核苷酸來調控基因。表觀遺傳學的改變可以通過dna改變的甲基化模式發生。
本領域技術人員已知本發明含義中的突變如何能夠通過用於產生適於用作單倍體誘導系的植物的方法的步驟a)的誘變過程實現。此處所述誘變包括常規誘變以及位點特異性誘變或「基因組編輯」。在常規誘變中,dna水平的修飾不是以靶向的方式產生的。植物細胞或植物被暴露至誘變條件如tilling,通過uv光暴露或使用化學物質(tilletal.,2004)。隨機誘變的另外的方法是藉助轉座子的誘變。uniformmu項目製備了大容量的可免費獲得的突變體庫。所述庫和所述方法描述於mccartyetal.(2005)。位點特異性誘變使得可以以靶導向方式在dna中的預定位置在dna水平導入修飾。例如,talens(wo2010/079430,wo2011/072246),大範圍核酸酶(silvaetal.,2011),歸巢內切核酸酶(chevalier2002),鋅指核酸酶(lloydetal.,2005),或crispr/cas系統(gajetal.,2013)可用於此。
步驟b)中植物的鑑定可以藉助例如分子標記或探針實現。例如,dna探針是引物或引物對,其可用於pcr反應。例如,可以通過在tilling群體中測序靶基因來驗證或鑑別tilling突變體,或者通過額外的驗證dna中錯配的方法,例如熔解點分析或使用錯配特異性核酸酶。對此,本發明同樣包括可用於此的引物/引物對,例如,針對磷脂酶、磷酸甘油酸變位酶、甲基轉移酶和磷脂酶的lncrna的引物。通過轉座子產生的突變體也可以通過在跨越整個群體的pcr中使用轉座子特異性引物和靶基因特異性引物以及隨後測序pcr產物來驗證。本發明也涵蓋這樣的引物。例如,花粉中表達率的改變可以用rt-pcr確定;穩定性的改變例如可以通過檢查泛素結合位點和預測三級結構的變化來確定。此外,野生型蛋白以及相應突變體蛋白的重組表達,和隨後的生化活性測試也是合適的。可用於在步驟b)中鑑別植物的額外的手段和方法是本領域技術人員從現有技術中知曉的。
本發明還涉及分子標記,其證明所述內源dna序列中或所述內源dna序列的內源dna序列的突變的存在或不存在。例如,這樣的標記基於snp且對所述突變特異(實例:kaspar或taqman標記)。
本發明還進一步涉及能夠用前述方法產生或用前述方法產生的植物,或該植物的部分,其中所述植物的部分可以是受精的或未受精的種子、胚、花粉、組織、器官或植物細胞,其中所述受精的或未受精的種子、所述胚、或所述花粉在所述轉基因植物產生,且所述至少一個突變存在於其基因組中。本發明同樣還包括所述植物的後代,其具有所述至少一個突變且適於用作單倍體誘導系。已經用前述方法產生的植物的兩個實例是這樣的植物(優選玉米或向日葵),其在內源dna序列中,具有以下核酸,所述核酸(i)具有選自seqidno:8、9和/或46的序列或其功能性片段;或(ii)與來自(i)的序列互補;或(iii)與來自(i)的序列至少80%相同;或(iv)編碼具有選自seqidno:21、22、23和/或48的胺基酸序列的蛋白或所述蛋白的功能性部分;或(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其功能性部分;或(vi)與來自(ii)的序列在嚴格條件下雜交,或在所述內源dna序列的調控序列中具有至少一個突變,所述突變導致與未誘變野生型植物相比所述內源dna序列在所鑑定的植物中轉錄或表達率的變化,或者與未誘變野生型植物相比所述內源dna序列編碼的蛋白或多肽在所鑑定的植物中活性或穩定性的變化,其中所述至少一個突變在所鑑定的植物中導致要介導的單倍體誘導系特性或要增加的單倍體誘導系的誘導能力。所述突變優選地是在seqidno:8或9的編碼序列中的改變(例如,點突變),其導致seqidno:21、22或23中胺基酸位置74-78之間的胺基酸取代,或者所述突變導致seqidno.46的編碼序列中的修飾,其導致seqidno:48相應編碼序列中的胺基酸取代。這可以包含根據seqidno:49-54的突變。通過tilling導致的在seqidno:49中的突變導致第74位處的編碼的胺基酸的胺基酸取代,其中天冬氨酸被天冬醯胺取代(d74n);在seqidno:52中的突變導致第78位處的編碼的胺基酸的胺基酸取代,其中甘氨酸被精氨酸取代(g74r)。
此外,本發明還涉及一種分離在植物中介導單倍體誘導系特性或增加單倍體誘導系的誘導能力的核酸的方法,包括以下步驟:
a)根據本發明前面描述的方法產生植物,或提供能夠用本發明前述的方法產生或用本發明前述的方法產生的植物;和b)從來自a)的植物的基因組分離核酸,所述核酸包含具有所述至少一個突變的內源dna序列。步驟b)中所述核酸非分離可以通過ctab提取或通過dna結合柱實現;所述突變的驗證可以通過測序或分子標記如基於snp的kaspar或taqman標記實現,或例如對於插入或缺失突變,通過基於長度多態性的標記實現。
本發明還包括通過或可通過前述用於分離的方法獲得的核酸,以及包含所述分離的核酸的載體。
在另一方面,本發明還涉及用於產生適於用作單倍體誘導系的轉基因植物的方法。所述方法可以包括以下步驟:
a)提供上文所述的核酸,其在植物中轉錄或表達後,適於介導單倍體誘導系的特性或適於增加單倍體誘導系的誘導能力;或提供上文所述的分離的核酸,所述核酸包含所述具有至少一個突變的內源dna序列;或提供上述的載體之一,
b)通過導入來自a)的核酸或載體轉化(優選穩定轉化)植物細胞,
c)從來自b)的經轉化的植物細胞再生轉基因植物,和
d)通過優選地在所鑑定的植物的花粉中的或在所鑑定的植物的花粉組織中的改變的表達模式從c)鑑定轉基因植物,其中單倍體誘導系的特性被介導或單倍體誘導系的誘導能力被增加。所述用於產生適於用作單倍體誘導系的轉基因植物的方法還包括提供兩種或更多種上文所述的核酸(或者本發明的核酸的不同的實施方式,且任選地在一或多種載體中)以及通過導入兩種或多種核酸轉化植物細胞。可選地或額外地,除了本發明的核酸之外,還可以提供或轉化已知可用於產生單倍體誘導系的一或多種額外的核酸(例如,經操作的cenh3基因(ravi&chan,2010))。
所述表達模式優選地被改變為實現
(i)與野生型植物(例如其再生自等基因未轉化的植物細胞)相比,所鑑定的植物中所導入的誘導促進核酸或導入的編碼lncrna1的核酸的轉錄或表達被增加,和/或
(ii)與野生型植物(例如其再生自等基因未轉化的植物細胞)相比,所鑑定的植物中所導入的誘導抑制核酸或導入的編碼lncrna2的核酸的轉錄或表達被減少,和/或
(iii)由於轉錄後基因沉默,與野生型植物(其例如再生自等基因、未轉化的植物細胞)相比,在所鑑定的植物中具有與所述誘導抑制核酸相同的核苷酸序列的內源dna序列的表達率通過所導入的核酸編碼的雙鏈rna被降低,所述導入的核酸如上所述與轉錄後基因沉默相關,和/或
(iv)由於轉錄基因沉默,與野生型植物(其例如再生自等基因、未轉化的植物細胞)相比,在所鑑定的植物中具有與所述誘導促進核酸相同的核苷酸序列的內源dna序列或編碼lncrna1的導入的核酸的轉錄率或表達率通過所導入的核酸(其在上文詳細描述,與轉錄基因沉默相關)編碼的雙鏈rna被增加;和/或與野生型植物(其例如再生自等基因、未轉化的植物細胞)相比,在所鑑定的植物中具有與所述誘導抑制核酸相同的核苷酸序列的內源dna序列或編碼lncrna2的導入的核酸的轉錄率或表達率通過所導入的核酸(其在上文詳細描述,與轉錄基因沉默相關)編碼的雙鏈rna被減少。轉錄率的驗證可以通過例如qrt-pcr實現。改變的蛋白穩定性可以通過例如蛋白印跡確定。
本發明還進一步涉及能夠用前述方法產生或用前述方法產生的轉基因植物或該植物的部分,植物的部分可以是受精的或未受精的種子、胚、花粉、組織、器官、或植物細胞,其中所述受精的或未受精的種子、所述胚或所述花粉在所述轉基因植物中產生,且本發明的核酸或所述載體整合進其基因組作為轉基因。本發明同樣還包括所述轉基因植物的後代,其具所導入的核酸作為轉基因且適於用作單倍體誘導系。
在另一方面,本發明還涉及用於產生單倍體植物的方法,所述方法包括以下步驟:
a)是本發明的適於用作單倍體誘導系的非轉基因或轉基因植物和相同屬優選相同物種的植物雜交,
b)選擇受精的單倍體種子或胚,和
c)從來自b)的種子或胚再生單倍體植物。
所述適於用作單倍體誘導系的植物優選用作花粉親本且和相同屬優選相同物種的種子親本雜交。所述適於用作單倍體誘導系的植物還可以用作種子親本且和相同屬優選相同物種的花粉親本雜交。因此,步驟a)中的兩種雜交配對物,種子親本和花粉親本,還可以是相同個體。所述雜交步驟因此代表自交。
所述單倍體受精的種子或胚的選擇可以包括驗證所述單倍性的步驟,已將所述單倍體受精的種子或胚與多倍體受精的種子或胚分開的步驟。所述受精的種子或胚的單倍性鑑定可以通過表型或基因型實現,例如,其中所述誘導系具有胚特異性可視標記,其在全部二倍體後代是可見的,但在誘導的單倍體後代則不可見。此外,所述倍性狀態可以通過流式細胞術確定。此外,分子標記完整的、純合的模式為單倍體植物提供指徵。例如,所述分開可以自動地基於單倍性驗證的數據實現。
本發明還進一步涉及單倍體受精的種子或胚,其通過所述用於產生單倍體植物的方法的步驟a)中的雜交而產生,以及涉及單倍體植物,其能夠用所述方法或用所述方法產生,或者該植物的部分,其中植物的部分可以是種子、胚、組織、器官或植物細胞。本發明同樣還包括所述植物的後代。此外,本發明還包括雙-單倍體(二倍體)植物或其部分,其中所述雙-單倍體(二倍體)植物或其部分通過所述單倍體植物或其部分的染色體加倍而產生。
在另一方面,本發明涉及本發明的核酸或本發明載體在植物中介導單倍體誘導系或增加單倍體誘導的誘導能力的用途,或本發明的核酸或本發明的載體用於產生適於用作單倍體誘導系的植物或轉基因植物的用途。此外,本發明還包括本發明如上所述的植物的用途,其適於用作單倍體誘導系以產生單倍體的受精的種子或胚,或單倍體植物。之前針對本發明的主題和方法的解釋也適用於所記載的用途。
在另一方面,本發明還涉及用於外部施加至植物的工具。提供該工具用於外部施加至植物且適於在植物中介導單倍體誘導系的特性,或適於增加單倍體誘導系植物的誘導能力。施加優選在花葯形成、花粉形成或受精的時間點進行。所述工具包含具有雙鏈部分的rna,其中所述雙鏈部分的至少一條鏈具有與以下核酸的編碼序列中至少14、15、16、17、18、19、20、21、22、23、24或25個,優選至少30、35、40、45、50、60、70、80、90、100、120或140個,和特別優選至少160、180、200、250、300、350、400、450、500、600、700、800、900或1000個連續核苷酸同源或相同的核苷酸序列
(i)具有有義或反義方向的選自seqidno:26、27、28、29、30和/或31的序列或其功能性片段的核酸,或
(ii)與來自(i)的序列互補的核酸,或
(iii)與來自(i)的序列至少80%、82%、84%、86%或88%,優選地至少90%、91%、92%、93%、94%、95%、96%,或特別優選地至少97%、97.5%、98%、98.5%、99%或99.5%相同的核酸,或
(iv)編碼具有選自seqidno:32、33和/或34的胺基酸序列的蛋白或所述蛋白的功能性部分的核酸,或
(v)編碼根據(iv)的蛋白的同源物、類似物或直系同源物或其部分的核酸,或
(vi)與來自(ii)的序列在嚴格條件下雜交的核酸。
用於產生本發明所述工具的雙鏈rna可以通過本領域技術人員已知的方法體外產生。例如,可以合成產生所述雙鏈rna,其中所述rna直接在體外形成。從雙鏈dna出發,所述雙鏈rna還可以通過例如形成mrna轉錄物而合成,所述mrna轉錄物然後形成髮夾結構。所述工具可以用作在植物中誘導單倍體的觸發器。例如,在植物開花前或後,可以通過以噴霧形式噴灑至植物組織上,或者通過本領域技術人員公知的額外的方式外部施加至植物組織上,或者通過噴霧或和其它添加劑混合來使用所述工具。例如,添加劑可以是潤溼劑、載體物質或rna穩定劑如脂質體。
令人驚奇地,本發明人已經發現,影響花粉管生長、花粉的能量代謝和/或著絲粒活性(優選在發育成例如花粉的生殖細胞中)的基因或基因產物特別適於將非單倍體誘導系轉換成單倍體誘導系。為此,可以鑑定具有顯著重要性的多個基因家族/蛋白家族。它們用於產生單倍體誘導系的用途之前在現有技術中沒有被描述或提示。因為花粉的產生以及受精過程(包括花粉管的生長)在單子葉和雙子葉植物中遵循廣泛適用的原理,在本發明的技術教導下,即使是對之前既不存在高效體內單倍體誘導系統或其它用於產生雙-單倍體植物的基於細胞培養的方法的栽培植物,本領域技術人員也接受開發單倍體誘導系的可能性。為此,使用他從本發明獲得的遺傳學信息,他可以通過常規勞動發現所描述的基因產物的同源物、直系同源物或類似物,並如本文所描述對它們進行操作。然而,本發明的技術教導也適於進一步針對它們的效率(即單倍體誘導率)改良已經存在的誘導系,並因此第一次使得它們能夠經濟地應用。此外,本領域技術人員將該技術教導和單倍體誘導的其它已知機制組合,例如操作cenh3蛋白(ravi&chan,2010),並因此進一步增加效率。
本申請中使用的一些術語在下文詳細解釋:
「b73」是玉米育種系,其在玉米遺傳學中用作模型基因型且被用於產生第一個玉米參考序列。
「介導單倍體誘導系的特性」或「單倍體誘導系的特性的介導」或相當的短語意指,通過使用本發明的核酸,使得植物在與來自相同屬(優選相同物種)的不具有單倍體誘導系特性的植物雜交時能夠產生具有單一染色體組(單倍體)的受精的種子或胚。單倍體誘導系的特性,定義為絕對單倍體誘導率,指的是至少0.1%、0.2%、0.3%、0.4%、0.5%、0.6%、0.7%、0.8%、0.9%、或1%,優選至少1.5%、2%、2.5%、3%、3.5%、4%、4.5%、或5%,或特別優選6%、7%、8%、9%、10%、11%、12%、13%、14%、或15%,或非常特別優選,至少20%、25%、30%、35%、40%、45%、或50%的所述受精的種子或胚具有單倍體染色體組。
「表達率的增加」或「增加的表達率」或「表達的激活」或相當的表述意指核苷酸序列的表達率與指定的參照相比增加超過10%、15%、20%、25%、或30%,優選增加超過40%、50%、60%、70%、80%、90%、或100%,或特別優選增加超過150%、200%、250%、300%、500%、或1000%。所述表達率的增加優選地導致其中表達率被增加的植物的表型變化。改變的表型可以是介導單倍體誘導系的特性,或單倍體誘導系誘導能力的增加。
「轉錄率的增加」或「增加的轉錄率」或相當的表述意指核苷酸序列的轉錄率與指定的參照相比增加超過10%、15%、20%、25%、或30%,優選增加超過40%、50%、60%、70%、80%、90%、或100%,或特別優選增加超過150%、200%、250%、300%、500%、或1000%。所述轉錄率的增加優選地導致其中轉錄率被增加的植物的表型變化。改變的表型可以是介導單倍體誘導系的特性,或單倍體誘導系誘導能力的增加。
核苷酸序列的「功能性」片段意指核苷酸序列的節段,其具有與所述功能性片段所源自的完整核苷酸序列相同或相當的功能。因此,所述功能性片段可以具有與所述完整核苷酸序列在至少50%、55%、60%、65%、70%、75%、80%、85%、90%、92%、94%96%、97%、98%、或99%的長度上相同或同源的核苷酸序列。此外,核苷酸序列的「功能性片段」還可以指核苷酸序列的節段,其改變總核苷酸序列的功能,例如在轉錄後或轉錄基因沉默中。因此,核苷酸序列的功能性片段可以包括完整核苷酸序列的至少14、15、16、17、18、19、20、21、22、23、24、或25個,優選至少30、35、40、45、50、60、70、80、90、100、120、或140個,或特別優選至少160、180、200、250、300、350、400、450、500、600、700、800、900、或1000個連續的核苷酸。
蛋白的「功能性部分」意指蛋白的節段,或編碼所述蛋白的胺基酸序列的區段,其中所述節段可以在植物細胞中執行與完整蛋白相同或相當的功能。蛋白的功能性部分在至少50%、55%、60%、65%、70%、75%、80%、85%、90%、92%、94%、96%、97%、98%、或99%的長度上具有與所述功能性部分所源自的蛋白相同或類似(在保守性和半保守性胺基酸取代下)的胺基酸序列。
「單倍體誘導系」也意指體內單倍體誘導系。
術語「異源(heterolog)」意指所導入的多核苷酸源自例如相同物種中具有不同遺傳背景的細胞或器官或另一物種,或者對原核或真核宿主細胞是同源的但是然後位於不同的遺傳環境且因此不同於可能的、天然存在的相應多核苷酸。除了相應的內源基因外,可以存在異源的多核苷酸。
在本發明含義中,「同源物」應理解為具有相同系統發生起源的蛋白,「類似物」應理解為執行相同功能但具有不同系統發生起源的蛋白,而「直系同源物」是來自不同物種執行相同功能的蛋白。
「雜交(hybridizing)」或「雜交(hybridization)」應理解為這樣的過程,其中單鏈核酸分子被添加至以最大可能程度互補的核酸鏈,即形成鹼基配對。用於雜交的標準方法例如描述於sambrooketal.2001。這應優選地被理解為所述核酸分子至少60%,更優選地,至少65%、70%、75%、80%、或85%,或特別優選地,90%、91%、92%、93%、94%、95%、96%、97%、98%、或99%的鹼基和以最大可能程度互補的核酸鏈形成鹼基配對。這樣的添加的可能性取決於雜交條件的嚴格性。術語「嚴格性」涉及雜交條件。當鹼基配對更難實現時存在高嚴格性;如果鹼基配對較容易實現則存在低嚴格性。例如,雜交條件的嚴格性取決於鹽濃度或離子強度以及溫度。一般而言,可以通過提高溫度和/或降低鹽含量來增加嚴格性。「嚴格雜交條件」應理解為其中雜交主要僅僅在同源核酸分子中發生的那些條件。術語「雜交條件」因此不僅僅涉及所述核酸實際添加時的條件,還涉及後續洗滌步驟的條件。嚴格雜交條件例如是這樣的條件,在所述條件下主要僅僅那些具有至少70%,優選至少75%、至少80%、至少85%、至少90%或至少95%序列相同性的核酸分子雜交。嚴格雜交條件例如是在4xssc中在65℃雜交,以及隨後在0.1xssc中65℃重複洗滌共大約1小時。本文所用術語「嚴格雜交條件」還可以指在68℃在0.25m磷酸鈉、ph7.2、7%sds、1mmedta和1%bsa中雜交16小時,以及隨後用2xssc和0.1%sds在68℃洗滌兩次。雜交優選在嚴格條件下發生。
「增加單倍體誘導系的誘導能力」或「單倍體誘導系的誘導能力的增加」意指具有單倍體誘導系的特性的植物的單倍體誘導率被增加。具有單倍體染色體組且獲得自所述單倍體誘導系和不具有單倍體誘導系特性的相同屬(優選相同物種)植物的雜交的受精種子的數目因此可以比不使用本發明的核酸獲得的單倍體受精種子的數目高至少0.1%、0.2%、0.3%、0.4%、0.5%、0.6%、0.7%、0.8%、0.9%、或1%,優選至少1.5%、2%、2.5%、3%、3.5%、4%、4.5%、或5%,以及特別優選至少6%、7%、8%、9%、10%、15%、20%、30%、或50%,即,單倍體誘導率可以相對於之前實現的單倍體誘導率增加至少0.1%、0.2%、0.3%、0.4%、0.5%、0.6%、0.7%、0.8%、0.9%、或1%,優選至少1.5%、2%、2.5%、3%、3.5%、4%、4.5%、或5%,以及特別優選至少6%、7%、8%、9%、10%、15%、20%、30%、或50%。
「可操作地連接」意指在同一核酸分子中連接,使得所連接的元件以這樣的方式相互定位和朝向使得核酸分子的轉錄可以發生。與啟動子可操作地連接的dna在該啟動子的轉錄控制下。
植物「器官」指的是例如葉、枝條(shoot)、莖、根、營養芽、分生組織、胚、花葯、胚珠或果實。植物「部分」指的是多個器官的組合,例如花或種子,或器官的部分,例如來自枝條的橫切。植物「組織」例如是愈傷組織、貯藏組織、分生組織、葉組織、莖組織、根組織、植物瘤組織或繁殖組織。例如,植物「細胞」應該理解為,例如具有細胞壁的分離的細胞或其聚集物,或原生質體。
在本發明含義中,如果不是另外指明,「植物」可以是來自雙子葉植物、單子葉植物和裸子植物的任何物種。這些中的許多是,例如,大麥(hordeumvulgare)、雙色高粱(sorghumbicolor)、黑麥(secalecereale)、黑小麥(triticale)、甘蔗(saccharumofficinarium)、玉米(zeamays)、狗尾草(setariaitalic)、水稻(oryzasativa)、小粒野生稻(oryzaminuta)、澳洲野生稻(oryzaaustraliensis)、高稈野生稻(oryzaalta)、小麥(triticumaestivum)、硬粒小麥(triticumdurum)、球莖大麥(hordeumbulbosum)、短柄草(brachypodiumdistachyon)、海濱大麥(hordeummarinum)、節節麥(aegilopstauschii)、甜菜(betavulgaris)、葵花(helianthusannuus)、daucusglochidiatus、daucuspusillus、daucusmuricatus、胡蘿蔔(daucuscarota)、巨桉(eucalyptusgrandis)、erythrantheguttata、genliseaaurea、棉屬物種(gossypiumsp.)、芭蕉屬物種(musasp.)、燕麥屬物種(avenasp.)、林菸草(nicotianasylvestris)、普通菸草(nicotianatabacum)、絨毛狀菸草(nicotianatomentosiformis)、番茄(solanumlycopersicum)、馬鈴薯(solanumtuberosum)、中果咖啡(coffeacanephora)、葡萄(vitisvinifera)、黃瓜(cucumissativus)、桑樹(morusnotabilis)、擬南芥(arabidopsisthaliana)、琴葉擬南芥(arabidopsislyrata)、arabidopsisarenosa、須彌芥(crucihimalayahimalaica)、卵葉須彌芥(crucihimalayawallichii)、彎曲碎米薺(cardamineflexuosa)、北美獨行菜(lepidiumvirginicum)、薺菜(capsellabursa-pastoris)、小擬南芥(olmarabidopsispumila)、硬毛南芥(arabishirsuta)、歐洲油菜(brassicanapus)、甘藍(brassicaoleracea)、芫菁(brassicarapa)、芸苔(brassicajuncacea)、brassicanigra、蘿蔔(raphanussativus)、erucavesicariasativa、甜橙(citrussinensis)、麻風樹(jatrophacurcas)、大豆(glycinemax)、和毛果楊(populustrichocarpa)。根據本發明的植物優選是玉蜀黍屬(zea)的植物,特別是物種玉米(zeamays),或者是高粱。
「減少表達率」或「表達率的減少」或「表達的抑制」或「減少的表達率」或相當的短語意指核苷酸序列的表達率與指定的參照相比減少超過10%、15%、20%、25%、或30%,優選增加超過40%、50%、45%、50%、55%、60%、或65%,或特別優選增加超過70%、75%、80%、85%、90%、92%、94%、96%、或98%。然而,其還可以指核苷酸的表達率被減少100%。所述表達率的減少優選地導致其中表達率被減少的植物的表型變化。改變的表型可以是介導單倍體誘導系的特性,或單倍體誘導系誘導能力的增加。
「轉錄率的減少」或「減少的轉錄率」或相當的表述意指核苷酸序列的轉錄率與指定的參照相比減少超過10%、15%、20%、25%、或30%,優選增加超過40%、50%、45%、50%、55%、60%、或65%,且特別優選增加超過70%、75%、80%、85%、90%、92%、94%、96%、或98%。然而,其還可以指核苷酸的轉錄率被減少100%。所述轉錄率的減少優選地導致其中轉錄率被減少的植物的表型變化。改變的表型可以是介導單倍體誘導系的特性,或單倍體誘導系誘導能力的增加。
與本發明相關,術語「調控序列」涉及影響表達特異性和/或強度的核苷酸序列,例如其中所述調控序列介導明確的組織特異性。這樣的調控序列可以位於最小啟動子的轉錄起始點上遊,但也可以在其下遊,例如在轉錄但不翻譯的前導序列中或在內含子中。
「適於用作單倍體誘導系」意指與相同屬(優選相同物種)的不具有單倍體誘導系特性的植物雜交時,植物能夠產生具有單一染色體組(單倍體)的受精的種子。單倍體誘導系的特性,定義為絕對單倍體誘導率,指的是至少0.1%、0.2%、0.3%、0.4%、0.5%、0.6%、0.7%、0.8%、0.9%、或1%,優選至少1.5%、2%、2.5%、3%、3.5%、4%、4.5%、或5%,或特別優選6%、7%、8%、9%、10%、11%、12%、13%、14%、或15%,或非常特別優選,至少20%、25%、30%、35%、40%、45%、或50%的所述受精的種子具有單倍體染色體組。
本發明的設計和實施方案通過示例的方式,針對附圖和序列進行描述。
圖1:與b73(agpv02)相比所鑑定的基因的基因組排列:
snarev1(grmzm2g179789):在rws花粉中增加的表達;
snarev2(grmzm2g412426):在rws花粉中增加的表達;
itp(肌醇-1,4,5-三磷酸-5-磷酸酶)(grmzm2g106834):在rws花粉中減少的表達;
pl(patatin磷脂酶)(grmzm2g471240):編碼序列中的多態性;
mito1(線粒體輸入受體):僅僅存在於rws中;
mito2:與mito1同源,但縮短。僅僅存在於rws;
pgm(磷酸甘油酸變位酶)(grmzm2g062320):在rws中缺失;
lncrna:pl的同源物:在rws中缺失;
ac213048:用於序列比較的錨定基因;
mt(rna甲基轉移酶)(grmzm2g347808):在調控區域中的多態性。
grmzm名字涉及agpv02中的注釋。
圖2:誘導系rws和三個非誘導系對照(ni1、ni2、ni3)中基因snarev1、rna甲基轉移酶和patatin磷脂酶的rt-pcr。
圖3:rws花粉的rnaseq數據,投射到來自agpv02的人工參照,其中snare和磷脂酶基因座被rwsbac的基因座取代。(t1:轉錄物1,snare2的同源物,但具有改變的內含子結構。t2:snare1的同源物。編碼133aa的蛋白;t3:snare1/2的同源物。來自圖2的rt-pcr片段)。
qtl分析和候選基因的鑑定:
在玉米單倍體誘導系rws(其被歸因於誘導系stock6(coe,1959))中,在染色體1(bin1.04)鑑定到主要-qtl並精細定位。基於這些工作,rws中的該qtl應當進行驗證以及分子分析以鑑別和功能性驗證其中的基因。測試來自rwsx對照1(母體誘導系x非誘導系)的qtl定位群體的誘導能力。由此可以顯示已知的qtl很可能也存在於誘導系rws。然而,也有可能發現強的等位基因遷移取代非-rws(對照1)等位基因。
為了分子描述所述基因座,選擇了在dna和rna水平的多種測序方法。由於誘導系和參照基因組b73的結構差異,僅僅小比例的經典的、基於參照的測序方法獲得成功。廣泛的和複雜的生物信息學分析顯示結構差異將然後需要通過其它技術進行檢查(圖1)。
在序列捕獲方法中,在三個stock6衍生的誘導系,以及rws和5個非誘導系對照中所鑑定的qtl周圍的3兆鹼基被測序,且分析誘導系特異性多態性例如存在-不存在變異、snp和indel。最初,由此鑑定到16個候選基因,其中3個基因通過測序和分析表達數據確認:一個基因編碼花葯特異性patatin磷脂酶a2,其具有rws誘導系-特異性單倍型;磷酸甘油酸變位酶,其不存在於誘導系rws;以及rna甲基轉移酶基因,其在調控序列具有突變(圖2)。
針對rws、emk(衍生自stock6的另外的誘導系)和對照1開發了bac文庫,並用沿著所鑑定的qtl分布的探針篩選。針對大約150kb的靶範圍(其由dongetal.2013提及為在誘導系uh400中很可能是誘導系相關的),提取rws、對照1和emk的bac並測序。對bac序列進行注釋並與針對rws、對照1、emk和b73創建的全面的轉錄組數據進行比較。
結果,此處確認了誘導系中的缺失。因此,所檢查的母體誘導系缺少在染色體1上68.26至68.36mb之間(b73參照序列的agpversion2)的100kb的區域。此外,在誘導系的靶區域之外出現基因類似區域的倒位和無法和b73參照基因組以及對照1比較的大的重複性序列節段。
儘管有缺失,已經鑑定的磷脂酶仍然存在於所述誘導系中,但顯示強烈不同於對照的前述單倍型,以及在啟動子區域的顯著遺傳變異。由於所述缺失,上文已經鑑定的磷酸甘油酸變位酶不再存在。
此外,在所述100kb缺失中,也鑑定到非編碼rna(lncrna)。如所述磷酯酶一樣,其是花粉特異性表達,且顯示與所鑑定的磷酯酶具有82%的同源性。所述序列自身互補,即所述lncrna形成髮夾結構。非常高的表達率、與所述磷酯酶顯著的同源性以及通過sanger測序確定的低snp密度表明該lncrna對所述磷酯酶的調控功能。理論上,從該轉錄物也可以翻譯出88個胺基酸長的所述磷酯酶蛋白的截短版本。
為了能夠測量來自該區域的所鑑定的基因的表達水平差異,除了在dna水平測量多態性,也實施了rt-pcr和rnaseq實驗。除了作為誘導系的rwp(rws的子系),使用了三種遺傳學上非常不同的對照品系。從這些植物收穫花粉、沒有花粉的花葯和通過自交或雜交授粉後6-7天的胚。所述磷酯酶此處顯示在來自rwp的花粉中輕微的表達增加。所述甲基轉移酶在rwp的花粉中顯示弱表達,而在對照的花粉中無表達。lncrna花粉特異性表達,其也如期望的,在rwp中不存在。
另外對相同材料的花粉進行rnaseq以進一步驗證前述結果。
將轉錄組數據(rws的花粉rna的rna-seq)投射到人工參照上,其中b73中所述qtl的區域被rws-bac置換。該分析顯示所述磷酯酶在花粉中的表達。該基因的外顯子-內含子結構對應於b73中的結構,但在5』端存在缺失,其導致終止密碼子並因此導致縮短的蛋白。此外,在所述磷酯酶上遊和下遊檢測到三種額外的rws-特異性轉錄物。具有兩種轉錄物的區域位於所述磷酯酶上遊大約60kb。第一種轉錄物是非編碼的;第二種轉錄物編碼192個胺基酸長的蛋白,其顯示與線粒體輸入受體(mito1)的同源性。在b73中,這僅僅在所述qtl(grmzm2g174696)上遊15兆鹼基。所述磷酯酶下遊大約90千鹼基(kb)是另一轉錄物,其反過來顯示和所述192個胺基酸長的轉錄物具有高度同源性。
為了獲得所述qtl外的誘導系-特異性表達,在基因組範圍評估rnaseq數據。出乎意料地,在上文記載的精細定位的區域之外但接近所述區域鑑別到新的候選基因,其之前很可能由於seqcapture方法的技術限制而無法被發現。來自所述精細定位區域的所鑑定的磷酯酶上遊大約400kb是基因複合物,其在rws的花粉中,與對照相比表達顯著不同(至少係數為2)。該基因複合物含有三個基因:兩個基因注釋為snarev基因,其相互具有高度同源性且在rwp中過表達,而一個基因注釋為肌醇-1,4,5-三磷酸-5-磷酸酶且其表達在rwp中被減少。這些基因的克隆的轉錄物與公共注釋有部分不同,使得它們也可以編碼具有不同功能的蛋白,或也可以作為lncrna起功能。可以從該基因座分離來自rws的bac並測序。該序列被整合進人工參照以在agpv02中重新分析rnaseq數據(圖3)。除了轉座酶外,兩種rna(t1(seqidno:55、56、57和63)和t3(seqidno:60、61、62和65))以及具有131個胺基酸的orf的rna在該基因座表達(t2(seqidno:58、59和64))。除了所述轉座酶,全部轉錄物位於所述兩個snarev基因內或之間。儘管推測起來它們本身不具有snare功能,它們可以參與調控同源基因。該區域的序列捕獲數據顯示誘導系、對照和參照基因組之間存在顯著的結構差異。bac測序確認肌醇-1,4,5-三磷酸-5-磷酸酶基因在所述誘導系的基因組水平不存在,以及來自b73的lncrna的不存在,所述lncrna與所述肌醇-1,4,5-三磷酸-5-磷酸酶共享轉錄起始位點,但從相反鏈閱讀。從所述snare基因之一(grmzm2g179789)分離cdna也表明所述誘導系中複雜的結構改變,由於所述cdna的一部分對應於正鏈而一部分對應於參照的負鏈。
基因功能
總之,因此可以鑑定7個基因,其對於在玉米中的體內單倍體誘導或體內單倍體誘導能力可能是重要的。
在這些對花粉管生長特別重要的四個基因中:
所述兩個snarev基因編碼已知參與泡囊運輸的蛋白(文獻)。在模式植物擬南芥(arabidopsisthaliana)中,snarev蛋白已經被證明在花粉管頂端,其中它們參與磷酯類和果膠類的運輸(文獻)。在所檢查的玉米誘導系中觀察到的snarev蛋白的過表達將導致增加的花粉管生長。
模式植物菸草(nicotianatabacum)中能夠顯示磷脂酶a2也顯著影響花粉管生長。因此,抑制磷脂酶a2導致花粉管生長的抑制(kimetal.,2011)。在所檢查的玉米誘導系種,所鑑定的與所述磷酯酶具有顯著同源性的lncrna的不存在可能導致所述磷酯酶基因減少的表達率或翻譯率,其將促進花粉管的生長速度。
在擬南芥中肌醇-聚磷酸酯-5-磷酸酶的敲除突變體中,顯示花粉管不受抑制地生長。在所檢查的玉米誘導系中,肌醇-1,4,5-三磷酸-5-磷酸酶的減少的表達水平因此同樣可能導致加速的花粉管生長。此處所鑑定的與肌醇-1,4,5-三磷酸-5-磷酸酶相關的lncrna對表達率有調控作用。
因此與非誘導系相比,所檢查的玉米誘導系顯示所述四個基因被修飾的調控/表達率。該破壞導致顯著更快的花粉管生長,其也被由於線粒體運輸蛋白的表達或其調控導致的可能增加的能量代謝促進。這能夠導致花粉管中的生殖細胞的運輸和其生長相分離。結果是,可能出現不完全或不正確的授粉以及隨後的染色體消除。
已知活性著絲粒在染色體分布中起關鍵作用,且通過在dna或組蛋白水平的染色質修飾(此外,通過轉錄、rna相互作用和rna結合)而表徵和修飾。所述甲基轉移酶基因調控的改變可以在早期胚胎發生期間影響誘導繫著絲粒的活性,其最終導致誘導系基因組在早期種子發育階段的消除。
在所檢查的誘導系中,其顯示所述磷酸甘油酸變位酶基因不再存在。該基因的不存在可能負面影響花粉的能量代謝,並因此對授粉有影響。此外,所述能量代謝可以被線粒體膜蛋白影響。
所述基因中任意基因單獨地或任意組合可以負責單倍體誘導的效果。
產生新的體內單倍體誘導系
為了在其它作物類型或玉米非誘導系基因型中開發新的誘導系,或為了增加誘導系基因型的誘導能力,如下進行:
在其它作物類型或玉米非誘導系基因型中鑑別相應的基因:在單子葉植物例如玉米、水稻、小麥、黑麥或大麥中,所述花粉-特異性patatin磷脂酶強烈保守,因此這些的同源物容易鑑定。與此相反,調控性lncrna在大多數單子葉植物不存在。然而,如果它們存在,它們同樣可以使用顯著的同源性被發現,正如它們也存在於所檢查的玉米誘導系中。在雙子葉植物中,其它磷酯酶類型執行相應的花粉管生長的任務。為了鑑定這些,創建花粉或花粉管的rna文庫並針對本發明的特定磷酯酶進行篩選。在花粉中強表達的patatin磷酯酶已經能夠通過向日葵花粉的rnaseq鑑定(seqidno:46-48)。
所述snarev基因和所述甲基轉移酶基因不需要是花粉特異性的。例如,所鑑別的snarev基因之一(snarev1)在玉米中也不是以花粉特異性方式表達的。snarev1在野生型花粉中根本不表達。在注釋的基因組中,可以通過blastp鑑定snarev蛋白的同源基因和功能性區域。在未注釋的基因組中,將需要rnaseq數據來注釋和選擇snare基因。
同源性肌醇-1,4,5-三磷酸-5-磷酸酶或磷酸甘油酸變位酶為了用作候選基因,必須在花粉中表達。可以如上所述進行鑑定,通過blastp和隨後的花粉中的rt-pcr或通過花粉rnaseq數據的注釋。
候選基因的操作
可能的誘導系或增加的誘導能力可以通過轉基因表達上述的磷酯酶和/或snare和/或甲基轉移酶和/或磷酸甘油酸變位酶和/或lncrna和/或線粒體輸入受體實現。為此,可以從誘導系rws克隆相應的基因,包括它們的啟動子。這些基因可以被克隆進合適的轉化載體並被轉化進期望的植物。
花粉表達的肌醇-1,4,5-三磷酸-5-磷酸酶可以額外地或專一地通過例如rnai減少它們的活性。例如,為此產生髮夾結構,其然後包含合適的啟動子和終止子,允許所述髮夾結構在花粉形成的時間點或之前轉錄。這些髮夾構建體可以被克隆進合適的轉化載體並被轉化進期望的植物。
可選地或額外地,可以通過tilling、轉座子誘變或其它誘變方法或「基因組編輯」產生具有穩定所述磷酯酶和/或snare和/或甲基轉移酶、增加表達或增加活性的突變(例如在所鑑定的基因中)的植物。突變的蛋白的二級和三級結構的結構分析對此可能有用,所述突變的蛋白顯示例如較緻密的結構,並因此較少蛋白酶的攻擊點。此外,也可以考慮所述蛋白中在泛素相互作用中起作用的區域。在所述基因活性中心的突變體可以直接測試它們的活性。為了驗證所述磷酯酶的功能,已經檢查了多種tilling突變體的誘導能力。取代d74n(第74位的天冬氨酸取代為天冬醯胺)或g78r(第78位的甘氨酸取代為精氨酸)導致0.2-0.4%的母體誘導率。為了可選地或額外地操作所述肌醇-1,4,5-三磷酸-5-磷酸酶或所述磷酸甘油酸變位酶,,必須搜索敲除突變體或搜索降低所述基因活性的額外的突變體。
也可以改良stock6衍生的誘導系。這是可能的,通過上述的轉基因方法和通過導入所鑑定的候選基因中的突變。此外,有可能通過轉基因或非轉基因方法操作基因組中所述基因額外的拷貝,只要它們在花粉中表達。誘導能力的測試:
為了測試潛在誘導系的誘導能力,例如有如下可能:
1.向具有可見的隱性標記(例如對於玉米,有光澤的(bordesetal.,1997)或無葉舌(sylvesteretal.,1990))的品系授粉。通過流式細胞術測試表達該特徵的後代的單倍性。
2.向與所述誘導系遺傳學不同,最優通過多個標記不同的品系授粉。使用這些標記以鑑定純合植物。通過流式細胞術測試這些植物的單倍性。
採用兩種可能來測試誘導能力。
參考文獻
barret,p.,brinkmann,m.,&beckert,m.(2008).amajorlocusexpressedinthemalegametophytewithincompletepenetranceisresponsibleforinsitugynogenesisinmaize.theoreticalandappliedgenetics,117(4),581-594.
bordes,j.,devaulx,r.d.,lapierre,a.,&pollacsek,m.(1997).haplodiploidizationofmaize(zeamaysl)throughinducedgynogenesisassistedbyglossymarkersanditsuseinbreeding.agronomie,17(5),291-297.
chen,l.,tu,z.,hussain,j.,cong,l.,yan,y.,jin,l.,...&he,g.(2010).isolationandheterologoustransformationanalysisofapollen-specificpromoterfromwheat(triticumaestivuml.).molecularbiologyreports,37(2),737-744.
chevalier,b.s.,kortemme,t.,chadsey,m.s.,baker,d.,monnatjr,r.j.,&stoddard,b.l.(2002).design,activity,andstructureofahighlyspecificartificialendonuclease.molecularcell,10(4),895-905.
coe,e.h.(1959).alineofmaizewithhighhaploidfrequency.americannaturalist,381-382.
das,l.,&martienssen,r.(1995).site-selectedtransposonmutagenesisatthehcf106locusinmaize.theplantcellonline,7(3),287-294.
deimling,s.,f.k.,geiger,h.h.(1997).methodikundgenetikderin-vivo-haploideninduktionbeimais.[methodsandgeneticsofinvivohaploidinductioninmaize]presentationpflanzenzüchtung,38:203-224.
depicker,a.,stachel,s.,dhaese,p.,zambryski,p.,&goodman,h.m.(1981).nopalinesynthase:transcriptmappinganddnasequence.journalofmolecularandappliedgenetics,1(6),561-573.
dong,x.,xu,x.,li,l.,liu,c.,tian,x.,li,w.,&chen,s.(2014).marker-assistedselectionandevaluationofhighoilinvivohaploidinducersinmaize.molecularbreeding,1-12.
dong,x.,xu,x.,miao,j.,li,l.,zhang,d.,mi,x.,...&chen,s.(2013).finemappingofqhir1influencinginvivohaploidinductioninmaize.theoreticalandappliedgenetics,126(7),1713-1720.
fire,a.,xu,s.,montgomery,m.k.,kostas,s.a.,driver,s.e.,&mello,c.c.(1998).potentandspecificgeneticinterferencebydouble-strandedrnaincaenorhabditiselegans.nature,391(6669),806-811.
gaj,t.,gersbach,c.a.,&barbasiii,c.f.(2013).zfn,talen,andcrispr/cas-basedmethodsforgenomeengineering.trendsinbiotechnology,31(7),397-405.
gurr,s.j.,&rushton,p.j.(2005).engineeringplantswithincreaseddiseaseresistance:whatarewegoingtoexpress?trendsinbiotechnology,23(6),275-282.
kato,n.,he,h.,&steger,a.p.(2010).asystemsmodelofvesicletraffickinginarabidopsispollentubes.plantphysiology,152(2),590-601.
kim,h.j.,ok,s.h.,bahn,s.c.,jang,j.,oh,s.a.,park,s.k.,...&shin,j.s.(2011).endoplasmicreticulum–andgolgi-localizedphospholipasea2playscriticalrolesinarabidopsispollendevelopmentandgermination.theplantcellonline,23(1),94-110.
lloyd,a.,plaisier,c.l.,carroll,d.,&drews,g.n.(2005).targetedmutagenesisusingzinc-fingernucleasesinarabidopsis.proceedingsofthenationalacademyofsciencesoftheunitedstatesofamerica,102(6),2232-2237.
mccarty,d.r.,marksettles,a.,suzuki,m.,tan,b.c.,latshaw,s.,porch,t.,...&curtishannah,l.(2005).steadyt‐astetransposonmutagenesisininbredmaize.theplantjournal,44(1),52-61.
odell,j.t.,nagy,f.,&chua,n.h.(1985).identificationofdnasequencesrequiredforactivityofthecauliflowermosaicvirus35spromoter.
prigge,v.,xu,x.,li,l.,babu,r.,chen,s.,atlin,g.n.,&melchinger,a.e.(2012).newinsightsintothegeneticsofinvivoinductionofmaternalhaploids,thebackboneofdoubledhaploidtechnologyinmaize.genetics,190(2),781-793.
ravi,m.,&chan,s.w.(2010).haploidplantsproducedbycentromere-mediatedgenomeelimination.nature,464(7288),615-618.
f.k.,gordillo,g.a.,&geiger,h.h.(2005).invivohaploidinductioninmaize-performanceofnewinducersandsignificanceofdoubledhaploidlinesinhybridbreeding.maydica,50(3/4),275.
sambrook,j.,russell,d.w.,&russell,d.w.(2001).molecularcloning:alaboratorymanual(3-volumeset)(vol.999).coldspringharbor,newyork:coldspringharborlaboratorypress.
shibuya,k.,fukushima,s.,&takatsuji,h.(2009).rna-directeddnamethylationinducestranscriptionalactivationinplants.proceedingsofthenationalacademyofsciences,106(5),1660-1665.
silva,g.,poirot,l.,galetto,r.,smith,j.,montoya,g.,&duchateau,p.(2011).meganucleasesandothertoolsfortargetedgenomeengineering:perspectivesandchallengesforgenetherapy.currentgenetherapy,11(1),11.
sylvester,a.w.,cande,w.z.,&freeling,m.(1990).divisionanddifferentiationduringnormalandliguleless-1maizeleafdevelopment.development,110(3),985-1000.
till,b.j.,reynolds,s.h.,weil,c.,springer,n.,burtner,c.,young,k.,...&henikoff,s.(2004).discoveryofinducedpointmutationsinmaizegenesbytilling.bmcplantbiology,4(1),12.
twell,d.,yamaguchi,j.,wing,r.a.,ushiba,j.,&mccormick,s.(1991).promoteranalysisofgenesthatarecoordinatelyexpressedduringpollendevelopmentrevealspollen-specificenhancersequencesandsharedregulatoryelements.genes&development,5(3),496-507.
venter,m.(2007).syntheticpromoters:geneticcontrolthroughcisengineering.trendsinplantscience,12(3),118-124.
wang,y.,chu,y.j.,&xue,h.w.(2012).inositolpolyphosphate5-phosphatase-controlledins(1,4,5)p3/ca2+iscrucialformaintainingpollendormancyandregulatingearlygerminationofpollen.development,139(12),2221-2233.
zhao,y.,zhao,q.,ao,g.,&yu,j.(2006).characterizationandfunctionalanalysisofapollen-specificgenest901insolanumtuberosum.planta,224(2),405-412.
wo/2010/079430(bonasetal.)modulardna-bindingdomainsandmethodsofuse.
wo/2011/072246(regentsoftheuniversityofminnesota)taleffector-mediateddnamodification.
wo2012/030893(monsantotechnologyllc)molecularmarkersassociatedwithhaploidinductioninzeamays.
序列表
kws種子歐洲股份公司
單倍體誘導系
kws0220pct
de102015004187.8
2014-11-12
65
patentinversion3.5
1
47944
dna
zeamays
misc_feature
(12578)..(12677)
nisa,c,g,ort
misc_feature
(22322)..(22421)
nisa,c,g,ort
misc_feature
(42238)..(42337)
nisa,c,g,ort
1
atggggagcagtgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtca60
cttcttgctcatgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaat120
gccaaagaagttctccgtaagttcaagcacccttgtaacttgtgctttatatatatgatt180
ctcaatttatcattgacttttcctaatggctttcaacacagggcaccatgggtcttattc240
tacactgattgcccgcagaaagctgaattctctaacatatgggatactaacaaaactgca300
gctcccatgattcatcgggagatagtcaggtttggttactgtgagtcttcatcttcaaaa360
tcctcaagtgacaactctgaagagcgagatgaatgcggtatagttgatgcactggtgaca420
acaataacacaggtgaggatctcagagcccaagaggagtcggctgttcagcctacccaac480
gggttgttgccgccaagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtgg540
ctttctaacttgacctacctagctcccatccccatgcatgtataaacgacatttggggaa600
tgggtagaaaagcagagattagggattttcgtttccgtcggtgcagttttggtgttccaa660
tggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaat720
tttatgtttttgttttgtgtctgcagattcggaagatggacttggaggcaaggagcctac780
agcctagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaaca840
acgtcaagagtgagctcaagaggatatttgcgcccaatgccaggcaggctacccgggagg900
agctcctagagtttggaatggctgatactctcgctgtgagctaatgctaggacttgactg960
tgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtat1020
tcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtgtcat1080
tttgtcatgtcattacacatggttaggatacatacttaagtttctaacgtaggcgtccac1140
acaacggattggtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcag1200
gcacccttccccgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttc1260
cactcttcccccgccgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgt1320
ggccacactgttcgacgagcgagtcatagagctgctgagcgtgctcgctgatgcggcggt1380
ggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaa1440
ccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtc1500
tccaccacttccttcatcgagggccgactgcttggctcgctggccaggcagccgagcatt1560
agttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgg1620
gtctatccgcagagaggaagaagcagaagctctccgagatccaatccggcgttgaggaag1680
ctgaatcgctggtaaatagatgtcgcgacgcgttctgttttggggatccccttggctaac1740
gggacatacgacatttggggaatgggtagaaaagcagagattagggatttttcgtttccg1800
tcggtgcagttttggtgttccaacagagttgcgagatgtttatgtgccttagtcttcaat1860
ttgggggttgggggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaat1920
ggacctggaggcaaggagcctacagcctagcattaaggctggtttgcttgcaaagccgag1980
ggattataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa2040
tgccaggtaggctacccgggaggagctcgtggagtctagaatggctgatactctcgcagt2100
gagctaatgctaggacttgactgtgtctacgagactgctcctaataataaactgaagaaa2160
gcaaaagaaatcattcaacgtattcgccgaagagaactctacaaggtagtatgatgcttt2220
aattgctcatatacaagtgtcattttgtcatgtcattacacatggttaggatacatactt2280
aagtttctaacgtaggcatccacacaatggattggtgcacggttctgccgatgtatccca2340
cgcacgcgcatggaaggaggcaggcacccttccctgccgccccggatctcgcgccagcca2400
tcgccctaccccgcctgcccttccactcttccccctgaaagtcgcatagagggggggtga2460
atagggcgaatctgaaatttacaaacttaagcacaactacaagccgggttaacgttagaa2520
atataaacgagtccgagagagagggcgcaaaacaaatcatgagcaaataaagagtgagac2580
acgatgatttgttttaccgaggttcggttcttgcaaacctactccccgttgaggtggtca2640
caaagaccgggtctctttcaaccctttccctctctcaaacggtcacttagaccgagtgag2700
cttctcttctcaatcaaacggaacacaaagttcccgcaaggaccaccacacaattggtgt2760
ctcttgccttggttacaattgagtttgatcacaagaagaatgagaaagaaaagaagcgat2820
ccaagcgcaagagctcaaatgaacacaaatgtcgctctctctagtcactatttgatttgg2880
agtgattccggacttgggagaggatttgatcttttggagtgtctagaattgaatgctata2940
gctcttgtaatatgttgaaggtgggaaacttggatgccattgaatgtggggtggttgggg3000
tatttatagccccaaaacaccaaaaaaggccgttggaaggctgctctcgcatggcgcacc3060
ggacagtccggtgcgccagccacgtcagcagaccgttggggttcgaccgttggagctctg3120
acttgtggggcctctgggctgtccggtggtgcaccagacaggtcctgtaggatgtctggt3180
gcgccaactgcacgtgctctgtcctctgcgcgcgcaggcgcgcattaaatgcgttgtagt3240
caaccgttgcgcgcgaagtagccattgctctgctggcacaccggacagtccggtgaatta3300
tagcggagcgccctctgattttcccgaaggtagcgagttcagcttcgagtgccctggtgc3360
accggacactgtccggtgcgccaaaccagggtgccttccgggatgtcttttgctctcttt3420
gtttgaaccctttcttggtctttttattggcttattgtgaacctttgacacctgtaaaac3480
ttatagactagagcaaactagttagtccaattatttgtgttggacaattcaaccaccaaa3540
atcaattaggaaataggtgtgagcctaattccctttcaatctccccctttttggtgattg3600
atgccaacacaaaccaaagcaagtatagaagtgcataattgaactagtttgcataatgta3660
agtgcaaaggttacttagaattgaaccaataaatattttcataagttatgcatggattgt3720
ttctttattttcatcattttggaccacgcttgcaccacatgttttgtttttgcaaatcct3780
tttgtaaatagtcaaaggtaaatgaataagattttgagaagcattttcaaaatttgaaat3840
tttctccccctgtttcaaatgcttttcctttgacttaaacaaaactcccccctcaaaaat3900
cctactcatagtgttcaagagggttttaagatatcaattttgaaaatgctactttctccc3960
ccttttgaatataataagatatcaattgaaaaattcatcattttaaaaccttttgaaaat4020
gggtggtggtgcggtccttttgctttgggctaatactttctccccctttggcatgaatcg4080
ccaaaaacgaatacttgagtgaaatataagcccctttaactactttctcctgctttggcg4140
aacataatatgagtgaagattataccaaagttggagagttgcttgaagcgatggtgaagg4200
atgagttatggagtggaggttaagcctttgtcttcgccgaagattccaattccctttcaa4260
tacacctatgacttggttgaaaatatacttgaaaacacattagtcatagcacatgaaaga4320
gatatgatcaaaggtatattaatgagctatgtatgcaagacatcaaaagaaattcctaga4380
atcaagaatatttagctcgtgtctaagtttgttcatctagtggcttggtaaagatatcag4440
ctaattgttccttagtgttaatataggcaatctcgatatctccctttttttggtgatccc4500
ttaggaaatgataccgaatggctatgtgtttagtgcggctatgctcaacgggattatccg4560
ccatgcggattgcactctcattatcacatagaagaggaactttggttaatttttaaccat4620
agtccctaagggtttgcctcatccaaagtaattgtgcgcaacaatggcctgcggcaatat4680
actcggcttcggcggtagaaagagctacggaattttgcttctttgaagcccaagacacca4740
gggaccttcccaagaactggcaagtcctcgatgtactctttctattaattttacaccccg4800
cccaatcggcatccgaataaccaatcaaaatcaaaatgtggatcccgtaggataccaaag4860
cccaaacttaggagtatgaactaaatatctcaagattcgttttacggccgtaaggtgagc4920
ttccttagggtcggcttggaatcttgcacacatgcatacggaaaagcataatatccggtc4980
gagatgcacataaatagagtaaagagcctatcatcgaccggtatacctttttgatcgacg5040
gatttacctcccgtgtcgaggtcgagatgcccattggttcccatgggtgtcttgatgggt5100
ttggcatccttcatcccatacttgtttagaatgtcttgaatgtacttcgtttggctaatg5160
aaggtgccctcttagcgttgcttcacttgaaatcacaagaagtacttcaactcccccatc5220
atagacatctcgaatttctgtgtcatgatcctactaaattcctcacatgtagattcatta5280
gtagacccaaatataatatcatcaacataaatttggcatacaaacaaatcattgtcaaga5340
gttttagtaaataaagtaggatcggcctttccgactttgaaaccattagtgataaggaaa5400
tctctaaggcattcataccatgctcttggggcttgcttgagcccataaagcgccttagag5460
agtttatatacgtgattagggtactcactatcttcaaagccgggaggttgctcaacatag5520
acctcttccttgattggtccgttgaggaaggcacttttcacgtccatttgataaagcttg5580
aagccatggtaagtagcataggcaagtaatatgcgaattgactcaagcctagctacgggt5640
gcataggtttcaccaaaatccaaaccttcgacttgtgaataacccttggccacaagtcgg5700
gctttgttccttgtcaccacaccatgctcatcttgtttgttgcggaagacccacttggtt5760
cctacaacattttggttaggacgtggaactaaatgtcatacctcattcctagtgaagttg5820
ttgagctcctcttgcatcgccaccacccaatccgaatcttgaagtgcttcctctaaccta5880
tgtggctcaatagaggaaacaaaagagtaatgttcacaaaaatgagcaacccgagatcta5940
gtagttacccccttatgaatgtcgccgaggatggtgtcgacggggtggtctcgttggatt6000
gcttggtggactcttgggtgtggcgggcgttgctcgtcctccttgtcttgatcatttgca6060
tctcccccttgatctatgccgtcatctagaggtggctcatttgattgatcttcttcttca6120
tcaacttgagcttcatcctcattttgagtcggtggagatgcttgcatggaggaggacggt6180
tgatcttgtgtatttggaggctcttcggattccttaggacacacatccccaatggacatg6240
ttccttagcgcgatgcatggagcctcttcatcacctatctcatcaagatcaacttgctct6300
acttgagagccgttagtttcatcaaacacaacgtcacatgaggcttcaactagtccagtg6360
gacttgttaaagaccctatatgcccttgtgtttgagtcataaccaagtaaaaagccttct6420
acagtcttaggagcaaatttagattttctacctcttttaacaagaataaagcatttgcta6480
ccaaaaactctaaaatatgaaatattgggctttttaccggtgaggagttcgtatgatgtc6540
ttcttgaggattcggtgtagatataaccggttgatggcgtagcaagcggtgttgactgcc6600
tcggcccaaaaccgatccgaagttttgtactcatcaagcatggttcttgccatgtccaat6660
agagttcgattcttcctctccactacaccattttgttgtggcgtgtagggtgaagagaac6720
tcatgcttgatgccctcctcctcaagaaagccttcgatttgagagttcttgaactccgtc6780
ccgttgtcgcttctaattttcttgatccttaagccgaactcattttgagcccgtcttaag6840
aatccctttaaggtctcttgggtttgagatttttcctgtaaaaagaatacccaagtgaag6900
cgagaataatcatccacaataactagacagtacttactcccgccgatgcttatgtaagcg6960
atcgggccgaatagatccatgtgtaggagctccagtggcctgtcggtcgtcattatgttc7020
ttatgcggatgatgagagccaacttgcttccctgcttggcatgcgctacaaatcctgtct7080
ttctcaaaataaacatttgtcaatcctaaaatgtgttctccctttagaagcttgtgaaga7140
ttcttcatcccaacatgtgcaagtcggcgatgccagagccaacccatgttagtcttagca7200
attaagcaagtgtcgagttcagctctatcgaaatctaccaagtatagctgaccctctaac7260
actcccttaaatgctattgaatcatcacttcttctaaagacagtgacaccaacatcagta7320
aaaagacagttgtagcccatttgacacaattgggatacagaaagcaaattgtaatctaaa7380
gaatctacaagaaaaacattggaaatagaatggtcaggagatatagcaattttacccaat7440
cctttgaccaaaccttgatttccatccccaaatgtgatcgctctttggggatcttggttt7500
ttctcatatgaggagaacatctttttctccccagtcatgtggtttgtgcacccgctgtcg7560
agtatccagcttgagcccccggatgcataaacctacaaaataattttagttcttgatttt7620
aggtacccaaatggttttgggtcctttggcattagacacaagaactttgggtacccaaac7680
acaagtcttggagcccttgtgcttgcccccaacatatttggcaactaccttgccggattt7740
gttagtcaacacataagatgcatcaaaagttttgaatgaaatgtcatgatcatttgatgc7800
actaggagttttctttctaggcaacttggcacgggttggttgcctagagctagatgtctc7860
acccttatacataaaagcataattaggaccagagtgagacttcctagaatgaattctcct7920
aattttgttctcgggataaccggcagggtataaaatgtaaccctcgttatcctgaggcat7980
gggagccttgcccttaacaaagttggacaatcttttaggaggggcactaattttgacatt8040
gtttcccctttggaagccaatgccatctttaatgcccgggcgtctcccattataaagcat8100
gccacgagcaaatttaaatttctcattttctaagttgtgctcggcaattttagcatctag8160
ttttgctatatgatcattttgttgtttaattaaggtcatatgatcatgaatagcattaac8220
atcaacatctctacatctagtacaaatagatacatgctcaacagtagatgtagagggttt8280
gcaagaattaagttcaacaatcttagcatgaagaatatcatttttatccctaagatcgga8340
aattgtagttttgcaaacatcaaaatctttagccttagcaattaaattttcatttttctg8400
ttctaaggctagcaagagaaatgtttaattcttcaatcctagcaagcaaatcatcattat8460
tatctttaggattgggaattgaaacattacaaacatgtgaatcaaccttagcatttaaac8520
tagtattttcatgtctaaggttgtcaatcatctcatggcaagtgcttagctcactagata8580
gtttttgacatttttctacttctagggcgtaagcatttttaaccttaacatgtttcttgt8640
tttccttaataagacaatcctcttgggaatccaaaaggtcatctttttcatgaatagcac8700
taattaattcatttaatttttccttttgttccatgttaagattagcaaaaagggtacgca8760
agttatcctcctcatcactagcattttcatcactagaggtttcatatttagtggaggatc8820
ttgattttaccttcttccttttgccgtcctttgccatgaggcacttgtggccgacgttgg8880
ggaagaggagtcccttggtgacggcgatgttggcggcgtcctcgtcgtcggaggagtcgc8940
ttgagctttcgtcggaatcccactcccgacaaacatgggcatcgccgcccttcttcttgt9000
agtacttcttcttctcctttcttctccccttcttgtcgtcgccacggtcactgtcactag9060
atatgggacatttagcaataaaatgaccgggcttaccacatttgtagcaaaccttcttgg9120
agcgggacttgtagtctttccccctcctttgtttgaggatttggcggaagctcttgatga9180
cgagcgccatttcctcattgtcgagcttggaggcgtctattggttgtcgacttggtgtag9240
cctcctccttcttttcttccgttgccttgaatgcgacgggttgagcttcggatgtggtgg9300
gttcgtcaagctcattgatctttctcgagccttcgatcatgcactcaaaacttacaaaat9360
gcccgataacttcctcgggggtcattttagtatatctaggattaccacgaattaattgaa9420
cttgagtagggttaaggaaaataagtgatcttagaataaccttaaccacctcgtggtcgt9480
cccactttatgctcccgaggttgcgcacttggttcaccaaggtcttgagccggttgtaca9540
tgtgttgtggctcctcccctttgcgaagccgaaaccgaccgagctccccctcgatcgttt9600
cccgcttggtgatctttgtgagctcgtctccctcgtgcgcggttttgagcacatcccaaa9660
cctccttggcgctcttcaacccttgaactttgttatactcctctctacttaaagaggcga9720
ggagtattgttgttgcttgagagttgaagtgctcgatttgggccacctcatcctcatcat9780
aatccttatcccctacggatggtacctgtgcaccaaactcaacaacatcccatatacttt9840
tgtggagtgaggttagatgaaatcgcattaaatcactccacctagcataatcttcaccat9900
caaaagttggtggtttgcctaacgggacggaaagtaaaggtgcatgtttagaaatgcgag9960
ggtagtgtaggggaatcttactaaacttcttacgctcttggcgtttagaagttacggagg10020
gcgcgtcggagccggaggttgatgttgatgaagtgtcggtctcgtagtagaccactttcc10080
tcatcctcttttgtttgtccccactccgatgaggcttgtgggaagaagatttttccttct10140
tctctttgtggtgagaagaagatttcttctccttccctttgttggaggagctcttcttct10200
tctccctccgtttggtgcgggactcttccaatgaagtgctctcgttgcttgtagtgggct10260
tttcgccggtctccatctccttcttggcgtgatctcccgacatcacttcgagcggttagg10320
ctctaacgaagcaccgggctctgataccaattgatagtcgcctagagggggggtgaatag10380
ggcgaaactgaaatttacaaatataaacacaactacaagccgggttagcgttagtaatga10440
agaaacgagtccgcgagagagggcgcaaaacaaatcgcaagcaaatgaagagtgtgacac10500
gtggatttgttttaccgaggttcggttctcgcaaacctactccccgttgaggaggccaca10560
aaggcccgggtctctttcaacccttccctctctcaaacggtccctcggaccgagtgagct10620
tctcttctctaatcaaagttgggaacaaaacttcccaacaagggccaccacacaattggt10680
gcctcttgccttgattacaatgggtttttgatcacaagaacaagtgcgaaagaaaagaag10740
caatccaagcgcaagagctcaaaaagaacacggcaaatctctctctctaatcactaaagc10800
cttttgtggaattggagaggatttgatctcttttggtgtgtctagaattgaatgctagag10860
ctcttgtagtagttgagaagtggaaaacttggatgcaatgaatggtggggtggttggggt10920
atttatagccccaaccaccaaacttgaccgttggctgggttgtctgttcgatggcgcacc10980
ggacagtccggtgcacaccggacagtccggtgcccctgccacgtcatcactgccgttgga11040
ttctagccgttgaagcttctgacttgtgggcccgcctgggtgtccggtgcacaccggaca11100
tctactgttccttgtccggtgtgccggagtgggcgcgcctgacatctgcgcgcgcagagc11160
gcgcattaaatgcgcggcagagagccgttggcgcggaaatagccgttgctctcgagtcgc11220
accggacagtccggtgcacaccggacagtccggtgaattatagtggacgggccgatggct11280
tttcccgaagctggcgagttcctgaggccgacctcccttggcgcaccggacactgtccgg11340
tgtacaccggacagtccggtgaattatagcggagtcgcctctggcaattctcgaaggggg11400
cgagttggagcttgagtcctctggtgcaccggacgctgtccggtgtacaccggacagtcc11460
ggtgctctcagaccagagggccttcggttcccactatgctcctttgttgaatccaaaaac11520
ttggtccttttattggctgagtgtgaaccttttacacctgtgtaatctataaacttgtgc11580
aaacttagttagtccaattgtttgtgttgggcaattcaaccaccaaaattaattagggac11640
taggtgtaagcctaattccctttcagttttcccgggcggtcatccatagaacaggtcctt11700
acggagaggcactcgagaaaccgctcgagcccccttgaagaccacaagcacaacatcata11760
ataagagaagggaaaacagcgtatcatagataatctcatcatgttcattgattagagtta11820
agcaatagcataaagctaaacagtaataatccaacccaaataggtgaacaaggacatgga11880
taacaaaagctagtcaatccttaggcataaatgtgtaaagcgggaggtgaattaaataat11940
gaataggacatagataggtcaagggacacttgcctccaccaaccgactgctgctcagggg12000
cttctcctgcgggttcctcgggctcttcaaccggatcgttctctatgcgagcgcaaacat12060
acacacatccacatatttaataccaaagaacagtacaccatacaatagaatgcaataagt12120
aaacagacgttccacgcgggctcgcgagtacggttaagagagaaagaggaaaagacagtc12180
gagaaacgatcacgttgcatgattataaattagccactagcttaatggaaggaaatttaa12240
tgtagacactatgtttagcgtaaagtaaagtcatgtttcatgtctaattattataagcag12300
gtggagacaaataaaaggatagccgcgcggcgagacgcgcgacaaagctctctaaaacaa12360
attaagaagttaacgactcgtcgcgcgactgagcacgcagcgagacacttcgccttagtt12420
aagaggagacgttaagcgtcgcgcgacgaagcgcacgacggcatacgtcgactaaactga12480
gtccaaagtggaacgtcgcgtcaattcccacgcggcgttacaccttaaacaacctgaaac12540
aaaatgaacgaatcaagcctgatcatccgccccccccnnnnnnnnnnnnnnnnnnnnnnn12600
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn12660
nnnnnnnnnnnnnnnnnaccgttcaacaaacaccgcccggggttccaaccgaccaggctc12720
cagatcgcccgaccgatcccccgtcccggcctcgcccccccccttcttcgctcgcccgcg12780
ctcgcggtgattgctcgttaagaagcgtgttgcgtcgcgcgcttcgccgcgcgacggatt12840
tatctaaaattcagattctatcctgtgttgcgtcgtgcgtttcgtcgcgcgacgatccat12900
tttgtttcaggttgtttaaggtgtaacgccgcgtgtgtattcacgcgacgttccactttg12960
gactcagtttagtcgacgtatgccgtcgtgcgcttcgtcgcgcgacgcttaacgtctcct13020
cttaactaaggcgaagtgtctcgctgcgtgctcagtcgcgcgacgagtcgttaacttctt13080
aatttgttttagagagctttgtcgcgcgtctcgccgcgcggctatccttttatttgtctc13140
cacctgcttataataattagatatgaaacatgactttactttacgctaaacatagtgtct13200
acattaaatttccttccattaagctagtggctaatttataatcatgcaacgtgatcgttt13260
ctcgactgtcttttcctctttctctcttaaccgtactcgcgagcccgcgtggaacgtctg13320
tttacttattgcattctattgtatggtgtactgttctttggtattaaatatgtggatgtg13380
tgtatgtttgcgctcgcatagagaacgatccggttgaagagcccgaggaacccgcaggag13440
aagcccctgagcagcagtcggttggtggaggcaagtgtcccttgacctatctatgtccta13500
ttcattatttaattcacctcccgctttacacatttatgcctaaggattgactagcttttg13560
ttatccatgtccttgttcacctatttgggttggattattactgtttagctttatgctatt13620
gcttaactctaatcaatgaacatgatgagattatctatgatacgctgttttcccttctct13680
tattatgatgttgtgcttgtggtcttcaagggggctcgagcggtttctcgagtgcctctc13740
cgtaaggacctgttctatggatgaccgcccgggaaaacagtgcaaccatgagggtggaat13800
ggggtgcccttagctgaataattagaggatccggggtgtagttcacttagccgtcgtgcc13860
gtcaatggggctcggtgtatgcggctcgctctgccaagtttgggttcgccccttggggag13920
gagtgcggtgcatttaggaaacctaacgggtggctacagtcccggggaatctttgtaaag13980
gctatgtagtgatgccctgctgggtcaccttggtagtgatcaatggagagtcatgatctc14040
cgggtagaatgggaatcacggcttgtgggtaaagtgcacaacctctgcagagtgtttgaa14100
aactgatatatcagccgtgctcacggttatgagcggccaagggagctccagtgattagtg14160
gtacttgatcagagatactttggtacaggtggttatgagatcgatgattctggttatgac14220
tatgatgctggtaagtggtactctttccgtttggaaaggagtacgtttgggttaataact14280
tgggttaatgctaaaacttggctttctattagtaaataataatctgaccaactaaaagca14340
actgcttgacttatccccacataaagctagtccactacagccaaacaggatacttgctga14400
gtatgttgatgtgtactcacccttgctctacacaccaaaccccccccccatccccaggtt14460
gtcagcattgcaaccactgctcagtcgaagatgaagctgtggaaggagacttccaggagt14520
tccaagattacgatgagttctaggtgtgggttagcggcaacccccagtcggctgcctgtg14580
aaggccgcggttatctacgtttcttttccgcactttgatttattgtaagaactatatgga14640
cgtctcagacgtatgatgtaatcgactatttcccttagtaatactattttgagcactgtg14700
tgatgatgtccatgttatgtaactgctgtgtacgtgaataactgatcctggcacgtacat14760
ggttcgcattcggtttgccttctaaaaccgggtgtgacataagtggtatcaaagccgtgc14820
tgactgtaggaccgctaacctagaatagaatggtcgctctaaggactatagacctctgtc14880
tctgccttgactttgatatcccttcaaaagttggtcataccgaccaaacctatgttctac14940
tatatattataccttgctgaaaatcatgttttattccagtccttcatttacttatgattc15000
attatttgctggtcatattaattctgttctcacctttttgcttgcgatgtcttttgtaga15060
tggctcgacttagacacactgcacgaaagtcagtcatccccttcttaccctcccgccttg15120
ctgagcgtccgcttcgccgtcccgtggccggacagtccagccacttggagagactacacc15180
accgcctgcgtgaggagcaggagcgtcgacgacaggagcagcagagctcttccttctcgc15240
tccaccaggagatagagtctgtgaggagctgctcccctgtgcttcctctggagccgcccc15300
ctccaccaccactgggcgccccagcttctggagtagctgctggaggagacccagacgatg15360
gagatggcgacgacagctcgagccacgacaccgacttctctgctaaccttgagccggaag15420
gatgggttactcgacccatcactcgcgacgctgctcgcgggtgtcacttccacgatgcgc15480
tcgacaccctgctacgtcgggcatttaaccagcatacttggtctgtcgagtatcgctgtg15540
tggtctaccagcacagtcgcggggtctacccggaccgctgggaggcaacctgcttggtgc15600
gctgcccggagaacagtctccagggtgcggaggcctgctcagagcactattctatctctg15660
agcgggactcagctgaggcagccatgcaagatgctgcacggcgtgcgctttcgcactact15720
gctcggttttcggtggggcagctgacggtcttgacctgaagtattacccccgccgtccat15780
ctggcagcacaggaggcgtgattgtctcacctgtcggtgagggcaatcctaggttgagca15840
gcacagtcaacctagccgccgtgctaaacacggagctggaccatgcattagacgagctga15900
gtagggctcgtgctgagatcgccctgctgcgggctgagcgcgcggaacgtcgtcacctgg15960
atggtggttcccccgctcccgtcgggactcagcacccgtaccgctcacctcagcgtggac16020
accagtcttatggcaatcccgcctgcaagaccaagataactctagaaccatatatcgtta16080
gagttggatcttgtaattaatacgaaatatatacatagaagcttcagtcttagcgttagt16140
ctcggtcttagttagtcttagttaaacagggtagtttgctatatcctgtgcatttatgtt16200
tgtcatgatgaactatgtttggtttggatctttgtaatgattgtcaccagagtgtgggta16260
ccccctgcattttggtttacctattatgttaatagagttagttatatagttgggaaacct16320
tttattccactctcctctttatctgagaaactgtgtggtctgtgttggagatcagtgaag16380
atgctcatctgttcagtgctgttgaagaattctattctcttttcttatgctgcaagattt16440
gccagatcagtcctgatgtgtggttgcattctgcagatgtcagagaacaggcgcagagga16500
ggaaggcgtgctcagcaggagcgagccgctcaacaggaggaggtgccccagcagcagcac16560
ctgccgcccccgcccccgatgtccatcgagcagatgtttctgatgcagactcaggcagtt16620
caagccatcggtcagactctggccgccattcagcagcagcagcagcagcaggccccacct16680
caacctcagatgcctcagatgcccagagacaagcgtgctgaattcatgagaggtcatcca16740
ccaacgttcgctcattcttctgaccctatggatgctgaagattggctgcgcactgtggag16800
cgggagttgcataccgctcagtgcgatgacagggagaaagtcctgtatggtccccgtctg16860
ttgagaggagcagcccagtcatggtgggagtcttacctcgccacccatgcccatcctgac16920
gccatcacctgggaagagttcagaggtagctttcgtcagtaccatgttcctgcaggtctg16980
atgacagtgaagaaggaggagttcctggccctcaaggaagggccattgtctgtcagtgag17040
taccgagacaggtttctgcaattgtctcgctatgctcctgaagatgtcaacaccgacgcc17100
aagcgacagtaccgtttcctgagaggcttggttgaccctctgcagtaccaactgatgaat17160
cacaccttcccgacattccagcacctgattgacagagcaatcatgacagaaggaagcgta17220
aggagatggaagatcgtaagcgcaagatcagtggaccccagcctggaagcagcaatcgtc17280
ctcgtttctcaggcaatcaacctcagcagttcaggcagaaccagcgtccacctcagcagc17340
agcagcaattccaaaggcagtatcctcagcaccagtaccagaaccgtcagagcaatcagt17400
caggaggtcagtttcagaggcagaatcagcaagcacctcgtcttcctgccccagcaaatc17460
agcagaacagtcaggcagcaccagctcaggttggaaacagagcatgtttccactgtggag17520
agcaaggccactgggtgatgcaatgtccgaagaaggcagcccagcagcagtcaggcccca17580
atgccccagcgaagcagaatgtgcctcagcctggagcaggcaatcgctctcagccgcgct17640
ataatcatggaaggctgaaccacttggaggctgaagcagtgcaggagacccccggcatga17700
tagtaggtatgttcccagtcgactcccatattgcagaagtgttatttgatactggagcaa17760
cgcattctttcattactgcatcatgggtagaagcacataaccttccaattactaccatgt17820
caacccccattcaaattgactcagccggtggtagaattcgagccgatagcatttgtttga17880
atataagtgtggaaataagggggatagcgtttcccgccaaccttatagtaatgggtactc17940
aggcaatagatgtcatcctagggatgaattggctagataagtatcaggcagttatcagtt18000
gtgataaaaggaccatcaagttggtgtccccactaggagaggaagtggtgaccgagttag18060
tcccgcctgagccaaagaaaggaagttgttatcagatagctgttgatagcagtgaagcag18120
acccaatcgagaggatcaaggttgtgtccgagttcccagatgtgtttccaaaggacttac18180
cgggtatgccaccagagcggaaagttgagtttgctatagagcttcttcccggaaccgccc18240
ctatctttaagagagcttacagaatatctggaccagagttggttgagcttaagaagcaga18300
ttgatgagctgtcagagaaaggttacattcggccaagcacctcgccttgggccgcccctg18360
tcctatttgtggagaagaaagatggcaccaagaggatgtgtatcgattatcgagctttga18420
atgaagtcacgatcaagaacaagtatcccttgcccagaatagaagatttgttcgaccagt18480
tgagaggagccagtgtgttctccaagattgatctgaggtcaggttatcatcagctcagga18540
tccgaccttcggacattccgaagacggcattcatttccaagtatggtttgtatgagttca18600
cagtgatgtcttttggtttgaccaatgcgccagcgttcttcatgaacttgatgaacagtg18660
tattcatggattatctcgataagtttgtggtggtattcattgatgacattctggtttatt18720
ctcaaagcgaagaagagcacgcagatcatttgaggttggtattgcagagattgcgagagc18780
atcagttgtatgcaaagttgagcaagtgtgagttctggatcagtgaggtcctgttcttgg18840
gtcacataatcaacaaagaaggattggttgtggatccgaagaaagtggcagacattttga18900
actggaaagcgccaacagatgctagaggaatcaagagtttcattggaatggccggatact18960
atcggcgattcattgaagggttttcgaagatcgcaaaaccaatgacagcgttgctaggca19020
acaaagttgagttcaagtggacccagaaatgtcaagaggcctttgaagcgctgaaagaga19080
agttgactacagcgcctgtcctagtcttgcctgatgtgcacaagcccttctcagtgtatt19140
gtgatgcttgttacacaggtttgggatgtgtgttgatgcaagagggaagagttgtggctt19200
actcgtcccgacaactgaaggttcatgagaagaattacccaatccatgatctagagttgg19260
cagcagtggttcacgcactgaagtcatggaggcactatctgtatggacagaaatgcgatg19320
tttacacagatcacaagagtctgaagtacatattcactcagtcagagttgaacatgaggc19380
aacgaagatggttagagttgatcaaagattatgagttggagattcattaccatccaggca19440
aagcaaacgtagtggcagatgctttgagcagaaagagtcaagtcaatctgatggtcgctc19500
gtccgatgccttatgagttggccaaagagtttgacaagttgagtctcggttttctgaata19560
attcgcgaggagtcaaagttgagttggaacctaccttggagcgcgaaatcaaagaagcgc19620
agaagaatgatgagaaaatcagcgagatccggcgactgattctagatggcaaaggcaaag19680
aatttcgagaagatgcagaaggcgtgatatggttcaaagaccgcttgtgtgttcctaatg19740
tccagtctattcgggagttgattctcaaggaagctcatgagacgtcctattcgattcacc19800
ctggcagtgagaagatgtatcaggatctgaaaaagaaattctggtggtacggaatgaaga19860
gggagatcgcagagcatgtggctaggtgcgatagttgccgaagaattaaggcagagcacc19920
agagacctgctggattgttgcaaccattgcagatccctcagtggaaatgggacgaaatcg19980
gtatggatttcatagtcggattgcctcgcactcgagccggctacgattccatctgggtag20040
tagtggaccgcttgaccaagtcagcccacttcatacctgtcaagaccaactacagcagtg20100
cagtattggcagaattgtatatgtctcggatcatttgtcttcatggtgtgccaaagaaga20160
tagtgtcagacagaggaacgcagttcacctctcatttctggcagcagttgcatgaagctt20220
tgggcacgcatctgaatttcagttcagcttatcatccacagacagatggccagaccgaaa20280
ggaccaaccaaattcttgaagatatgttgagagcctgtgcgttgcaagatcagtccggat20340
gggataagagattgccttatgcagagttttcctgtaacaacagttaccaggccagcttga20400
agatgtcaccatttcaggcgctctatggaaggagttgtagaactccgttgcaatgggatc20460
agcctggagaaaagcaagtgtttgggccagacattttgcttgaagccgaagagaacatca20520
agatggtccgagagaatctgaagatagcgcaatcgaggcagcgaagctatgcagacacaa20580
gaagaagagagctgagtttcgaagtcggagactttgtctatctgaaagtgtcaccaatca20640
gaggagtcagaaggttcggagtgaaaggcaagctagcaccccgctacattggtccgtatc20700
agattcttgcaagacgtggagaagtggcctatcagctcagtttgccagagaatttgtctg20760
ctgtgcatgatgtctttcatgtgtctcagttgaagaagtgcttgcgtgtgccagaagagc20820
agttgccagtagaaggtcttgaagtccaggaggacttgacctacgttgagaagccagtgc20880
aaatccttgaggttgcagaccgagtcacctggaggaagaccatcagaatgtgcaaagtca20940
gatgggatcatcactctgaggaagaagcaacctgggagcgtgaagatgatctgatggcca21000
agtaccctgagctctttgctagccaaccctgaatctcgagggcgagattcttttaagggg21060
gataggtttgtaacgccctgaatttgggggtagaatttttcttcttttctctcaccaaat21120
tcgggcgttactctcttttctctttccccgtttgctccttcttcccaatttcaaaccagt21180
atagcggcaggtgtccgtgtcatgtataaaccaaaacctaagtgtcatgggtgttgcatc21240
atgccgaagcacatttctttgtctgatgttgagtgttcgtctcgttccgttccggatttc21300
ggttcgcgatttaattccgtttagtggtccgcgctcgtcgcgggttttcgatccgcgaag21360
tggcccgacccatcccaacctagtccagcccagcccagccggcccggcccgccccggcct21420
gcgcgcccctggcgcccaaacccccccatgcgccccccctcctctctctctctcatttgg21480
atctcccgcgcaacaacctctcctctccctcttccacctctctctccccgtggtgcccta21540
ggatttggagacggcgatcaccggattttggaccccgaggtgagctcccctcccctcccc21600
ttctcttctctctctctctccctcctcttcttctccccacgcgcgccccccttctccccc21660
tgctcacgcgcgcccctgcccgcccccgctcaccggcggcgcggcgccccccgcccctgc21720
ccggccccgcgcggcggcgcccgccctcccccggccgcggcggcgctcgtccgccctcac21780
ccccggccgcggcgcccgcccctccccctcgcgcgcggaggcggcgcccgccctcaccac21840
gacccgcagcgaccccgccccgtccccgcctctccccccgcgcgcggcggcgcccgcccc21900
ccgcccctccccgcggcggcgctcgcccgcccgcccctgcccctccccgcggcggcgccc21960
gcccgcccctgctcgcgcgcagcgcggccccggcgcgcccccggcatggcccggcgcggc22020
cccggtggcccctgctcgccggcgcgaccccggcgtggcccccagcccccggcgcgtccc22080
cggcgcggccccggccggctcggccgcccctggccggttcaaccgccccccctggccggt22140
tcaaccgccctggccccagctcgcccgcccgttcccccgtcccggcctcgcccgaccgta22200
tcttcgctcgcccgcgctcgcggtgattgctcgttaattagcgtgttgcgtcgcacgcta22260
cgccgcgcgacggatttatctaaaattcagattctatcctgtgatacgtagtacaactga22320
cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn22380
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnntgggtcggcgctgagatca22440
gcttgattcgtttttggttatacatgacacggacacctgccgctatactggtttgaaatt22500
gggaagaaggagcaaacggggaaagagaaaagagagtaacgcccgaatttggtgagagaa22560
aagaagaaaaaattctacccccaaattcagggcgttacatcaggctacaaaggatgccaa22620
tggtattgctgctctctatattgttcttgttctaatgtaaaaactacaacacaactcttt22680
acttgatcccagaaattccttctgcctcaaatggagacaatgacgagtggtcataagtac22740
agagattgcagacaaggtaaattttgcaatagaaataactaaccaaccattagtgcttga22800
aaaaaactggactggtgactggggcacgtggtttcatcaacatttggacctcaacggtct22860
aatcagtataacttagaagttggctagctcttgaaaaacactgcatgacactaagcattt22920
gtttattttcagctgcttacacccctatgatttcaagtaactacttgtctacttgtgata22980
atcacctgaatatgattatttgaaatgcttatcatgtctcgtcaattgcatttcttttat23040
gtgtacctgaagtctgctcttgcttcctaatagagttcgttttttaatacagaaaccact23100
ctgagatagccacaatatagtaaaagtggcagctaaggtactaaaaacacccatgcaaat23160
aagaaaaaaatgaatcttgtattttaattttgttaaatacctctatagtttggcgatata23220
ttatgttaccatcctgcttatagcctgtaggtcattttatatgagccatcaaattgcgat23280
gacagttgccacaaatccagtttcatatgaaggtattagctgtgtaacaagctaattgtt23340
gctctctgcccaataagttattcaattggattagtaggttgcatccaaggttattcaatt23400
ggatcagtaggttgcatccaaggtatactgctgctctctgcccaataagttattcaattc23460
gatcagtaggttgcatgttcccttcattttattaaaaaatacataataatataataagta23520
cttgtttgttctaaaaataatacttctgtaaatgaggatattaattttccttttggtaat23580
aatgcaggttgatgatactgaagtcatcagttttttgctgcaaactgaaataattcctct23640
gtgcttgcgaaccatggagatgggtagtgagctatccaaaattgtatgtagctagccaaa23700
tattctcattcaaatatcataatttatctcttctgcttaatactggcaaaggtgtaatag23760
tttttttagtattgatttgtcacctgaagtttatcttgtgcactactactttgccatcat23820
cagttatctctagaatactcttgtcctgtaccattttctctctgataagcctaaatttgt23880
acaattcataagcctaaaaggtgacttatataatatatacaaggaccctcaagagttgtt23940
tggcaattcagtgactgtcctgggtcctgttttggggagcttctggtagcttttgcttct24000
ccaaaagaaaagctagaagctccccccaaacagagcagcttcttcaagccggtaaaagct24060
tcaaaagctgtaattatactaaaaacagtgaagctccctcagagcagcttcccagctctc24120
taggagatgcttttggagaagctacagtttccccaaacagggccctgctctgttgaaccc24180
cccccttcctgatacatatttgaatatgagtttatagtgtgtgtgggggtgtaagtaggg24240
gggtaatgggttctaaattttatactataaaaattaaggatcggattagaattgagctct24300
atttctattcatttttgaactaaaattaattaagggctcaaatgaattatgaagaagcat24360
taggatcatgatccattaccacccctacgtgtaagatgttttttggtggttgtggttgat24420
tttgaattttaaggccgcatatgtctcatggaccacacaagctcatattcatctacattt24480
gtagccgtcactaacttagccaaatatgcatatgtggcggctagcaacaggtccttggtt24540
tcttgggttatttattctctttttatcgtgtttgaatgttttcgtgttcatttgcataac24600
atcttaggtctacattagtatatgaattgagatcaaatgtgaattggaccacacaagctc24660
atattcatctacatttgtagtcgtcactaacttagccaaatatgcatatgtccgcttctg24720
atttcattgtgtcttttcttcaggagtttggggatcaaggagaggactccattatcttgt24780
caccgcgactgaaggagattagtactcctgaccgccccactgccctccgtttcctaggta24840
cacgcataacagccattggtatgaatacatgttttatacgtgaatggagttccagtttaa24900
tttaaagattcaagttcactacaacaagattttacagtactgagcccatttgactttcct24960
tgagaaatagtgaaagggaattaggcttacacctagttcctaaataattttggtggttga25020
attgcccaacacaaataattggactaactagtttgctctagtgtacaagttatacaggtg25080
ccaaggttcacaacaagccaattaaaaagaccaaagttgggttcaaaatagagagccaaa25140
ggcatcccgaaaggctccctggtttggcgcaccggactgtccggtggcgcaccggacagt25200
gtccggtgcaccaggggacctcgcgcagaactcctcagcctcgggaatttttcggagccg25260
ccgcgctataattcaccggactgtccggtgtacaccggacagtgtccggtgctccaagaa25320
aacgcggctccggaacttggcagcctcaggaaatcagaacggctgctccgctataattca25380
ccagacatgtccggtgtacaccggactgtccggtgcaactgcggagcaacggctacttcg25440
cgccaacggtcacctgcaggcgcattcaatgcgcgccagaagcgcgcagaagtcaggcac25500
acccatgctggcgcaccggacactctacagtacatgtccggtgcgccaccggacatcaag25560
gcgggcccagaagacagaactccaacggtcaaattccaacgactttggtgacgtggctgg25620
cgcaccggactgtccggtgcaccatatgacagacagcctccaccaacggtcatgtttggt25680
ggttggggctataaataccccaaccaccccaccattcattgcatccaagttttccagctt25740
ccaaccactatataagagctagcattcattgcaaagcacaccaaaagagatcaagtcctc25800
tcccaactccacacaaagccttagtgattagagagagtgatttgtagtgttcatttgagc25860
tcttgcgcttggatcgcttcttttctttggcattctttcttgtgatcaaacactcacttg25920
taattgaggcaagagacaccaattgtgtggtggtccttgcgggaagtttgattcccaagt25980
gatttgagaagagaagctcactcggtccaagggaccgtttgagagagggaagggttgaaa26040
gagacccggcctttgtggcctcctcaacggggagtaggtttgagagaaccgaacctcggt26100
aaaacaaatccacgcgtctcacttcattattcgcttgcgatttgttttcacgccctctct26160
cggactcgttcttatttctaacgctaacccggcttgtagttgtgtttatatttgtaaatt26220
tcagtttcgccctattcaccccccctctaggcgactatcaattggtatcagagcccggtg26280
cttcattagagcctaaccgctcgaagtgatgtcgggagatcacgccaagaaggagatgga26340
gaccggcgaaaagcccactacaagccacgggagcacttcatcggaagagtcccgcaccaa26400
gaggaaggagaagaaagactcctccaaagggaaggagaagaagaaggactcctccaaagg26460
aaaggagaagaaatcttcttcacacaaagaaaagaaggagaagtcttcctcccacgagcc26520
gcaacggagtggggacaagaaaaagaggatgaggaaagtggtctactacgagaccgattc26580
ttcatcgacatccacctctggctccgacgcggcgtccgtcacttctaaacgccaagagcg26640
taagaagtatagtaagattcccctacgctaccctcgcatttctaaacatacacctttact26700
ttccgtcccattaggcaaaccaccaacttttgatggtgaagattatgctaggtggagtga26760
tttaatgcgatttcatctaacctcactccacaaaagtatatgggatgttgttgagtttgg26820
tgcacatgtaccatccgtaggggatgaagactatgatgaggatgaggtgacccaaatcga26880
gcacttcaactcccaagccacaaccatactcctcgcctctctaagtagagaggaatacaa26940
caaggtgcaagggttgaagaatgcgaaagaaatttgggatctactcaagaccgcgcacga27000
gggtgatgaactcaccaagattaccaagcaggaaacgatcgagggggagctcagtcgctt27060
ccgtcttcgccaaggggaggagccacaagatatgtacaaccggctcaaaaccttggtgaa27120
ccaagtgcgcaacctcgggagcaagaaatgggatgaccacgaggtggttaaggttattct27180
tagatcactcatcttccttaaccccactcaagttcaattaattcgtggtaatcctagata27240
tacactaatgacccccgaggaagttattgggaattttgtgagctttgaatgtatgatcaa27300
gggctcaaagaagatcaacgagcttgatgatccctccacgtccgaagcacaaccggtggc27360
tttcaaggcgacggaggagaagaaggaggagtctacaccaagtagacaaccaattgacgc27420
ttcaaagctcgacaacgaggagatggctttaatcatcaaaagctttcgccaaatcctcaa27480
gcaacggaaggggaaggattacaaatcccgttcaaagaaagtttgctacaagtgtggtaa27540
gcccggtcactttattgctaaatgtccattatcaagtgacagtgacagggataatgacaa27600
gaagggcaagaggagagaaaagaagaggtaccacaagaagaggggcggtgatgcccacgt27660
atgccgcgagtgggactccaacgagagctccaccgactcctccgacgacgaggacgtcgc27720
caacatcgccgacaccaagggactcctcttccccaacgtcggccacaagtgcctcatggc27780
aaaggacggcaaaaacaagaaggataaatctaaatcctccactagatatgaatcctctag27840
tgatgaaaatgttagtgatgaggaagataacttgcgatctctttttgccaacctcaacat27900
gcaacaaaaagagaaacttaatgaattgattagtgtcattcatgaaaaggatgatctctt27960
ggacacccaagaggacttccttattaaagaaaataagaagcatgttaaggttaaaaatgc28020
ttatgctctaaaagtagaaaaatgtgaaaaattgtctagtgagctaagcacttgccatga28080
gactataaacaaccttagaaatgagaatgctaatttgttagctaaggttgattctcatat28140
ttgtaatgtttcaagttccaatcctagagataataatgatgatttatttgctaggattaa28200
agatttgaacatttcacttgctagccttagaaatgaaaatgaaaaattgcttgctaaggc28260
taaagattttgatgtttgcaatgttactatttctaaccttagaagtgaaaacgacatatt28320
acatgctaaggttgtagaattaaaatcttgcaaacctcctacatctatagttgagcatgt28380
atctatttgtactagatgtagagatattgatgttgatgctattcatgatcacatgacttt28440
aattaaacaacaaaatgatcatatagcaaaactagatgctaaaattgccgagcataactt28500
agaaaatgaaaaatttaaatttgctagaagtatgctctatagtgggagacgccctggcat28560
caaggatggcattggcttccaaaggggagacaatgtcaaacttaatgcccctcctaaaag28620
attatctaattttgtaaagggcaaggctcccatgcctcaggataacgagggttacatttt28680
gtaccctgccggttatcccgagagcaaaattaggaggattcactctaggaagtctcactc28740
tggccctaaccatgctttcatgtacaagggtgagacatctagctctaggcaaccaaccca28800
tgttaagttgcctaagaagaaaactcctagtgcatcaaatgaacatagcatttcatttaa28860
gacttttgatgcatcttatgttttgactaacaaatccggcaaagtagttgccaagtttgt28920
tgggggcaaacacaagggctccaagacttgtgtttgggtacccaaagttcttgtttctaa28980
tgccaaaggacccaaaaccgtttgggtacctaaagtcaagaactaaaattgttttgtagg29040
tttatgcatccggaggctcaagttggatactcgacagcgggtgcacaaaccatatgacag29100
gggagaagaagatgttctcctcctacgagaaaaaccaggatccccaacgagctatcacat29160
tcggggatggaaatcaaggtttggtcaaaggtcttggtaaaatagctatatctcctgacc29220
attctatttccaatgtttttcttgtagattcattagattacaatttgctttctgtatctc29280
aattatgcaaaatgggctacaactgtcttttcactgatataggtgtcactgtctttagaa29340
gaagtgatgattcaatagcatttaagggagtgttggagggtcagctatacttagtagatt29400
ttgatagagctgaactcgacacttgcttaattgctaagactaacatgggctggctctggc29460
atcgccgactagcacatgttgggatgaagaatcttcataagcttctaaagggagagcaca29520
ttttaggattaaccaatgttcattttgagaatgacagggtttgtagcgcatgccaggcag29580
gaaagcaagttggagcccatcatccacacaagaacatcatgacgaccgacaggccgcttg29640
agctactccacatggatctattcggcccgattgcttacctaagcatcggcgggagtaagt29700
attgtcttgtgatagtggatgattattctcgcttcacttgggtgttctttttgcaggaaa29760
aatctcaaacccaagagaccttaaaaggattcttgagacgggctcaaaatgagttcgcct29820
taaggatcaagaaaataagaagcgacaacggaacggagttcaagaactctcaaattgaag29880
gcttccttgaggaggagggcatcaagcatgagttctcttctccctacacgtcacaacaaa29940
atggtgtagtagagaggaagaatcgaactctattggacatggcaagaaccatgcttgatg30000
agtacaagactttggatcggttttgggctgaggcggtcaacaccgcctgctacgccatca30060
accggttatatctacaccgaatcctcaagaagacatcttatgaactcctaaccggtaaaa30120
agcccaatatttcatattttagagtctttggtagcaaatgttttattcttgttaaaagag30180
gtagaaaatctaaatttgctcctaagactgtagaaggctttttactaggatatgattcaa30240
acacaagggcatatagagtctttaacaagtccactggacaagttgaagtttcttgtgacg30300
ttgtgtttgatgagactaacggctctcaagtagagcaagttgatcttgatgaaataggta30360
atgaagaggctccatgcatcgcgctaaggaacatgtccattggggatgtgtgtcctaagg30420
aatccgaagagcctccaaatgcacaagatcaactatcctcctccacgcaagcatctccac30480
cgactcaaaatgaggatgaagctcaagttgatgaagtagaagatcaagcaaatgagacac30540
ctcaagatgacgacaatgatcaagggggagatgcaaatgatcaagacaaggaggatgaag30600
agcataggccgccacacccaagagtccaccaagcaatccaacgagatcaccccgtcgaca30660
ccatcctcggcgacattcataagggggtaactactagatctcgtattgcacatttttgtg30720
agcattactcttttgtttcctctattgagccacacagggtagaggaagcactccaagatt30780
cggattgggtggtggcgatgcaagaggagctcaacaacttcactaggaatgaggtatggc30840
atttagttccacgtcctaatcaaaatgttgtaggaaccaaatgggtcttccgcaacaagc30900
aagatgagcatggtgtggtgacaaggaacaaagctcgacttgtggccaaaggatactccc30960
aagtcgaaggtttggatttcggtgaaacctatgcacccgtagctaggcttgagtcaattc31020
gtatattattggcctatgatacttaccatggctttaagctttatcaaatggacgtgaaaa31080
gtgccttcctcaatggaccaatcaaggaagaggtctatgttgagcaacctcccggctttg31140
aagacagtgagtaccctaaccatgtctataagctctctaaggcgctttatgggctcaagc31200
aagccccaagagcatggtatgaatgccttagagatttccttattgctaatggcttcaaag31260
tcggaaaagccgatcctacactctttactaaaactcttgaaaatgacttgtttatatgcc31320
aaatttatgttgatgatattatatttggatctactaacgagtccacttgtgaagagttta31380
gtaggatcatgacacagaaattcgagatgtctatgatgggggagttgaagtattttctag31440
gattccaagtcaagcaactccaagagggcaccttcattagccaaacaaaatacactcaag31500
atattctaagcaagtttggaatgaaggatgccaagcccatcaagacacccatgggaacta31560
atgggcatctcgacctcgacacgggaggtaagtccgtggatcaaaagctataccggtcga31620
tgataggttctttactctatttatgtgcatctcgaccggacattatgctttccgtatgca31680
tgtgtgcaagattccaagccgaccctaaggaagcccaccttacggccgtaaaacgaatct31740
tgagatatctggcttatactcctaagtttgggctttggtatcctaggggatccacatttg31800
atttgattggttattcggatgccgattgggcagggtgcaaaatcaataggaagagcacat31860
ccgggacttgccagttcttgggaagatccttgggtgtcttgggcttcaaagaagcaaaat31920
tcggtcgctctttccaccgccgaagccgagtatattgcccgcaggccactgttgcgcgca31980
actgctttggatgaggcaaaccctgcgggactatggttacaaactaaccaaggtcccttt32040
gctatgtgataatgagagtgcaatcaaaatggtcgacaatcccgtcgagcatagccgcac32100
taagcacatagccattcggtatcactttttgagggatcaccaacaaaagggagatatcga32160
gatttcatacattaatactaacgatcaattagctgatatctttaccaagcctcttgatga32220
acaatcttttaacaaacttaggcatgagctcaatattcttgattctaggaacttcttttg32280
ttaaattgcacacattgttcttttatatacctttgatcatatctcttttatatgctatga32340
ctaatgtgttttcaagtctatttcaaaccaagtcataggtatattgaaagggaattggag32400
tcttcggcgaagacaaaggcttccactccgtacctcatccttcgccatcacttcaagcaa32460
ctctccgttctcgggggagataagcatgagcatcaaagaaaaggactttgggggagaaat32520
gagcccaaagccaaaggaccggacttcgtctttggtataatcttaactcatttatttatg32580
accaaaagggaaaatagcacttcgagggctctaatgattccgtttttggcgattcatgcc32640
aaaaagggggagaaatgagcccaaagcaaaaggaccgcaccaccaccaatttcaaaaact32700
tagtgttgaatatttttcaatttgtatcttattttcaattggtatcttattgtgttcaaa32760
agggggagaaagtagtattttaaaatgatatatcaaaaaccctcttgaatactaagagga32820
ggatctcttttagggggagttttgtttaagtcaaaggaaaagcatttgaaacagggggag32880
aaaatttcaaatcttgagaatgctttgcaaaaatcctattcatttacctttgactatttg32940
caaaagaactttgaaaaggatttacaaaataatttgcaaaaacaaaactcgtggtgcaag33000
cgtggtccaaaatgttatataaagaaagaaacaatccatgcatatcttgtaagtattcat33060
attggctcaattccaagcaacctttacacttacattatgcaaactagttcaattatacac33120
ttctatatttgctttggtttgtgttggcatcaatcaccaaaaagggggagattgaaaggg33180
aattaggcttacacctagtccctaattaattttggtggttgaattgcccaacacaaacaa33240
ttggactaactaagtttgcacaagtttatagattacacaggtgtaaaaggttcacactca33300
gccaataaaaggaccaagtttttggattcaacaaaggagcatagtgggaaccgaaggccc33360
tctggtctgagagcaccggactgtccggtgtacaccggacagtgtccggtgcaccagagg33420
actcaagctccaactcgcccccttcgggaattgccagaggcgactccgctataattcacc33480
ggactgtccggtgtacaccggacagtgtccggtgcgccaagggaggtcggcctcaggaac33540
tcgccagcttcgggaaaagccatcggcccgtccactataattcaccggactgtctggtgt33600
gcaccggactgtccggtgcgactcgagagcaacggctatttccgcgccaacggctctctg33660
ccgcgcatttaatgcgcgctctgcgcgcgcagatgtcaggcgcgcccactccggcacacc33720
ggacaaggaacagtagatgtccggtgtgcaccggacacccaggcgggcccacaagtcaga33780
agcttcaacggctagaatccaacggcagtgatgacgtggcaggggcaccggactgtccgg33840
tgtgcaccggactgtccggtgcgtcatcgaacagacaacccagccaacggtcaagtttgg33900
tggttggggctataaatacccccaaccaccccaccattcattgcatccaagttttccact33960
tctcaactactacaagagctctagcattcaattctagacacaccaaaagagatcaaatcc34020
tctccaattccacaaaaggctttagtgattagagagagagatttgccgtgttctttttga34080
gctcttgcgcttggattgcttcttttctttcgcacttgttcttgtgatcaaaaacccatt34140
gtaatcaaggcaagaggcaccaattgtgtggtggcccttgttgggaagttttgttcccaa34200
ctttgattagagaagagaagctcactcggtccgagggaccgtttgagagagggaagggtt34260
gaaagagacccggcctttgtggcctcctcaacggggagtaggtttgcgagaaccgaacct34320
cggtaaaacaaatccacgtgtcacactcttcatttgcttgcgatttgttttgcgccctct34380
ctcgcggactcgtttcttcattactaacgctaacccggcttgtagttgtgtttatatttg34440
taaatttcagtttcgccctattcaccccccctctaggcgactatcaaaaacagtgcaacc34500
atgagggtggaatggggtgcccttagctgaataattagaggatccggggtgtagttcact34560
tagccatcgtgccgtcaatggggctcggtgtatgcggctcgctctgccaagtttgggttc34620
gccccttggggaggagtgcggtgcatttaggaaacctaacgggtggctacagtcccgggg34680
aatctttgtaaaggctacgtagtgatgccctgctgggtcaccttggtagtgatcaatgga34740
gagtcatgatctccgggcagaatgggaatcacggcttgtgggtaaagtgcacaacctctg34800
cagagtgtttgaaaactgatatatcagccgtgctcacggttatgagcagccaagggagct34860
ccagtgattagtggtacttgatcagagatactttggtacaggtggttatgagatcgatga34920
ttctggttatgactatgatgctggtaagtggtactctttccgtttggaaaggagtacgtt34980
tgggttaataacttgggttaatgctaaaacttggctttctattagtaaataataatctga35040
ccaactaaaagcaactgcttgacttatccccacataaagctagtccactacagccaaaca35100
ggatacttgctgagtatgttgatgtgtactcacccttgctctacacaccaaacccccccc35160
ccatccccaggttgtcagcattgcaaccactgctcagtcgaagatgaagctgtggaagga35220
gacttccaggagttccaagattacgacgagttctaggtgtgggttagcggcaacccccag35280
tcggctgcctgtgaaggccgcggttatctacgtttcttttccgcactttgatttattgta35340
agaactatatggacgtctcagacgtatgatgtaatcgactatttcccttagtaatactat35400
tttgagcactgtgtgatgatgtccatgttatgtaactgctgtgtacgtgaataactgatc35460
ctggcacgtacatggttcgcattcggtttgccttctaaaaccgggtgtgacacctgatta35520
ctctcaagcaaagcctataggtagtttaagaggttgagtacaatgagaaacatttcaatc35580
attatttgcaaaagaaacattttgatcatattaaggaaaatcataggagtgaaagaaaaa35640
caatgtgtgcaaataactgaacctcctgcagctccatcatgctggccaaagtatgcttcg35700
acgaattcaaaatatcatcatatgcttgctctatctgcttatcatgattgcaaccttgtt35760
cggatgggttcctggcttcctgcaacagcacagtagaaaacaaactggagtttagaattc35820
aacttgagacttcagagttgaaacaaaacctgtattcacatatgtagcaccgatgcatat35880
acaaatacctttatcagaaacatgaagatatgtgaatacattattttctcaataaaagtc35940
atgatagaattcatgcaaatttttttagaaataaataaatgatgaggcatactacaattc36000
taaaagcagtagcagtgcaacacgaacgaacattcaaattgccccacattcttgaaactg36060
tgctgctttgctctcgttccaaaaaaactccgctaaagtaaaacttggaaaggtctggtt36120
ttgcgtgagccaccaacaccaaccaaactttacgtcactatgacagtttcagcctttcgg36180
tcccggcgacagccatggcggacgcgggggtgacagggggtgctggccaagctgggtgag36240
ctgacggaggaggaggcgacgacgctgctgcgcgtggacgccgagatacgggcattgtgg36300
cagaagctggcctacctgcaggcgctcgtacgcggggccggccgccagcgccgcgaccgc36360
gcaagcgagctgctcctgctctggctacgcgagaccagagaggttgctttcgcgggtggt36420
tctgccatacatagcggcgatggctcttcctcccaggactaccatatctacacataacaa36480
tcatgctctcatgaagttgtgatgtaataggtcacacgattttaatgtataagattgtga36540
agagtaaattaattcaaatgaattcatggacatgggacaactatgtgttaaaaacagaat36600
ctcctatgtatcctaaccatgtgtaatgacatgacaaaatgacacttgtatatgagcaat36660
taaagcatcatactaccttgtagagttctcttcggcgaatacgttgaatgatttcttttg36720
ctttcttcagtttattgttaggagcagtctcgtagacacagtcaagtcctagcattagct36780
cactgcgagagtatcagccattccagactccaggagctcctcccgggtagcctgcctggc36840
aatgggcgcagatatcctcttgagctcactcttaatgctaggttgtaggctccttgcttc36900
caggtccatcttccgaatctgcagacacaaaacaaaaacataaaattacttttcccccaa36960
cccccaaattgaagactaaggcacataaacatctcgcaactccgttggaacaccaaaact37020
gcaccgacggaaacgaaaaatccctaatctctgcttttctacccattccccaaatgtcgt37080
atgtcccgttagccaaggggatccccaaaacagaacgtgtcacgacatctatttaccagc37140
gattcagcttcctcaacgccggattggatctcagagagcttctgcttcttcctctctgcg37200
gacagacccaccgaaatcagaccaaaacaaacggtcaacaaaagaaggctttccaagcgg37260
cgctactgacgctcggctgcctggccggcgagcctcgcagtcggccctcgatgaaggagg37320
tggtggagacgctggagcgggtggaggcgatgaagagccgggcacgcggcgcgcgtacac37380
cgcccggttcctgcccccccccccccccgacgacgaccatggcgcttcgccgatggccca37440
cttgcctggtcgccccgccgtcgcgtcagcgagcgcgctcagcagctccgtgacccgctc37500
gtcgaaccgcgtggccacggcgagcggcacgaggcctgggttcgtgacgttgaccggggg37560
gcggcgggggtgaaagggaattaggctcacacctatttcctaattgattttggtggttga37620
attgtctaacacaaataattggactaactagtttgctctagtctataagttttacaggtg37680
ccaaaggttcataataagccaataaaaagaccaagaaagggttcaaacaaaaagagcaaa37740
agacatcccggaaggcaccctggtctggcgcaccggactgtccggtgtgccaccggacag37800
tgtccggtgcaccagggcactcgaagctgaactcgctaccttcgggaaaatcagagggcg37860
ctccgctataattcaccagactgtccggtgaagcaccggactgtccggtgtgccagcgga37920
gcaacggctacttcgcgcgcaacggtcgactgcaacgcattcaatgcgcgcctgcgcgcg37980
cagagggcagagcactcacagttggcgcaccggacagtctacaggacctgtccggtgcac38040
caccggacagcccagaggccccacaagtcagagctccaacgatcgaaccccaacgatctg38100
ctgacgtggctggcgcaccggactgtccggtgcgccatgcgaccgcagccttccaacggc38160
catttttggtggtttagggctataaataccccaaccaccccacattcaatggcatccaag38220
tttcccaccttcaacacattacaagagctataacattcaattctagacactccaaaagat38280
caaatcctctcccaagtccggaatcactccaaatcaaatagtgactagagagagcgacat38340
ttgtgttcatttgagctcttgcgcttggatcgcttcttttctttctcattcttcttgtga38400
tcaaactcaattgtaaccaaggcaagagacaccaattgtgtggtggtccttgcaggaact38460
ttgtgttccgtttgattgagaagagaagctcactcggtctaagtgaccgtttgagagagg38520
gaaagggttgaaagagacccggtctttgtgaccacctcaatggggagtaggtttgcaaga38580
accgaacctcggtaaaacaaatcatcgtgtctcgctctttatatttctaacgttaacccg38640
gcttgtagttgtgcttaagtttgtaaatttcagattcgccctattcaccccccctctagg38700
cgactttcaattggtatcggagccggtgcttcattagagcctaactgctcgaagtgatgt38760
cgggagcatccgccatgagggatctcgggaccggcgacaagaccgcatgctcgggaagaa38820
ctcactcaagggagtccgcccacaagcataaggaggaatcgtcttcctccatcaagtccc38880
atcggatgggtgacaaaaagaagaagatgaggaaggtggtctactacgagaccgactctt38940
cgtcaccctccacctccggctcggaatcggcctccaccacttcaaagcgccatgagcgca39000
agaagtatagtaagatgccccttcgctatcctcgcatttctagacgcactccatcactct39060
tcgttccattaggcaaaccacctatatttgaaggtgaagattattctatgtggagtgata39120
aaatgaggcatcacctaacctcactccacaaaagcatatgggatattgttgagtatggag39180
tgcaggtaccaaagaagggagataaagattacgactcggaggaggttgaacaaatccaac39240
atttcaaatccaagtcgagaggagtataataaggtgcaagggttgaagagtgcaaaggat39300
atctgggacgtgctaaagaccgcgcacgaaggagacgaggtaaccaagatcaccaagcgg39360
gagacgatcgagggggagctcggtcgcttccggcttcgccaaggggaggagccacaagat39420
atgtacaaccggctcaagaccttggtgaaccaagtgcgcaacctcgggagcaaaaaatgg39480
gatgaccatgaaatggttaaggttattcttagatcacttgtgttccttaaccctacgcaa39540
gttcaattaattcgtggtaatcctagatatacactaatgactcccgaggaagtaatagga39600
aactttgtgagctttgagttgatgatcaaaggctcaaagaaaattatcgagcacgacggt39660
ccctccacgcccgaagcacaaccggtcgcattcaaggcgacagaggagaagaaagaggag39720
tctacatcaagtagacaacccatcgacgcctctaagctcgacaacgagaaaatggcgctc39780
atcatcaagagcttccaccaaatcctcaaacaaaggaaggggaaagattacaagccttgt39840
tccaaaagggtgtgctacaagtgtggtaagcccggtcatttcattgttaaatgtccttta39900
tctagtgatagtgacaggggcgacgacaagaagggcaagaggagagaaaagaggaggtat39960
tacaagaagaagggcggcgatgcccatgtgtgccgcgagtgggactccgacgagagttcc40020
tccgactcctcatccgacgaggacgccgccaacatcgccgtcaccaaagggctcctcttc40080
cccaacgtcggccacaagtgcctcatggcaaaggacggcaaaaagaagaaggtaaaatca40140
aaatcctccactaaatatgcatcctctagtgatgaagataatgctagtgatgaggaggat40200
aatttgcgtaccctttttgtcaacctaaacatgcaactacaggaaaaactaaatgaatta40260
attagtgctattcatgagaaagatgatctcttggactttcaagaggacttcctaattaag40320
gaaaataagaagcatgttaaggttaaaaatgcttatgctctagaagtagaaaaatgtgaa40380
aaattatctagtgagctaagcacttgccatgatactattaccatccttagaaataaaaat40440
actaaactaattgctaaggttgattctaatatttgtgatgtttcaattcccaatcttaga40500
gatgataatgttaatttgcttgctaagattgaagaattgaatgtctctcttgctagcctt40560
agggttgaaaatgaaaaattgattgctaaggctaaagaattagatgtttgcaatgcttcc40620
atttctgatcttagaaataacaatgatattttacgtgctaagattgttgaacttaattct40680
tgcaaaccctctacatctgccattgagcatgtcattatttgcactagatgtagagatatt40740
aacattgatgctattcatgatcatatggctttaattaaacaacaaaataatcatatagca40800
aaattagatgctaaaattgccgagcatgacttaaaaaatgaaaaatttaaatttgctaga40860
agcatgctctatagtgggagacgccctggcatcaaggatggcattggcttccaaaaggga40920
aacaatgtcaaacttaatgcctctcctaaaagattgtcaaactttgttaagggcaaggct40980
cccatgcctcaggataatgagggttacattttgtaccctgccggttatcccgagagcaaa41040
attaggagaattcattctaggaagtctcactctggccataatcatgcttttatgtataag41100
ggtgagacatctagctctaggcaatcaacccgtgcaaaattgcctaagaagaaaactcct41160
gctgcatcaaatgatcataacatttcattcaaaacttttgatgcatcttatgttttaact41220
aacaaatccgacaagatagttgccaagtatgttgggggcaaacacaagggatcaaagact41280
tgtgtttgggtacccaaagttcttgtatctaatgtcaaaggacccaaaaccatttgggta41340
cctaaaatcaagaactaaacttgttttgtaggtttatgcatccgggggcccaagttggat41400
catcgatagcgggtgcacaaaccatatgacaggggagaagaaaatgttctcctcctatga41460
gaaaaaccaagatccccaaagagcgatcacattcggggatggaaaccaaggtttggtcaa41520
aggattgggtaaaattgctatatctcctgaccattccatttccaatgtgtttcttgtaga41580
ttctttagattacaacttgctttcagtttcgcaattatgtaaaatgggctacaactgtct41640
ttttacagatataggtgttactgtctttagaagaagtgatgattcagtagcatttaaggg41700
agtgttagagggtcagctatacttggtagattttgatagagctgaactcgacacttgctt41760
aattgctaagactaacatgggctggctctggcatcaccgactagcacatgttggaatgaa41820
gaatcttcacaagcttctaaagggagaacacattttgggactaacaaatgttcactttga41880
gaaagataggatttgtagcgcatgtcagacagggaagcaagttggtactcatcatccaca41940
caagaacatcatgatgactgacaggccactcgagctcctacatatggacctattcggccc42000
gatagcttacataagcatcggcgggagtaagtactatctagttattgtggatgattatac42060
tcgcttcacttgggtattctttttgcaggaaaaatctcatacccaagagaccttaaaggg42120
attcttgggacgggctcaaaatgagttcggcttaagaatcaaatttgttttaagcgacaa42180
cgggacggagtcaagaatctcaaatcgaaggcacgatctcctagatccggcccaaaannn42240
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn42300
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnncgctgatgaatcagcttgattcg42360
tgacacgggaggtaagtccgtggatcaaaaggtataccggtaagataataggctctttac42420
tctatttatgtgcatctcgaccggatattatgctttccgtatgcatgtgtgcaagattcc42480
aagctgaccctaaggaagctcaccttacggccgtaaaacgaatcttgagatatttggctt42540
atactcctaagtttgggctttggtatcctaggggatccacatttgatttgattggttatt42600
cggatgctgattgggcggggtgtaaaatcaatagaaagagcacatcagggacttgccagt42660
tcttgggaagatccttggtgtcttgggcttcaaagaagcaaaattcggtcgctctttcca42720
ccgccgaagccgagtacattgccgcaggccattgttgcgcgcaattgctttggatgaggc42780
aaaccctgcgggactatggttacaaattaacctaagtccctttgctatgtgataatgaga42840
gtgcaatcaagatggcggataatcccgtcgaacatagccgcactaaacacatagccattc42900
ggtatcattttcttagggatcaccaacaaaagggagatatcgagatttcttacattaaca42960
ctaaagatcaattagccgatatctttaccaagcctcttgatgaacaaacctttaacaaac43020
ttaggcatgagctcaatattcttgattcgcgcaatttcttttgctaaattgcacacatag43080
ctcatttatatacctttgatcatatctctttcatatgctatgactaatatgttcttcaag43140
tctatttcaaaccaagtcataggtgtattgaaagggaattggagtcttcggcaaagacaa43200
aggcttccactccgtaactcatccttcgtcgtcgctctgggccactctccatctttgggg43260
gagagagcaaaagacttcgtctttggtacaatcttaactcatttatttatgaccaaaggg43320
gaagaaagtacttcgagggctctaatgattccgtttttggcgattcatgccaaaggggga43380
gagagtatgagcccaaagcaaacggaccgcaccaccaccaatttcaaaaacttagttttt43440
caaagagtattttcaattggtatcctattgtgttcaaaagggggagaaagtagtattttc43500
aaaaatgatatatcaaaaccctcttgaacactaagaggtggatctcatttagggggagtt43560
ttgtttagtcaaaggaaaagcatttgaaacagggggagaaaatttcaaatcttgaaaatg43620
cttcataaaatcgtattcatttacctttgactttttgcaaaagaactttgaaaaggattt43680
acaaaatagtttgcaaaaacaaaacatgtggtgcaagtgtggtccaaaatgataaaaaca43740
aaggaacgatccatgcatatcttgtaagtatttatattggctcaaatccaagcaaccttt43800
gcacttacattatgcaaactagttcaattatgcattttatacttgctttggtttgtgttg43860
gcatcaatcaccaaaaagggggagattgaaagggaattaggcttacacctagttcctaaa43920
taattttggtggttgaattgcccaacacaaataattggactaactagtttgctctagtgt43980
acaagttatacaggtgccaaggttcacaacaagccaattaaaaagaccaaagttgggttc44040
aaaatagagagccaaaggcatcccgaaaggctccctggtttggcgcaccggactgtccgg44100
tggcgcaccggacagtgtccggtgcaccaggggacctcgcgcagaactcctcagcctcgg44160
gaatttttcggagccgccgcgctataattcaccggactgtccggtgtacaccggacagtg44220
tccggtgctccaagaaaacacggctccagaacttggcagcctcgggaaatcagaacggct44280
gctccgctataattcaccggacatgtccggtgtacaccggactgtccggtgcaactgcgg44340
agcaacggctacttcgcgccaacggtcacctgcaggcgcattcaatgcgcgccagaagcg44400
cgcagaagtcaggcacgcccatgctggcgcaccggacactctacagtacatgtccggtgc44460
gccaccggacatcaaggcgggcccagaaggcagaactccaacggtcaaattccaacggct44520
ttggtgacatggctggcgcaccggactgtccggtgcaccatacgacagacagcctccacc44580
aacggtcatgtttggtggttggggctataaataccccaaccaccccaccattcattgcat44640
ccaagttttccagcttccaaccactatacaagagctagcattcattgcaaagcacaccaa44700
aagagatcaaatcctctcccaactccacacaaagccttagtgattagagagagtgatttg44760
tagtgttcatttgagctcttgcgcttggatcgcttcttttctttggcattctttcttgtg44820
atcaaacactcacttgtaattgaggcaagagacaccaattgtgtggtggtccttgcgggg44880
agtttgattcccaagtgatttgagaagagaagctcactcggtccaagggaccgtttgaga44940
gagggaagggttgaaagagacccggcctttgtggcctcctcaacggggagtaggtttgag45000
agaaccgaacctcggtaaaacaaatccacgcgtctcacttcattattcgcttgcgatttg45060
ttttcacgccctctctcggactcgttcttatttctaacgctaacccggcttgtagttgtg45120
tttatatttgtaaatttcagtttcgccctattcaccccccctctaggcgactatcaaata45180
gccagtgctttttggtctgcgagttcctgcacttggttaatcaactgtgtcgcttgatct45240
tctacttgtttgcacgagaaggtcaaagccactttcgaagctattagttcagaacacaca45300
acatctagctaaatacatcaccagtttgaagtcattgattgtattcttgatatcatcttt45360
attcttgaatgtcatttgtgccagttcatttaactcttgtgctgcaaaccaacctgacat45420
cgtcaattcatttaatctctcaatctcagtttcctttttttgtttcacattgaagctccc45480
taattgttgcttctcgatgtgcagtggcctgattagctacaagaagctcttgttccatgg45540
actcgatctctggaccgcactatcgttgcctcagcccctaggtcgtgctgccctctggcc45600
tcctcatcgtacaattcaccaacatctccaatgtaagtgcagcaggttcagtaatgaact45660
cagaagtggcatcagaatactccaagagttttttgatctttttgcctggatatataccaa45720
gggaaatgcattcaaaactcctatagatgacgaatcccatctctccctcttttctcggac45780
acggatccccaggtccgtctccgtgctttactcatttgttttttacaagttcagatccac45840
ttgcgtactcacacggtggacatctgttatgcacatgtgtaaaccagcataagtccttac45900
actcgaaaatgcatgtgttatttagcttgagaataaataaaattattagcaaggagaaaa45960
caaaaaaataggactaaacaatagagtcacattggtttaaattagtacctagaagtaaaa46020
aaagatgatctaaattagatacatcataccaaataccatattactattccagttaccccg46080
tctactatgcctagatatcaaattcttgaaggttggccttctcattttcagtaatagcct46140
gacgaaagtagagtatgtttgtatgagcaattatgctgctcactactgccttgcgctata46200
ataggccactactgattttacatgctttttctacattagatagctcacaaacatgctacc46260
tcaaaaaaatgatggcaaaggggagccacaaaatgtcaattattttgtcaagtattagca46320
gttttcttgtgtatgtgatcagactaacactgcatgtctttgttttcctgtaaaactatg46380
tatgatgaaaccatggtgtgattgtattggctggccttaccctgttttgttgcaatgcat46440
tcgttgttgtacaggtaatatgttgaaacacaattcattgcatatgacaattctgttttt46500
tctttctagaatattgacatattgtttgatcattattttctaagcaataatcatggctat46560
tcttatagtattgcataatacctttttcttttcgaaccctagcgcattgattctttagtg46620
aagtgattatagtgattccagcgggagagtagggtggggagcagagggttgattctggac46680
tgatttcggtggagattaaatggggagcagtgaggagcatgtttttttagatcccaccag46740
aatatgtgcgccattttgctatttggctgaggagtgatgctcagggagaatccgttctca46800
ggagctgtgccaaatgtcgccttaggttttatgatatgacctgacttctgtgttaatatt46860
tgttagatctttattttatttgaggttacaaaggtggtgttctcaagctagaaacaaagt46920
tgtggctaggtcaaaactagatgatgctcttgaaccgtgtcttttgactctgttacttgt46980
tgcaggtttgatgttcactaatatgttgttcaactttgagcaggtcagcaattgactggt47040
gttgctggtagcctggcatatctggcacctgaggttctactaggaaattactcccaaaag47100
gtagatgtatgggctgccggggtgcttctgcatgttctgttgatgggcactcttccgttc47160
caaggaaaatctatcgaagctatctttgatgttataaagactgctgaacttgactttcac47220
aatagtcagtgggcatctgtgtcacttcttgcttatgatctcattggtcgaatgcttaat47280
cgagaggtctcttcaaggcccgatgccgaagatgttctccgtaagttcaagcacccttgt47340
aacttgtgctttatatatatatataatatatatatatatatgattctcaatttatcattg47400
acttttcctaatggctttcaacacagggcacccatgggtcttattctacactgattgcct47460
gcagaaagctgaattctctaacctatgggatactaacaaaactgcagctcccatgattca47520
tcgggagatagtcaggtttggttactgcgagtcttcatcttcaaaatcctcaagtgacaa47580
ctctgaagagcgagatgaatgcggtatagttgatgcactggcgacaacaataacacaggt47640
gaggatctcagagcccaagaggagtcggctgttcagcctacccaacgggttgttgccgcc47700
aagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtggctttctaacttgac47760
ctacctagctcccatccccatgcatgtataaacgagataaacgagctctgtgattttata47820
gatggaaaattttcaccgtggttgatgttttgcgattgctagctcgctgagcctgcaatc47880
ctctgtaaatatatcattgttgtcatcatttttgtacatcgatgacaccgtaattgattc47940
gatt47944
2
1524
dna
artificialsequence
cdna
2
atggggagcagtgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtca60
cttcttgctcatgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaat120
gccaaagaagttctccctcccatgattcatcgggagatagtcaggtttggttactgtgag180
tcttcatcttcaaaatcctcaagtgacaactctgaagagcgagatgaatgcggtatagtt240
gatgcactggtgacaacaataacacagattcggaagatggacttggaggcaaggagccta300
cagcctagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaac360
aacgtcaagatgggtctatccgcagagaggaagaagcagaagctctccgagatccaatcc420
ggcgttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacag480
cctagcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaac540
gtcaagagtgagctcaagaggatatctgcgcccaatgccagtggcctgattagctacaag600
aagctcttgttccatggactcgatctctggaccgcactatcgttgcctcagcccctaggt660
cgtgctgccctctggcctcctcatcgtacaattcaccaacatctccaatgtcagcaattg720
actggtgttgctggtagcctggcatatctggcacctgaggttctactaggaaattactcc780
caaaaggtagatgtatgggctgccggggtgcttctgcatgttctgttgatgggcactctt840
ccgttccaaggaaaatctatcgaagctatctttgatgttataaagactgctgaacttgac900
tttcacaatagtcagtgggcatctgtgtcacttcttgcttatgatctcattggtcgaatg960
cttaatcgagaggtctcttcaaggcccgatgccgaagatgttctccggcacccatgggtc1020
ttattctacactgattgcctgcagaaagctgaattctctaacctatgggatactaacaaa1080
actgcagctcccatgattcatcgggagatagtcaggtttggttactgcgagtcttcatct1140
tcaaaatcctcaagtgacaactctgaagagcgagatgaatgcggtatagttgatgcactg1200
gcgacaacaataacacaggtgaggatctcagagcccaagaggagtcggctgttcagccta1260
cccaacgggttgttgccgccaagcaggaacagtctccgaacatgaagatgatgaatccgt1320
gtgtggctttctaacttgacctacctagctcccatccccatgcatgtataaacgagataa1380
acgagctctgtgattttatagatggaaaattttcaccgtggttgatgttttgcgattgct1440
agctcgctgagcctgcaatcctctgtaaatatatcattgttgtcatcatttttgtacatc1500
gatgacaccgtaattgattcgatt1524
3
1082
dna
artificialsequence
cdna
3
cctgcccttccattcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60
cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120
atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180
ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240
ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300
gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360
atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420
gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480
agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540
aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600
caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660
ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720
aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780
agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840
agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900
tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960
gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020
gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080
ct1082
4
2321
dna
artificialsequence
cdna
4
tgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtcacttcttgctca60
tgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaatgccaaagaagt120
tctccgtaagttcaagcacccttgtaacttgtgctttatatatatgattctcaatttatc180
attgacttttcctaatggctttcaacacagggcaccatgggtcttattctacactgattg240
cccgcagaaagctgaattctctaacatatgggatactaacaaaactgcagctcccatgat300
tcatcgggagatagtcaggtttggttactgtgagtcttcatcttcaaaatcctcaagtga360
caactctgaagagcgagatgaatgcggtatagttgatgcactggtgacaacaataacaca420
ggtgaggatctcagagcccaagaggagtcggctgttcagcctacccaacgggttgttgcc480
gccaagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtggctttctaactt540
gacctacctagctcccatccccatgcatgtataaacgacatttggggaatgggtagaaaa600
gcagagattagggattttcgtttccgtcggtgcagttttggtgttccaatggagttgcga660
gatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaattttatgttttt720
gttttgtgtctgcagattcggaagatggacttggaggcaaggagcctacagcctagcatt780
aaggctggtttgcttgcaaagctgagggagtataaatctgacctcaacaacgtcaagagt840
gagctcaagaggatatttgcgcccaatgccaggcaggctacccgggaggagctcctagag900
tttggaatggctgatactctcgctgtgagctaatgctaggacttgactgtgtctacgaga960
ctgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcgccgaagag1020
aactctacaaggtagtatgatgctttaattgctcatatacaagtgtcattttgtcatgtc1080
attacacatggttaggatacatacttaagtttctaacgtaggcgtccacacaacggattg1140
gtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcaggcacccttccc1200
cgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttccattcttcccc1260
cgctgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgtggccacactgt1320
tcgacgagcgagtcacagagctgctgagcgtgctcgctgatgcggcggtggggcgaccag1380
gcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaaccaggcggtgt1440
acgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacttc1500
cttcatcgagggccgactgcttggctcgctggccaggcagccgagcattagttgcgccgc1560
ttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgggtctatccgca1620
gagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctg1680
attcagaaaatggacctggaggcaaggagcctacagcctagcattaaggctggtttgctt1740
gcaaagccgagggattataaatctgacctcaacaacgtcaagagtgagctcaagaggata1800
tctgcgcccaatgccagattcggaagatggacctggaagcaaggagcctacaacctagca1860
ttaagagtgagctcaagaggatatctgcgcccattgccaggcaggctacccgggaggagc1920
tcctggagtctggaatggctgatactctcgcagtgagctaatgctaggacttgactgtgt1980
ctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcg2040
ccgaagagaactctacaagatatggtagtcctgggaggaagagccatcgccgctatgtat2100
ggcagaaccacccgcgaaagcaacctctctggtctcgcgtagccagagcaggagcagctc2160
gcttgcgcggtcgcggcgctggcggccggccccgcgtacgagcgcctgcaggaagccagg2220
aacccatccgaacaaggttgcaatcatgataagcagatagagcaagcatatgatgatatt2280
ttgaattcgtcgaagcatactttggccagcatgatggagct2321
5
1082
dna
artificialsequence
cdna
5
cctgcccttccattcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60
cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120
atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180
ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240
ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300
gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360
atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420
gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480
agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540
aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600
caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660
ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720
aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780
agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840
agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900
tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960
gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020
gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080
ct1082
6
3646
dna
zeamays
6
atggcacactttgatgaactagaggataaaacaacagattatgttgatttatcggttcaa60
gaatttgctcttaagcaacctcaatgtggcatggcttataattactatggaaatttaagg120
ctttatgtagtagccaataaagctgaattggcctcttcaatatttgaaatcgataaggta180
aacaaaggcggagttaatgcatctatgccagtgaccacttccactcctaattcgaatcaa240
aattcatgaaccggttatggaacaaatagagaatcaagtttcggtgaggtataatacgat300
tcccctaacccatggaatttacctagtaaaaatcctgtagttaatagtgtactagtaact360
tctgtcaccgacttgaataaagctttgaatgagtataaaaatgagatgtctaaatttatt420
gagaatagcttggtgtatagattaagcctagtagaaacacttataacaagttgtatgctt480
caattttttttgatttttttggaagctactcatagttggagggtaccaaatttacaaaaa540
aaaattggtgattataatagtaaatctaccatagaacatgttagcttgtttcttgctctg600
agaggtgaagctagtagcatgaaaattgaatgtgcgttatttttctttttcacttactgg660
tacaatttttgcatggtttatgttgttgccctgcttgttgtattggttcatgggctggtc720
tgtgaaataatttggcgatagccattttcttttttgagatacattgcttttgctatatat780
atctagatatggtgcatatttaaatgcataataaaaatgtaaaaatctaaaacgtcttat840
aatttaggacagatgaaagtactagatattagacatttttagtgtttttattaaaatgga900
atatgtaccgcctttgatgctacaacttttacttagcttttaaaacacaccattctaaat960
tgtaaaaaaatattaaaaatgtgttttgcaagatgaatatactaacctttgttatgataa1020
tagttttcatatgttaatggaacaagctaaaaagtttggcaaagtatagtcctatagctt1080
ccatttcgactcagagagagtatgttgtatccactaaccgtgtacacaagatagcccaac1140
taattaattattttgtgagctatcacccaaccttctgtttatcatggattcatggaaaaa1200
tgtaattgccatcattacactaaaaactaaaacttatgaaggagaaccattgtcttgcta1260
tatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatcctgtttca1320
aatctcttctactacttaagtttctaacgtaggcgtccacaaaacggattggtgcacggt1380
tctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttccccgccgcccc1440
ggatctcgcgccagccccagccctaccccgcctgcccttccattcttccccagccgcccc1500
ccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggttcgacgag1560
cgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccaggcaggtgg1620
gccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgtacgcgcgc1680
cgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcg1740
agggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacg1800
cctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaa1860
gaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctggtaaatag1920
atgccgcgacacgttctggtttggggatccccttggctaacaggacatacgacatttggg1980
gaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagttttggtgtt2040
ccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaag2100
taattttatgtttttgttttgtgtctgcagattcagaaaatggacctggaggcaaggagc2160
ctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaatctgacctc2220
aacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcaggctacccgg2280
gaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgataggacttg2340
actgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaac2400
gtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtg2460
tcattttgtcatgtcattacacatggttaggatacataggagattctgttttttaacaca2520
tagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatcttatacatt2580
aaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgtgtagatat2640
ggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaa2700
cctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggc2760
ggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgcccgaatct2820
cggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccagcttggcca2880
gcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggctaaaactgt2940
cacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacctttccaagt3000
tttactttagcggagtttttttggaacgagagcaaagcagcacagtttcaagaatgtggg3060
gcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagtatgcttca3120
tcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttattgagaaaat3180
aatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcggtgctaca3240
tatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaactccagtt3300
tgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttgcaatcatg3360
ataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcca3420
gcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttcactcctat3480
gattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatgtttctcat3540
tgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggctacaaagga3600
tgccaatggtattgctgctctctatattgttcttgttctaatgtaa3646
7
3646
dna
artificialsequence
cdna
7
atggcacactttgatgaactagaggataaaacaacagattatgttgatttatcggttcaa60
gaatttgctcttaagcaacctcaatgtggcatggcttataattactatggaaatttaagg120
ctttatgtagtagccaataaagctgaattggcctcttcaatatttgaaatcgataaggta180
aacaaaggcggagttaatgcatctatgccagtgaccacttccactcctaattcgaatcaa240
aattcatgaaccggttatggaacaaatagagaatcaagtttcggtgaggtataatacgat300
tcccctaacccatggaatttacctagtaaaaatcctgtagttaatagtgtactagtaact360
tctgtcaccgacttgaataaagctttgaatgagtataaaaatgagatgtctaaatttatt420
gagaatagcttggtgtatagattaagcctagtagaaacacttataacaagttgtatgctt480
caattttttttgatttttttggaagctactcatagttggagggtaccaaatttacaaaaa540
aaaattggtgattataatagtaaatctaccatagaacatgttagcttgtttcttgctctg600
agaggtgaagctagtagcatgaaaattgaatgtgcgttatttttctttttcacttactgg660
tacaatttttgcatggtttatgttgttgccctgcttgttgtattggttcatgggctggtc720
tgtgaaataatttggcgatagccattttcttttttgagatacattgcttttgctatatat780
atctagatatggtgcatatttaaatgcataataaaaatgtaaaaatctaaaacgtcttat840
aatttaggacagatgaaagtactagatattagacatttttagtgtttttattaaaatgga900
atatgtaccgcctttgatgctacaacttttacttagcttttaaaacacaccattctaaat960
tgtaaaaaaatattaaaaatgtgttttgcaagatgaatatactaacctttgttatgataa1020
tagttttcatatgttaatggaacaagctaaaaagtttggcaaagtatagtcctatagctt1080
ccatttcgactcagagagagtatgttgtatccactaaccgtgtacacaagatagcccaac1140
taattaattattttgtgagctatcacccaaccttctgtttatcatggattcatggaaaaa1200
tgtaattgccatcattacactaaaaactaaaacttatgaaggagaaccattgtcttgcta1260
tatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatcctgtttca1320
aatctcttctactacttaagtttctaacgtaggcgtccacaaaacggattggtgcacggt1380
tctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttccccgccgcccc1440
ggatctcgcgccagccccagccctaccccgcctgcccttccattcttccccagccgcccc1500
ccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggttcgacgag1560
cgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccaggcaggtgg1620
gccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgtacgcgcgc1680
cgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcg1740
agggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacg1800
cctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaa1860
gaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctggtaaatag1920
atgccgcgacacgttctggtttggggatccccttggctaacaggacatacgacatttggg1980
gaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagttttggtgtt2040
ccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaag2100
taattttatgtttttgttttgtgtctgcagattcagaaaatggacctggaggcaaggagc2160
ctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaatctgacctc2220
aacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcaggctacccgg2280
gaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgataggacttg2340
actgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaac2400
gtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtg2460
tcattttgtcatgtcattacacatggttaggatacataggagattctgttttttaacaca2520
tagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatcttatacatt2580
aaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgtgtagatat2640
ggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaa2700
cctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggc2760
ggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgcccgaatct2820
cggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccagcttggcca2880
gcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggctaaaactgt2940
cacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacctttccaagt3000
tttactttagcggagtttttttggaacgagagcaaagcagcacagtttcaagaatgtggg3060
gcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagtatgcttca3120
tcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttattgagaaaat3180
aatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcggtgctaca3240
tatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaactccagtt3300
tgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttgcaatcatg3360
ataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcca3420
gcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttcactcctat3480
gattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatgtttctcat3540
tgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggctacaaagga3600
tgccaatggtattgctgctctctatattgttcttgttctaatgtaa3646
8
10605
dna
zeamays
8
aggaatcttaaacatgtggaacaggtgctcaacacatttagcaactagttgttgatgacc60
cataactttgcagccttcataatgcacacaattgatgcatcaattgcatacctcctgtct120
ttgtcaacattttcaacaccttttttcttctcatcaacaggaggcgatggaatccaaaaa180
gagtgacaacaaaaatattagtataataactaaactctaagtctaccaaacagtgaaaga240
atagtgaaacaggaaataccttaattcttatcattatgttaataaatttaaaattgaact300
aaaaaacatcattaggtgagatggatctatttgtcggttccctgttaaggatcttgtatt360
tcacacaagaaagtgaatggagcaaacacacactacatcatctgacatgtttagttgtgt420
tgcaatttcaaaatatctaagcctgtcggatccattgaacaatagtagtaatgtggcatg480
tcaaaaaatgtcaaacatgttaatatagcaccattttttgtgatgcagaaatgacccgag540
attacctgatatgtcataacaacgagctcaatagggttagcatcaaacttcgcacttacg600
ttgcaggttcccatccatgtcagcttcatgtgcttcgttcttgtttgatggaactccttg660
aaaacatctacacatttcagtgcctcatgagactcgccatagcgagcggggtggaggatg720
gtgacaatttcggcagcctcgccgggggcagcactgcgacgagcgaggcgggaggggtgg780
caatctcgtgggggaggggccaggcacgaccgtccaaccacggcgggcggagacacggta840
tccatgtgagatatagccacctcatctagccttatctccaagattttaaatcactgatca900
tcataagcgatcagatggaggccagttcacatcgaccgacaggactgatactaacaggac960
catccacacttgcacatactataaagattaataagagattacataagaactaagtagtga1020
atcagacacccattgctgaccttgttaatcagcccatcgccaatgaaggtgctgctgatc1080
ttcttgaacaacctgtacaatgcctcgaccttgttcacgctgactgcaagaatgacaagc1140
agaggaaacccaaccaggcaggaaaatgatgaccaacactgaatagaaaaagtaaatgaa1200
cacctccagctatcaaaattttaaagcaatgtgaagtcctcaaagaaccaggacaacact1260
catgattttttataactaagggaattgtttatcatcaattcattctaaaatacaagacaa1320
tcaaaagaactaagcaaagcatgagatacaaaaattcaaagcacatgtatagtgtcttgg1380
taaaaaatttacaagatggtgaatgaattcaactcaggttgtctacttcagcattagttt1440
gcactgtccagaaaaagaacaacagcaagattggaataatgctatggccaccagaataaa1500
aggtcagagctgtcttttaatgctaatattgttcatgccaaacatttctttgttagcttg1560
tgaatttatacttggacactggactgggccttgatcgacgctggcaatatcatgctgaac1620
tctgaaggcaccaaaactgttagctccttccctcgtcaatttgtcaattcaacatgtctg1680
cttcaaaatggttatgcgtaggttgaagaaaagttgggagtttacaaaataatacaatgg1740
gatgcctgttctatcatctaacttaagccatgtatcaaggttgcaagttacataaaatac1800
gcttatattctgatggttggaaccacacattctacacgtttcccaaaacaatgaaaaagg1860
tagttgtcgaaagatttaagcatctaaagtgtccactctctctgagagcatcaaaataaa1920
gtagtacgtcttatgttttaaactatttattgaagtaccaaactatacggctactaaaga1980
tttatttagatgagtaaacgaaataatttatggtatataaattaagaaggggtgattagt2040
catgaaaaataaaatgtcacaattaccagcagcacgtgattttctaaataatttaagcat2100
gtgcggtgctcttccagataaaacttaggggacgaccacctagttcattgaaagagggga2160
ataaaccaagctccaactttcaagcttgtcaaggcttgtcattattaatttaaacaggac2220
agccaattctcagacatgatgttccaaactgctaatgaatatataatgctcaaaataaac2280
aactaggttcttaactgtcaattacacccacaagatgcacataattagaaaaggtaaaag2340
agaaggcaaatggaataccaggaattatatgactactaaatcatttatttagataagtag2400
atgaaataatttatggtacataatataagaacgggtgattagttatgagaaataaaaggt2460
caccattaccagcagcatgtgttgttctaaataatttaagcctgtgtggtattctttgag2520
ataaaacataggagacgaccacctaattcattggaagaggggaacagacgaagctccaac2580
cttcgagcttgtcaaggcttggcattattaatttaaacaggacagacaatgctcaatctg2640
aactgccattgtatctacaatactcaaaataaacaactagattctgaacaaccagattat2700
ttgtactcattccatgtctcataaacaaggaaaaaataacaaccagattatttgtactca2760
ttccatgtctcataaactttgggcaccatccatccaacacatccaatctaaacacaccaa2820
acgatggggaatggaaagagcagtattcgattcaacaatggcaaacaaatatcactgaat2880
tagaccaagaataaacctaattagacaacgacctcccaaccatcattcgtcaggctgtaa2940
agaagataaagctgccatggggcatggatcaagcagaacaccagagatgaatccaaacac3000
acagaaaatcacgcgcgctgtctacaatgacaacaagccccacatttcattgcagtacac3060
tgggctacaaaggcacgtacaacaaagagctagggaaacattgcggagggcacgagagag3120
cagctaacttgacaatatagcagactgagcttgcactgttagcaggcgaggaagggaatc3180
atggggacggagaatggggtccatgcccgcgaaggagaaggcggacgccgccacggtggc3240
accggcgcacgcgcacacagggaacccgcacaggcagccatggatgctgcctcgccattg3300
cgccggtcgtctctgccacgctcctctctctctcccgctgcatcgccgtggatggggcaa3360
gcagagagcagggactgcgacgatctgggcggaggactcgccttggagagcgcggacgca3420
gacgggattctagggagagagcgaagacggggcgcgcgcggcgctcgcgcggcgtggtgg3480
cggcgagattagcgggggtggggggagggcggagccgtggtgagggtgtggacgccctcc3540
ttaccctcttaagtagtagtagagatataatccgttccaaaatatccatccgttcaattt3600
atatttcgtttgatctttttaccctaaatttgattgactcatcttattaaaaaagttcat3660
aactattattaatctttattgagatatcatttagcatataatatactttaagtgtggttt3720
tagattttttttaaaaaaaaaaattcgcaaaaattaaatgaaacgacccaatcaaacttg3780
aaaagtaaaactaattataaatttgaacggaaggagtaagaggatgtttgaatgtactag3840
agctaatagttggttgctttaaaatttgctagtagaattagctagctaataaatatctag3900
ataactattagctaatttgctaaaacagctaatagttgaactattagctagattgtttgg3960
atgtattcggctaattttaatggctaactattagctatagtacaatattcaaacacctcc4020
taattaaaatggacaaatatctcttcttttggtcccttgcgttagatttttcatatctcc4080
ttatttagtataaaagaatcatcaaaaagtggacaacccctagtggaacaccattttagt4140
agtggttgcatgaaacctttcgcgcaccagtttctatgtgtcactctaaaaatgggacag4200
catgtacgtagtgcctatatatatacaagtcatctatcgttgcctcctcagttcatcact4260
aatcacacttattgtgccctcgacgagtatctatagctagctcattaatcgattcggggg4320
tgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgcaatacctgtag4380
cacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctggggcagagggtgac4440
ggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaaccatcctcgcctt4500
cctggaggccaggctgcaggagctggacggaccggaggcgaggctggcggactacttcga4560
ctacatcgccggaaccagcaccggcggtctcatcaccgccatgctcaccgcgcccggcaa4620
ggacaagcggcctctctacgctgccaaggacatcaaccacttttacatgcagaactgccc4680
gcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgcatccatccagc4740
atcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtttgtggcatgca4800
tgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctgaggaagccaaag4860
tacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgagacgagggtaagc4920
gagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgcagcctatcatc4980
ttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacgtcgtcgcatgc5040
gaatggctgcctacgtacgccgtgcgctaacatactcagctctttcctatctgctgcgcc5100
aatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggc5160
acgtccgccgcgccgacctacctcccggcgcactacttccagactgaagacgccaacggc5220
aaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacccggtaactgac5280
tagctaactggaaaacggacgcacagactccatgtccatggcggcccacaaggtcgatgc5340
taattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggttgcgatgacgc5400
agatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagc5460
cgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagg5520
gcctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacg5580
gcatggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacg5640
tcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaact5700
cgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcg5760
tcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacaggga5820
ggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggc5880
agctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagag5940
gctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtcagtcacatgctt6000
gtaaataagtagactttattttaataaaacataaaaatatatatatgttcttgaatataa6060
aattgataaccaaattaaaattcgaaccatcacttatacataattttactttatttttta6120
taaaacgtgaacgggaaggactaccgtgaatgactatagaaccaatcatactagtataaa6180
atatatgatgacactacgggagagacaaactttgtctggcgctaaatattttgccgagtg6240
tgaattcacgggcactaggcaaagatcttctttgccgagtgttacgctgggcaaagtaag6300
acactaggtaaatcagtcatttgccgagtgtccgccactaggcaaagcaaaacactggca6360
aatcaaaagtttacctagtgccagacactaggcaaaaaaaaaacgctcggcaaatcggaa6420
gtttccctagtgccagacactagacaaagaaaaacacttgataaactagcgtcgtcagct6480
aacaccatccaccaaccgttaacgttgccgagtatctgacttcgacactcggcaaagaag6540
gtctctttgcctagtgtcggtctggaacactaggcaaagaggcactttacctagtgtcgt6600
attttgacactcagtaaaataattttttttctttctgcttccaaactttttatgatgtgt6660
tcctatagcacctagaactacatgtcaagttttggtaaaatttttgaagtttttgctata6720
tttacttaatttattttatttaattgaatttcttttgataattcaaatttgaactcggca6780
aggtaagaagcgagggtagcctggaaacacactttgcctagtgttacactcggtacagga6840
gcctcccctgcctagtgctgcactcgacaaaagattcgcctttgcctagcgctgcactcg6900
gcacaggagtcgcctttgcctagtgctgcactaggcaaagcctccgttaccgtgccttcc6960
atcgtcatggaaacttttcttcgccgagtgacgtgtggcactaggcaaagtttttgccga7020
gtgcccgagaaatggcactcggcaaggactctttgccgatcccttcgttgccgacttctt7080
tttgccgagtgcaacactaggcaaaccatttgccgagtgtaaaagaggctttgcctagtg7140
tctgtggcactaggcaaagaagacgagtcctgtagtgaacctagtaggccagtgcgggac7200
cattccaaaaaatacctataaaaataaatttaatattaaattaaacatatggtccacgta7260
ccaagatattaaactcaaaagaacaattattacaatttatcttagctaaaaggccgagaa7320
aaagtatatgttaaaaaggagtgtgatcccatttttatagctcgctcggtcgatcgcccg7380
tccacttttaggtaacgaggtggtaccatgtaggagtgttgcgttgcgtgcgacttccta7440
tcatgttgggcttaggtggcttctcacgacccaatgataggcgagaagtgtggaagatga7500
acaaacctacttgtttcgtgcacgacgcatgtgtttgaacaacgagttagattagaaaaa7560
aaatataatgacttttttttttgcaaaagtgaggataatgaaaaccagaaaaactggtgc7620
ttcataagagtagagatttgatggtaaatatagtagtaatgcaatggctatactacacgc7680
gagagtccaatggcaagccggtgtgttggggcgaaggcgaagacgctacccttcgctcca7740
ggcctttgtcaactcgctgcaccaacagaggcaagatgaccggcgcggcccacccttcgt7800
cctcttcactgcaagacgaaggcctacgacgaagtctctccatcccacgtcctcgcctta7860
cctggaggcccacgtgggattcggcccatcgtaacaggccccgcacggacaggcgtgtta7920
cgggtttgatttgtaatagcttttctgtaatgacagtttgtaaccctcccttatgggaat7980
attctggggataatccaggtgtctgagggcataagcgtccttacatcgggacgttgggcg8040
ctcgggcacctataaatacccccgtacagtgcccttgagaggctggattaacatagcaat8100
tgccatctcgagttaaaccttgcttgcatcctttccactctcccgttggatcaacttgcc8160
caagagagctagttccaacatttggcgcccaccgttcgtgctacgagcaaaccacccgcg8220
atggcacccaaaagagctagttcgaaggcagccccatccgtcgacgaagcggcgaaggca8280
gcactgctagctgagaaaaagggcaaggccctcgcagacaacacccaccaagaagctggc8340
gaagacgaagcactcagtaagagacagcgcaacgatcaacacactctcgaaggcaccctc8400
cgcacctacagctccggaggccaaccacaagtaccacccctaggcttcgctccactagag8460
ggcgaggacacaacagaggacggcgaagtcatcggcgtctcagcagaagaacaactacag8520
ttatgggccctgcgcctcaagaaccgcaacctccaaaagcagaaagaaatcctcgaagcc8580
aagcgccaacgcgtctccgcgcaagccaaagtgcgttagatgatacgagacgaggagcag8640
agggcccgggaactagagcaagagattgcgctcatgcagagcgaaggacagcatgatcta8700
cagcatggcccacccctccagcagcgcgcgccagctagagatttattcattccccagcgc8760
gggcccttcatcccacacgccgcagctttccaaggcatcaactaccttgatgagcgaagc8820
cccctggcgccgcaactccaagtgtcaccttggcccgccaacttcagggcagggagctac8880
cccaagtacaatggcagcaccgacccagcacaatacatcatgagctatcaagtcgctgtc8940
gcatcatccggaggggacgacgccacaatggccaagtccttcatcatcgccctcgaaggt9000
ccggccttgacctggtacaccaggttgcccccactgtccatcgactcctggcgaagtctc9060
cgggacaagtttctgcttaactttcaagggtaccgcccagacatcgatgccttggccaag9120
ctgtcactctacaaacaacaagagaaagaaaccctacgggagtactaccgcaagttcctg9180
gctctcaagtcgcaactgccctcggtcgacgaccaaatcgccatacactacgccatcagt9240
ggccttcgggctggcgtcctatacagtcactgcatcaggtacccacccaaaaacctccaa9300
gagctctatcagttgtttgaaaagtacgccagatccgaagagctccatcagcgcaaggtc9360
gagtctcaaagaaagcccaaggaccctccgcagtctagccaaacatggacaagaccttca9420
cagtcagactccggtcgggacaaccgcagtcagcagcaggtgcataacattgccaaccag9480
caccccgccagcgaagcccctcgccgccaagattatccccccagggccgcggcaatggca9540
cgcgtggtcggggctggggacgggcgcaacagccgtgcagatattactgcctgttttcac9600
ggcgaagactgcacgcacccaaccaaggattgtccggaaacgaaggccaccagggacagg9660
atgtctcgggcacaacccgccgacaacccaagagttgtcgcgcacacataccaacaccac9720
cacccacaaccatacaaccacggccccgcccagcatctacccaaccacgcatatcaacac9780
caccaggagttacaagtcataccacctccacccccgcctccgcatcaaccaaacatccac9840
caccaaaatcaccccaagcaccaaaacaggaagacttcgctgatcagccgtatcgcggag9900
tcattcacatgatcaccggaggggtccagcattgactttgacacgaagcgacaaaagagg9960
aatcactaccgaagcatcaaccacgtcgccatcaccggctcggtcgtgcaaacgaagtgg10020
tcacatgtgccgctaaccttcgacgccagagatgttgatctgcgcagcgcaccccacatt10080
gatgccatggtaatcaactgcagtgtggcaggctgggacctgcacaaagtcctagttgac10140
aacggcagccaggcggacatcatcttcctccatgccttcgaccgcatgggcatcagccac10200
agccctctcaagccttcgaacaatcccctatatggcttcggcggcaagggcaccttccct10260
gtgggcaagatagagctacccctatccttcggcgtagcacccaatgcgcgaagcgaatag10320
gtcacctttgacatcattgacatggtctatccctacaacgccataatgggtcggggctct10380
atcaacaaatttgaagcggcaatccacggattatacctctgcatgaaaattccgggtcca10440
caatgcgtaataacggtgtacgggaaccagcagactgcgcataacattgagagagatttc10500
gttcccgatcaacggaacgtacactgccttacgacgcagcgcgaagtccccgaggctacc10560
tgcctagctgccaacaaaaatgaaaaggcacagctaaaaagcaac10605
9
11001
dna
zeamays
9
aaatggccgaagctattttggatgaagccatctctcgactattaaacgaagctgcggaag60
cagttttaaaagaagaatagttgttattgtaaaaacatttggaatgtaatatttgctgaa120
caaagtgtgtaatatttttataatttgaatgtaatatataagctgctcgtaactcaattc180
tttacgatgcatgaaactttacgtacataccgtttttgagccttcggcgaaaaaacacct240
tcccttcttttcatgcttcgtgaagaatatccatacttcgtaaaaacattatgcttcata300
agcaatagatctctttttcatattagagttgatgaagttgtacttgttcaaaacttattg360
tgccttggcactgcttcttcgaaacaatctcgaagatcaacattgtatccccttcttgtg420
ttattgatgcaatatgatgttatgctatgcaaaatgatgtgatgatgttatgctatgcaa480
aatgatatttatgtcgaagatacataaacattcccacagtagagcacacaatctttttgc540
cgtttatttttcggcttcaccgcttatttttcggtgtatcagcgctgacttttcgctgta600
agcctcccttaggtgcttcttcgccttttacttcggcggtatttgcgttgactttttgcg660
cttcgccttatacttcggtggaatcagcgtttatttctcgctgtaagctctgcattccct720
ttggaacgacttttgagcagaaaacttacgctgcgctcccttagaaatgactttttgtaa780
cttcggcaaacttacgctgcgtttcatagaacgacttttttgtagtttcggagatacttt840
ctgtagccacaagttcttaagaacgagttttcatgcttcatcaactttttgaattccgta900
agtctgtggagaagatatattttcactatgacaaaaacaaagctgttacaagaaattgaa960
aacaacaagaaaaacttaggctttcaatgattgttctttattaaaaagaaaaatgataac1020
taatgcaagaactatttcagaagtaggatatctgttagtagatgtgctttgactctggca1080
caatactgttgactgtgcgagcttcggactcctctctgaagtctcgttgctgatgagtgt1140
gctggctcccttctggctgctggcctcgttgtattggtggtggaggtggaagctgttgcc1200
aagatgcctgaggttggcttgccgaagcaacagaagctgcaggatggttacccttaccca1260
catactctggaatgtacggtgagtggtacgaagcagtatgcataacttgcttcggctggc1320
tctgttgggctgcagtttctgctatctctttctgtttctggatggtgacatggcacatcc1380
tggtagtatggcccttgtcctcaccgcagaatagacaataaattttcctgggctgatccc1440
caaaccttcctccgaagcccctggctcctctgccccttggggctggaggccgaggataac1500
tctgttgctgccccgaagcctaggaggaatattgcggcctctgttgctgacttcccctgt1560
cgtcattctgagtggaatgaattgatctgacatgtctagggtgtactctccctccgaagc1620
ccctggtcatctcggagaatctgtaagaaggcccccaacacctctctttcccattcgttt1680
aggtgagaaaacacattggggagcactagggagttctttcctagtgaggcgtctgtgatt1740
tgataatcacaagattaaggatttcattagtgcatgtgtagtagcaagtgtgcatccacc1800
ttcctcattaagcttgtttaggataagccagagtttgtgccggttactcttgatgttcaa1860
caacaccaagatggcttggtggtaattaagagcttggtgatctctcagtggtgctcgtga1920
gagtcccaactcattgtgtaataaaagattataggtgattcaccatgccggagtggtgaa1980
taatcaacccgtagagagcattgagtccttgaatggatcgatggggggctacacccttgt2040
gtgggtcaagtcagagttttagcagttcttgcacccatgatctcatcgtgaagcatagat2100
aaatttaaattcttttgaattatttatatatgacaacactattcgtcgctctaggtgact2160
atcacctaccctaaaatgacttaacaaatctttattaattgttaagtcattcacattttt2220
gttaatccactccaaagtcagggtgtttagtgtttttacatccatgtctccttagactca2280
cggtgtctctcccagattctctctcaccctcacctctctctcactagccactagggaacg2340
caacacccatcgatggctcttcgccccatgaaacgttcacacaatcgcaattgtcgaggc2400
atgcatggctgggagagcagacatggaggcatacgtgctagggttgcacatgggcaagag2460
ggtgggtgtggctattcagatatgcatggtgagcaagatgggtgaggttgtgggcatgat2520
gaggggataaggaagaataagatctcttttgttaggctgtctccagcagctatcgtatcc2580
cattccctatcgcatcccctattttaaactttactatgcaaacaatgtaatatatagtgc2640
agattccctattttacacaatgtgttgtagacaaccttggagctcttgcataaaagctct2700
agttttggctctagctcctctgagaaaacaatccccaccatgtttttaggaagaatccct2760
gaagggcaccccatttggttggaaatacatctcctcctacaggattatgtttgacttttt2820
tttgcaatgtgggacccacaggggagaggaggacgagaaggaaccggagagcctattttt2880
tgggctcctggcttcgcttggtttctaggggcggctccttcctattttcacaaaggagct2940
agtagaggagcctcccatttcatgattttttgaaggatctatttaaggagccttgaaaga3000
gccctaccaaggtaggcctagaaataataaaggaggaaaaagagaaggtatcacaacttt3060
tgtctacaacgtgaaaatgtttggctaaatagataaaacagtttgaattttatcgattca3120
attgtttattgagggcatgtttgggagggctttagttctagcttctttcgcgaaaaatcc3180
agagccctacaaaatgacgtttggtaaaacgacttcttccgaaaaacacccaaaaaccca3240
agatattttatactacgaaggaaaggtcacacatcctagttagcttcactggttctagct3300
ccttccaattttgcaaaaaagtcacaaaggataagccattttttcaaatgatttgtgaaa3360
tgcctacgctaaaaagtctacttttccaaaaaaactagagctagagccgtttttggcaag3420
tcagaaccctaccaaatagtccctcagtttaagcaaagtgaggctatactgaagctaaat3480
tatgccaaattgggcctacatctccatattttcaaccaaatgctttagggtttcttgtaa3540
tcgacatgatttgtttcttcataaatagtatatggaccgctccaaaatactccatccgtt3600
tcaatttatattacgtttgatctttttaccctaaatttgatcgactcgtcttattaaaaa3660
agttcataactattaataatctttactgtgatatcatttagcatataatatactttaagt3720
gtagctttgattttttttttgcaaaaattaaatgaaacgacccaatcaaacttgataaaa3780
aagtaaaactaattataaatttggacataaggagtaggagggtgtttgaatacactagag3840
ttaatagttagttgtcttaaaatttgctagtacaattagctagctaacaaatatttaggt3900
aactattagctaatttgctaaaaacagctaatagttgaactattagttgaactattagct3960
agactgtttggatgtattcaactaattttagcagctaactattagttatagtataatatt4020
caaacacctcctaattaaaatggacaaatatctattcccttggtcccttgcgttagattt4080
tccatatatcctcatttagtataaaaagaatcatcaaaaagtggacaacccctagtggaa4140
caccattttagtagtggttgcatgaaacctttcgcgcatcagttactatgtgtcactcta4200
aaaatggggcagcatgtacgcagtgcctatatttatacaaggcatctatcgttgcctcct4260
cagttcatcactaatcacacttattgtgccctcgacgagtatctagctagctcattaatc4320
gatcaatcggggtgtgcggtcgaaggcggcaatggcgagctactcgtcgcggcgtccatg4380
caatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctggg4440
gcagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaac4500
catcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggc4560
ggactacttcgactacatcgccggaaccagcaccggcggtctcatcaccgccatgctcac4620
cgcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaactacttttacat4680
ggagaactgcccgcgcatcttccctcagaagtgagtccgatgctgccgccattgttctcg4740
catccatccagcatcgtacgtcctctatacatctgcggatgatcatttgcgcatgtttgt4800
ggcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctgaggaagccaaag4860
tacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgagacgagggtaagc4920
gagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgcagcctatcatc4980
ttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacgtcgtcgcatgc5040
gaatggctgcctacgccgtgcgctaacatactcagctctttccgatctgctgcgccaatt5100
tgcaggccaagagcacgcctctgaagaacgcgctgctctcggacgtgtgcattggcacgt5160
ccgccgcgccgacctacctcccggcgcactacttccagactgaagacgccaacggcaagg5220
agcgcgaatacaacctcatcgacggcggtgtggcggccaacaacccggtaactgactagc5280
taactgcaaaacgaacgcacagactccatgtccatggcggcccacaaggtcgatgctaat5340
tgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggttgcgatgacgcagat5400
caccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaacccgtc5460
gaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggcct5520
ctacacggcgcggcagtgctcccggtggggcatctgccggtggctccgcaacaacggcat5580
ggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtcgc5640
cgcgatgttccagtcgctccacagcgacggcgactacctacgcatccaggacaactcgct5700
ccgtggcgccgcggcaaccgtggacgcggcgacgccggagaacatgcggacgctcgtcgg5760
gatcggggagcggatgctggcacagcgggtgtccagggtcaacgtggagacagggagcga5820
ggtacgaaccggtgaccggagaaggaagcaatgccgatgccctcggtgggctcgctaggc5880
agctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaaccccagaa5940
gctctagatgtgcgccctacgatatctaagacaagtggctttactgtcaatcacatgctt6000
gtaaataagtagactttattttaataaaatataaatatatatatattctgataaccaaga6060
ttcgaaccctcacttatacacaattttatcttattttttataaaatgagaatggaaagga6120
ctaccgtgaacgactatagaaccaatcatactagtttaaaatgctcgtaagctatgacga6180
acctagtaggccggtgctggaccattccaaaaaacctataaaaataaatttaatattaaa6240
ttaaacatatggtctatatatcagatattaaactcaaaagaataattattataatttatc6300
ttagctaaaaggttgagaaaggtatgcgttaaaaaagagttttaacccatttttatagct6360
tatttgatcgcccgtccacttttagggagcgaggtggtactatgcagaagtgttgcgctg6420
tgtgcgacttactatcatgttgggtttaggtggattctcacgacccaatgatagacgaga6480
agtgtgggagatgaacaaacctacgcatttcgcgtacgacacatgtgtttgaacaacgag6540
ttagattggaaaaaatataatgaccttttttgcaaaaatgactacaatgaaaaccaggaa6600
aaccggtgcttcataggagtagagatttgacggtaaattgttacgatctactggtatttg6660
ctgcgaggatgtattcgcttggtgaaaacagaattacagagtagcagtagcagggaagac6720
agtagcgagaggagaagaagaaacttgaggaagaagaagataaatgtagttgttacatcc6780
tgccttcgccgtaggtctcagcgagcatatatcttcaggtcctccattctgggccctgga6840
atctcacattggccttacgctggcgtgttcctcttctcggcccaactgtagtcttctctt6900
gaggcccaccagtctccacattcctttgttgctgctatagctcctcggacacggctgctt6960
ccgcctgctgctgcacctggatgtcttctgaagtcgacttgcgtggagggacagtgctgc7020
cattcccctcccgataacacgctgcttgtccccaagcaggcgctcgagggaacctctgac7080
gaagtggaatcaggtcctcccaagttgccagagatggatgcaactcagaccacagaatca7140
accgttgtgatgctccattaggcccccatcggcattgtagtactcgttcaggaatttggt7200
ggttgtcaaggcgatcaggaagctgtgccaccaccactaaagaacccactgccttcttga7260
gttgtgaaacatggaacacgggatggatagtagaagtaactggaagctccagcctgtaag7320
caacagatcccaacttagcagcaactggaaatggcccaaaaagcgaaaatctagcttctg7380
atttgcccgaggtgcaagcgatgactgcacataaggctgcataacagccttctcttgcag7440
ccactctccgaggataggcacaggagtggaatccaaaatgtcaatgccaaaatgcttggg7500
tgcataaccatatagcacctcaaatggagacattttcaatgctgaatgccaactagaatt7560
gtaccaaaattctgccaagtacaaccagtcaatccatttgtgaggacaagcatgcacaaa7620
acatctcaaaaaggtctcaaggcactgattaactctctttgtttgtccatcggattgagg7680
gtgataggaagaactcatattcagtgatacaccagccagggtaaacaatgatttccagag7740
ctgactagtaaagattttatcgcgatcagagaccatagcagatggcataccatgcaggcg7800
ataaatgtgttgcataaaggccttggccaccactgcagctgtgaaggggtgtttaagggg7860
aatgaaatgtccaaacttggagaatttgtcaaccacgacaagaatacaatttttacctcc7920
ggacacaagcaagccttcaacaaaatccatagtgatcgtttgccaagctcctgaaggcac7980
atgaagaggttggagtaggcctgggtatttcactctctcaggtttagcttgctgacaagt8040
ggcacatgctgcaatgaactggatgacagatttcttcatgttcggccaggcaaacaactg8100
cttcagccgatggtaagcgactgctatacccgagtgacccccaacagcagaactatggag8160
agcagacaatatagactgttgaagtgtatgattgttaccaacccaaatacggcctttaaa8220
cttaagcaacccttcttgaagagtaaaatgaggaacaacatcttggtcaacaaccaactt8280
agataacaaggtcttagccgaagggtccaacaaatatccatccattaccaaacgagtcca8340
ctatggtgaacagactgagagagcatgtaatgtgatagcatgttgtcttctcgacaaggc8400
gtcagcaactctattttcatgtccatgcttgtatacaatcttatattgcaaccccaaaag8460
tttagtaaagacttttgctgccatggagtgttaagccattgctcattcaaatgcaccaaa8520
ctcttttggtcagtataaataacaaactccccatgaagtaagtaagctcgccattgttcc8580
accgcgaccaatatggctaagtactccttctcataggttgataagccttgagtcttaaca8640
ccaagaggtttgctgagaacgctaatggatgaccattctgcaaaagtacagctcccaccc8700
cattcttgcaagcatcggtctcaatagcaaaaggttggtgaaagttggataatgctaaca8760
ccggggctgagatcacagcttgcttcaaggtattgaaggagatttcttgatcttgagtcc8820
aaacatagaacacccctttctttcacagtgcatttagaggtttggcaataatagcaaaat8880
gactgacaaatcgcctataataacccgccaaaccaaggaagctccttaactctttaacat8940
tggagggcacaggccagttcaacacagcatcaacctttgcaggatcagtatccactccag9000
cagcactgatcacatgacccaagtaagcaatagatgtttgagcaaatttacacttagact9060
tcttgacaaaccagtggtctttttggagaatggtgagaacttgggccaagtgagatacgt9120
gatcgtcaaatgacctgctgtagactaagatgctatcaaagaagactacaacacacttcc9180
tcaacaaaagggccaaagaagagttcatagcgccctgaaaggtaccaggtgctcccgaca9240
atccaaaaggcatgactcaaaactcaaattgaccatgatgtgtctagaacgctgttttaa9300
actcttccctaggcttcaacctcacccgatgataccctgaagccaagtccaatgtggtga9360
accaacatgcaccatgcaactcatccattaactgttcaaagatggggatggaaaacgggc9420
tttgaccgttaaagcattcaaatagcgataatccacacaaaactgaaaagtgccatcctt9480
ctttctcaccaacagcacaggagaattaaaagatgatgcactgggtctgataatccccga9540
ctgaaccatttctgccacctgacgttcaatttcatctttcggagctggtgggtatcgata9600
gggtctgatattaactgggctggcaccagcaaccaatggtatactatgatcacaacttct9660
ttctagaggtaaggacataggtttagcaaaaaccgactgaaactggttcaacagctgaac9720
aatttcaggaggaagggtagcctcatcagtcagagcaacagacacttgagatagggaaat9780
ctgaaccaacaattcgtcaaccggctctacagtttccccttgcaagagcacttgcacacc9840
atgataagggatcaacatccatcgttgtttccaatgcacttccataggactgaaagtctc9900
taaccaatcaagtcccaatatcacatcaaaggattgtaatggcaacaccttcagatcaaa9960
ggaaaacccatacccctaaattgtccattgagcttgagtaaacacttgggagcaagtcat10020
aacaccttcattagccactttaacctggagacaagttggtaccaaagtaatgtcggttaa10080
cctggaaatcatagtgtgactgataaaagagtgtgaactgcttgaatccacaagaataac10140
gatttcatactcctgtatagaccctctcaacaaaatggatcctttggcccttgatcttga10200
aacagcctcagcagatagagccaagaagagttgttcctcgtggactgaatcttctgatgg10260
agaagactcattcagaaatgcaggcataagatcccagacttcctgcagggcatgaagttg10320
aactgtagtattgcagacatgtcccctatgccatttttcagcacaacggtcacaaagacc10380
acgtgcatgacgataagcacggagtgcagccagtttttcatcagtgcttcgatcagtctg10440
catgagacgctgcttaggtaaggatctcggcgcactggcacttgctgcagcagtagttgg10500
ttgaggaagaggcagggtcgtcttgtgctgggatttcgaccaaataccacgatcaccctt10560
gcagaattcactccgcataggtggagcagcgacctcatcctgcaaaaaagcgagggaaca10620
ggcaatatccagatccatcgacctttgaagcataacaacagcacgcatatcatcgcaaag10680
accatccacaaaccgtgtaacaaaatataacggatcaaccccagactcatacacagagag10740
ttgatctacgagagaagaaaattgttcaatatattgagttaaactacctaactatttgat10800
gcgaaataagtggtgcaacaacagctgatactggtccttaccaaaccgttcgttcatcaa10860
ttgacacaagagaggccaagacaaacggggatgacgaaacataactgacagaaaccagca10920
cgctactgtaggcgacaagtgcatagaggcaattttgacccatgaattcagatcaacttg10980
atatatatcaaagtaattctt11001
10
1284
dna
artificialsequence
cdna
10
atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60
agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120
ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180
gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccggaaccagc240
accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300
gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360
agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420
cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480
atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540
aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600
ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660
tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720
atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780
tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840
ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900
atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960
gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020
ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080
gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140
tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200
ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260
tctagatgtgcgtcgtacgatatc1284
11
1140
dna
artificialsequence
cdna
11
atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60
agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120
ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180
gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccggaaccagc240
accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300
gctgccaaggacatcaactacttttacatggagaactgcccgcgcatcttccctcagaag360
agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420
cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480
atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540
aagagcacgcctctgaagaacgcgctgctctcggacgtgtgcattggcacgtccgccgcg600
ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660
tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720
atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaacccg780
tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840
ctctacacggcgcggcagtgctcccggtggggcatctgccggtggctccgcaacaacggc900
atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960
gccgcgatgttccagtcgctccacagcgacggcgactacctacgcatccaggacaactcg1020
ctccgtggcgccgcggcaaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080
gggatcggggagcggatgctggcacagcgggtgtccagggtcaacgtggagacagggagc1140
12
16619
dna
zeamays
12
atcttttattggtttgagttgaacctatatgcacctgtagaatataatctagagcaaact60
agttagtccaattatttgtgttgggcattcaaccaccaaaattatttataggaaaaggtt120
aaaccctatttccctttcatccgggcccttgcggcggaccgtccgcgacaccagggtgag180
ccttggacaggaacactgcaaaaacacaagttaacactacggatcgtccgatggagaagc240
gagcaccgtccgagaccaagcacggaccgtccggcctcaggcgcgaatcgcccggtcgtt300
gaaaaaccagaaaaacccgaaggtgacgggttcggtaaaatgcatttttagcgtccttgc360
ggatcgtcctgggtgcacggtcggaccgtccacgactgctttatctgacatttgacgacg420
cattaaaagctctatagccgttactcctgaccgttgtgatttcagtcgttgatgtgcagg480
ggtacggaccgtccgcggtcggtagaaaatgagcaacgactaggaagtggttggaggcta540
taaatacaagagaaaattcctggtatgccatcagattttatactcatcccttgtgtgcca600
ctgagtggcatataagtatatttttttgtgtctatgacatgtggggccagtggcatacaa660
ggaatgagtatatttttcagtggcatacagggaattggccctaaatacaaccccaaccac720
ctccattcaaatgatccaagcactccactcattcacattcaatacaggagctagcaatac780
attccaagacacactcaaagctttcaatctctcaaagtcccacaatttagacaagtgatc840
attagtgcttagtgacttgagagagtgtgatctatgtgttatttgtcgctcttgttgctt900
ggctttcacaattgggctttcttcatctctttctcaaccttctaagtgaattataaagca960
agcaagagacacctaattttgtggtgatccttgtggggtcttagtgacccgtgtgattaa1020
gaagaagcactcgaccggtctaagtgaccgactgagagagggaaagggttggaatagacc1080
cggactttgtggcctccttaacggggactaggttctttggaatcaaacctcggtaaacaa1140
atcgctgtgtttatttgtgttgattttcactcgatttgtttcccctcccttcctctctct1200
aaaattcccttgctcatattgttgtgagttggctctcaaagttatctgcattgattgggc1260
aactacttgcaaggataactatcttccgcactccgaattatttctgacattaaccccggg1320
cataatgtgtgttttaagtgtataattttcatgtttcgcctatttacccccctctaggcg1380
actttcaaatgttctccttcacttgtgatgtctacaaccataatcagctcaacatttgga1440
ctatcacccttgaacacttatgttgaactttaaaagttgtgcactaagcacttgtccaac1500
acttaacacacttgtcagtcctttaattgggttgtcatctaaaccaccaaaaaccacaaa1560
gagatctttcaccggggtccgtggttcatggccgtactgctcggtctcaagatttttatt1620
ataaaatcactagagctctactatttatggttcggtgtgccatcgaaccgtctccaacgg1680
gtacatccaacgagcgccagcacaccaacgactagttgacgtggtctacggtccagaggc1740
tcatcagacttgtcaggaggctcgtcgactggtctggtgccacacccgtcgacttgggat1800
cgaacaagaatgatggcaagaggacgaagcgcatcaagaagatcatctactacgactcct1860
cttacccttcacacaaggacgacgattccacctcctccaagaaaaatacggttaaacaag1920
gttactctaagacatcttttatttattctcgcattccttacaatttcaatgctcatttgc1980
tttctattcatcttggcaagcctgctcgctttgatgggaggactattcttggtggagcca2040
taaaatgcgtagccatatttttttgctccaccctagcatttgggatgtcgtagacaatgt2100
aatgcaattgctaggtagcgatgataaaaattataatactattattgcccaataatctat2160
tcataatagcgcccaagctaaagaggtgcgctaagggatcctttttatagcccaaagagg2220
tccataggcgttgctccttccttctaaacatcgatgaaattgtgttgtctgcgaggacat2280
caaaccgggtctgtgcacaccttctccgggatctgatttggtgctttccttaactgattc2340
acagcggaccgatccgatgcaccaccggaccgtctgccacatcagctagtcgttggcctt2400
caaactccaccgggaagtagccgttggaggcggtccggtgcgccattggaccgtgcaaag2460
tgaagggtcgatgatttttataaataaaatctcgagacctcaaagttcagctctgtaggc2520
ggtccggtgcacatggacctggttcgatgcattaggtcgcctaacactaagttctatgcc2580
ctgcggacttgatccggtgcaccaccagacgagtccgatgaggcctagaacaacccaagt2640
aaggctgttttgagcctaacttcttcaaatccttttggctattcttgggagctttccaac2700
aacttagacaaacataattagcacatattccaattgattaggtgtggagaactcaccttt2760
tactttgtcgttcaccatgatttgcattttggcttaatctaagtattcgaaccacttttc2820
tcacaggatagagttagagttcaaataaagtgctaaacacatagtattagacacatgcaa2880
cttatctaagtaatcaaacctcatgattttacccttttgtccaaagctgcacactttagc2940
cttcattttagttctttaggatctagtactttcaaattgacttcaagtgcttgtgctcgt3000
actcatatcaaattagttagtccatgttgttgtgctaaacacttaatcactaaaacatgt3060
agaaatggttatctaacacatttttctttcataagtaaaacaggagatttatattgtaga3120
tgttattgtttgttgatgaaatttgaaataagagatataagagcaactcgaaaagcctag3180
ctaaatcgatttgtatcggtaaaaatagaaaactgatgattaaaataggatccaacaaac3240
tctctttgctcctctctatgctatcctgctcagcatcacgtcgaggtctctagccatatt3300
tgctgagctcacctgcctcgccatcttcattctctcgtgcatcctcaccgtccctgcgcc3360
ttgccgtcgttgcagctcactgtcccacgccctgctgtcgccatgcctcgccgaccctgc3420
ggctcaccttaccgcgcctcgtcatcgtcgcggctcgccgtctcgtgcctcgccaacccc3480
gctccttgacgtcctcacggcttacagtccccatcgtcctcgtggctccacgcctcgcca3540
tatttgcggttcactgtccacgtgccctgccatcctagagcccgacatcgatatgtcaaa3600
tgagacaggagaaaaagaaatgaacatgtgattataatcagtgatttgaatattgataga3660
taagatttgaagagtctgttgtgtcatatcacctttttatgaaactctttattttttaga3720
gtttttataaaactctaaatttagctaaaattatatctagtcttttagagttactctaac3780
aagagatgagatagcgagctggcgagctgctggagacagccgagagtagagatagtagag3840
gagactagaaactccattaggcctagcccagagtcagctagggatgccgcccgatccact3900
acgtactgaaagcgatcccggcccatgaacgctagtgggttgtaattcggcttacccaag3960
tacccagccgttcctccacacctctgcactacccgaaacccacgcccaacggccgtttcc4020
cgcaccccctatccgggaaggaagaaagccagaactcacccctgcttcgtctggcggcgc4080
cgtttcccgcacgcgatgccgtcgacggcggaggatttcccggcgataaggaagctgggg4140
aagctcttccggcttaccgaagtgtacctctggtaagtctcctgtctcccctacccgctt4200
ctcagcgtagggtttgccgtttgcgaggagtacgtctcctcaaactactctctcttcctg4260
cagggacgattcgtatggcgctggacctcacgatggacagaagaacgggcgctcggcgga4320
ggctgctctcgtggtacggtctctgacccgctttggtgtagctcactcttaagctttctg4380
agttgggggtgcgcttgtgcttcgactagtagatggctaatttcgtcgggctactggtaa4440
tttcttggtatctgcattgtcgagaaagaaggcccgacgcataatttcatgcttgcccaa4500
gagtctacttaacacaaggaattggttttgtggcgtggtttgtgcattgcgcccaaactg4560
tagcctgtacaaaatgttaatcgtcgcgtgcattttaaacaaagttttgtattatacgac4620
aaaataccctcggcacatttgttacagactaccgataagtgcatacctatttctcctagt4680
tctatcaggaaataatcctggacctcgaaatgacagcctcgtctggctagaaccctacta4740
aacattttgagtgatcacttttcattactcattttcttgatgaaagcacattactgacat4800
ggaagtttgctacataagacataacacttccttgtagtgctttatttaattattgaccga4860
tgatctttttggaaaattaagctgtattaaacaattgtagcttcggtgatgattgttgga4920
ttaagcattagtctgctgcagtcctcttcttgattctgatatgacagttatttgttgatt4980
aaaataatgatggtttgctttacacttcgatctcctttgagaggaaaacatgtgaaggtg5040
tggactagatcatgtatagaccaacagcattatcttattaaaacacttctaaataactta5100
gcaatttcataaccatttttacacttctgaggaattcatcttgtcgtgaaagagtcaatt5160
aacttagctgcttagcagactgtgtcaagcttattacttgtatgttgtgccctacaaatt5220
actatgaggtttataatgtacatagcaatttgacgaccttcaacttttcaggactctcat5280
acagataaaacatgcaatgaagcatccaatagcactgacaaaggttagatgtatttttct5340
tgtattctagtatcttccttggtcaattttctttacagaggatgttacaatgtactctac5400
tttttttgtgtggaaacaacccactagagaaaaaaaatcacttcatttgagaaatcttaa5460
gaatctgaactctgaagctcagcatgcttccacaccaccacttttcaggccactgtctct5520
tcattggagtaaatgacgcttcttttagagagaaagagaggggggggggtctgtttataa5580
ttgaatcaagagattttattggtcacctgatttctgttgcatgacgtgggacctggatag5640
acttcagatttgccttagttgataagttcaccggcactagtgaaagaaaagtatatggta5700
caggtactcttatgaacaggcaaccacttagcaattcagcatctaatagagaaggaccaa5760
gtcttcaactaaagtcacaatatacctttagtactattagaggggctgaccctccttgtc5820
tagtgcttgtgagatcataggagatggggcgtggtggttaagattgtggatcatattcgc5880
caagctcccagggtcaatgtgattgagggatgtggtatgctacttgtgaaagggttcaaa5940
aaggccaggatgacattgttcctacattcctggaatggggatgacacccccaggacaagg6000
aaggtggcacagttccacgttcctggtgacatgttgtgtatcaaatgggaggccaaatca6060
ccacgggattactctaggaagggaagatgatgttgataatttgagtcattgttgcaacat6120
ctgttcatggtttcatgcctcattttataagtcatattgcccacacataacattgtaata6180
gtaaaatcaacaccagttattttacgttttcccttgtatgtcaaccgatttcttactgtg6240
tatatgatctgtctatcaataggccatttctttgttgaagatttggaattggtcaatctc6300
atgggttctttggggcttcctgtttcattcagcacaagtaaagtggtcagttgctcacat6360
agtgtgacaccatgttactgttccctaccggttccattgcttaattcctttatgcattgc6420
agaacaagaacacatgcaacaagggaaagaaaaaaggaagacaagcaccgctcaaagcag6480
caaacactcaaatcaatgatgctgtgaggatatgtatcaatactgaagatagagaaaatt6540
ctgttgaatcattggatgctatggagcaaacgcactcatgcaatttatttgtgacaccac6600
tgggtcaaaatgaaccctcccgtgatgacactgacaagaggcttagggaagacagctctt6660
gtgttgaagaacaagaagagtctggctgtagcaccatctactctgctggcaaagcccctg6720
gctgtgatgctaaaaatcatctcactgaacttggggcttttgagctttctgataacttgg6780
ccaactcagcaaaagaagaatactcaattcaagaaaatcaagcttatgaaagtgtgttgc6840
tagattctgaagagatgtcaaggaatgactgtgttgatgatgaatctacacattcctgtg6900
ttggcatttatcaggatgaaagagtgtccacaaggggagatcaaacatctgaagaaactc6960
tatcagtaccccatgattacaatgatgttggcagagaagctagtctaagtttggcagagc7020
catcatctattgatgagcatgcacaaagctctgccaacaacttttactatgactatggtg7080
aatggagggttatctgggatccattctataatcggtattatttttacaacatccagacac7140
aagagtccacatggtgtcctcctgaaggactggaggattttgcatcatattgtagcccag7200
ataccactaaagagctagctgaactgggatctcagtgttcaagcatggcaccacaagaga7260
acagtaaaaaccctagtctcgtcctttgcagttgacattacgaatagttatatgcactac7320
gataaaaactttctacaatatgtaacacttgagcatgtggcaatgggtgtaaacatttaa7380
taataaggtagtgaaatccattacacacagtattgaattttgcactacaaatgctgaagg7440
agaaacctaaattgtcaatgctttttggtgacattaattattgccattgatttcctgctt7500
gtaggtgcttcatttatctgtctccaatttactcatatgctagcttcttgtttgggacta7560
aaggctttgctgttgttttagtatgtcacacatttctctttaatctcaccatcacagatc7620
tggctactcatgtcaatcatttagaagcacaggagcaagatcactgcattcatgatttat7680
ctgacattcctgttgaaaagccaatatatcaaaggtagggaataccaaactgtacaatgt7740
tgaacaagttattgtttttttttgttaattctgttcatctatgcagtatgataactacct7800
ctgacaaagcacagcacactgaaaataagtacagcgattcaacaactactgtgttagaga7860
tgaaccaggaagttgctagcaccaaaacgaaaaagagagtaaggagatctcgatcgtgta7920
aggcgataatatatggcatctgctttctaggagtttgttcctgttacaattttaggttgc7980
gcatttacacaatagtttcttggtttctttgagcaaatgcagctttgcatgactgctaca8040
ttgcctacttatgtctaggtaacttttctttgcaaactgcaaagttatgtctaggtaact8100
atgccttctagaaaacctccttgttagctatgtattagtgagacttgcctaatatttatt8160
ttcttgtggtccgttcttgtgctctttgtacatatttgccaataaccattttaattgttc8220
tacagatcattcatgccaagacatggcagggaacgtctctaatgacatcatcaagtactg8280
ggctcagcggtattcacttttctcactttttgatagtggtataaagatggatgaagtagg8340
gtggttttcagttacgccagagccaattgcaaagcatcatgcatctcgtgtgggtgcggg8400
agtaatgattgattgtttcacaggagttggtggaaatgccatccaatttgccaaaaagta8460
cgtcaatgttatcttgcaattgagttatgtgatggtctaatgtatcatttgcttgaacac8520
ttcctgtttagtagcaactgttatttttcttatgtcacgagaatgcaatggctatatcac8580
cttaagcagtatgctatgtccactgtccagtttaactaaggcatctgcttccagtaatat8640
gcaaggctcttcttacttttgctgttatttaatatatggaagtgtccttacggaggtgtt8700
attgtggacattttgagcatgttcatcatgtcacttgagttagtagagccagccttagtt8760
gtttgcagtgtaggtggatttattttatgttatcaatgtttcttctacagtactaagact8820
attgttccacattaactatgtctccttttccaggtgcaagcatgtaattgcagttgatat8880
tgatccacaaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatca8940
catagatttcattgtaggtgattttatacatatagctcctcatctgaaggtaatgccttt9000
ttcttggaattattacttttaagtttctcaacacgtcacttctattagctatatgttttt9060
gtagctgtttgcgagagtgaatttattgttgacattgttctcatttgcccacccatttta9120
ggataggggcttggtactacaaatatcttgatacttcaagtcctacaaaaagaaatttat9180
gtttcatattttttccatttgaacgtcgagattttatggtcccatggagttctccctatt9240
tttcgatgatgcccatcttttggcagtaccttctttgtgtacacaataaatgggaggata9300
ttttctgcagggagaaactgctttcatgtcgcctccttggggtggccctgactatgccaa9360
agttgatgtttatgatatgaaaagcatgcttattccttgtgatgggttagttccttgttt9420
ctattttaagagagtaatttctttcagtttgcactcactgatgtttacttactttgtgag9480
taaaacgcaccagagatccattaacctttaaggaggtgttatctatgtccatcaacactc9540
aaactgcatttttgggttcctaaactttttaagtgattcaccggagttccgtaccccttc9600
gtttatatttgtattttgcagaaacctcactctgttttatttctccttcgcatgtagggg9660
tggtaatggatcacgattcaaacgtttctccacgattcgtttgagcccttaattaatttt9720
agtacaaaaataaatagaaatagagatagagcctgatcctaatatgatttgatcctcaaa9780
ttttatagcgtagaatttagagcccattaccatttaccacccctattcacatgcctaccc9840
ctctccatcttctggattgaatgttccaacctaatttacactcgtagtttctttgatctg9900
ccaatcaaatccagagcctaattgctataacattagaacgaacacgccatattaccagaa9960
tactcgatgcagatatggatagaagcgaggcgctaagcgcagccagccttggcttcttgc10020
tctgcaggccgatcagggcgccagccaaagccaaccatgcgcgcacgtgactgcaatgct10080
actctctcttcgcctttgccatcgtcgtcgcaggatgttacgttgtgcttatgctggctc10140
ccacgagtgccgccgcccagagcgagctgagcgcacgcagccactgcttgtggttcacga10200
gcgtgagcatgccctcaccacctgctgctgcgcccttgctgcttgcttgcttgccggtgg10260
tgtacattatggacggattaaattgaatggatttacctgttccagaaaaagatctgatcg10320
acgatgggatgctatcttgtatggctccggatcaatgaagattaatggaacaaccaatcg10380
aaggctcagagcaggctagttggtgcccggaagactctggccagaagatggaaatgggta10440
agcgtgtgaaggaaaaaagaaatagagggggatttctacaaaaaacaaacataatgaaga10500
ggtatggatttcaggtgaaccacttaaaaaataaaaagggcatacccagtgccgtaggct10560
tcccgcactgtgcggggtcgtctggggaagggtatctttaagcgtcaagtcttacccgca10620
taatatgcagaggctggggctcgaacccgggacctttcggttatagacggtaggctctac10680
cgccgcaccaagcccgcccttgaaccatttaaaaaatttaggactcaaaaatacagtttg10740
acagttgatggacctagatgacacctccttaaaaaatttaatggacctatggtgcatttt10800
aatcttttgttgatcgactatactcaatgttgaactctttaggtactctctttttaaact10860
cggaaccatgatagcttcaagagtagtgatgtttcttcctcgcaacattgacctaaacca10920
attggcggacatgtccttgtctgtggatcccccgtgggcagttgaggtaagcccattttt10980
gctgattttgtgccaagctgacgtttcctatagatgtcacagtggtctctctctctgcag11040
gtcgagaagaacttcctcaacggaaagctgaaagccataacagcttactttgaagaacag11100
gatcgttgaaccaagcatcggcgctggtgatacaaatcatcttgttagctatgactcacg11160
acaaatttttgtggtgaccctaaacagaacctttgtgttcggagacagaaagaagcggtt11220
tatcatcttcaccgagcatagataatttatttgcagagatgagtcattggtatcatacaa11280
aagcagctcagcttatctcaattcacagcaagtgaaactgtcgaaggaaaactacaaggc11340
tgacagtcgaacgcgtgggagttagcttaattttgccttatgataagcaagcatgcttcc11400
tggtttatttcatacagctactagtagtttcagctgcaacagttgtgcgttggtgtgcgt11460
gtgattctcacatatctggtcctgcggatgtgagtgatgcaaatgtatgtgtcatcatcc11520
catgtaagggtttgtttgtttgctctcaatctatgtagattgagtgggattaagtgagtt11580
taaatctcagacaagtcaaaaaaaaatgttttcaatctcatccaatccacatatgatagt11640
aatacccgagtaaggcttagatgtaatagttggaataagaaaaacaagtcagccattttg11700
aagttttgtccttggagttctattaaaaggcattactgataaatctccaacagatttgca11760
gttgaagcaacatgtgaaacatatttatcatgttaaaacaatttgccttagtattcgatt11820
atgccatgaaatctgacatttccttacacatcccagtttatcattgtcaactgtctttag11880
gaatgtattgtatctgctgtttttacttgtatatgtatgttattttttgtcgttgtatgt11940
atatgttttattataaacatggccactaaggttgttctattcgttaaaataacacagatc12000
tataaacgactaaacaagcttcttgggataaagaatcatatggaggctggattttcgagg12060
agtctggtgcactgttttgctaatgatcagccccccccccccccccctctaaaaataaag12120
aaaatactggatttcctgttcatttattacattcatatgtaaatgcttctgtccttttct12180
atatctgggctggactttttgtgtgctcgtcactcaagttggttagtgtggttaatttta12240
ttatgctccgtgctctttcctaccgaacttggtctttgttagtatcattatcagtcagtt12300
atattttctcctcttgatgcttcatctaatctatttttgcaaagttgtcatgttatgtac12360
tatatgatcttttacaaggtttttgacttttcaaattattgtgtcgtatattatttgtac12420
tcagattgtgcttacaactttagtttatctatactttaaggggtgtttggtttctatggg12480
ctaatttataatcccttcattttattctattttcgtacctaaattgtcaagcacgaaaac12540
gaaaataaagttttaacttttatatttagcagtttatacactaaaatagaataaaataga12600
tgaagtaaaaattagtcctcagaaaccaaacatctcctaaatgtctagtaatagtcgcct12660
gaactgtagagcgcccaacacgcgccaccctgatttggtgtcttaaaatggcatgtgtat12720
ataggtggaatgggtttgacgagactgtaactactttttcttaattaaattatagatgga12780
cttaacttttctatatgcattttaaatatatttttctatatttttggtgggctgagttac12840
agtttatgtcaatataaattacaacagaccgaatctaaaattttattataaaatgtatgc12900
accaaccgttgactaaaaagataaaatttggacacctacattttagcaagtcacctgcta12960
atatatatctatactaggtaagtgtccgtgcgttgcaacgaaaacatataataatacgat13020
aacttatatataaaatatgtgttatactgttatgagaaaaagtttcacctgtcctatttt13080
tatcaatatgacaacagaggatcaatataaggccttggcatggcttcagaagttcagatt13140
aacgaattatggagcaagagcaactatttctggtgtttcagtagaaagaatggggatatg13200
tgtttatctctctcatacatgttacacaatgtgctatagaatgacacctctaggccgctg13260
ctacaactacagagaataaattatggatcatgggtgccctactaagttagctacacgtaa13320
aatctggggcgattgaccccctaatgcttcatcttggagatctcactaaaatacaccttc13380
cgcacaaaccgagccttgataaagctcgcgatcttcgtgtcctgatctttagtgcagaca13440
actgcaagggaaattattgtgccggtcagcaagagtggaaaatgtcagcagaaacacaag13500
aatggaaaattatatgatgtgcaagtaggacggcaccctatttaactaaaggtgtgtttg13560
gttcagttttctgaaccaattcgttgccaaaaaatctaaaatctcacacaaacggtacaa13620
catcagaatagatttttaaaagtttatagatttctcaagttcaattcaaaatcaccatct13680
accccaaatttttcagattactattattcatgttacaactatcactcttgtagtatctac13740
cattgatagttgttttaatcaaatatattttagctttctcacaatcctcagctcgaaaca13800
gattttcatggctcacagttggattcatattttcataaatctataggtgtgaaccaaata13860
gaccctaaaacatcatgttgtaatgttccaaaaaaatcatcacaaaaaacatgtatgcaa13920
cacacctaagtgcaacaacactaacctgcaggagcgttggcgtaggctggactcttgtgt13980
gtctgcaatgtggaaacctgcattgtaaatggttaggtaaatgcttcgcttaaaaatggc14040
agtaaatgcttcactaaaaatgcttccacgtgctcatgatcaagtaggttttatgttcaa14100
tctgcagttctacacaagtgcgattcattttgatactcatttctatttactttcagagct14160
cgtgctaactgtcataaagtgcaatgcatctatgactgccaattgatattgtgctcctgc14220
cataaagtgcactgcatcgtgctaactgtacactaatctgtgaggatctaagaccaatat14280
ttgtttacgttttctcctttagtatcctataaaaacaacacgcctagacaacccaaaaaa14340
tgtgtcccaataggaatatcagattctgacgctggcagcaatctcgaatgataatatttt14400
ttccaaacaagcgaggtccctaaatagaagcggcaacagataaaaactaaagataaaaac14460
taaagagtacagatgattggcatcacatcgggaatgaaatatgcctaacatatcaatttg14520
catattagattatttgctgagaacaataacgaaaacatatttagttgttcatcacaagtt14580
accttagattttgctgttcaaggtcctttgggtcttctttctgctagaacatacaagggt14640
atttcagatttgcaaacaaggaaaagcaagaacttcaatgatacatcattgtaaaaccaa14700
gtttccgatttaaataaagatgatgcttgcggtcaacacattcacaaatgtaaatgtgtg14760
aaatcgttcaaacataaggcttatgtggtcatgctcaggtagtatgtacagacctaaaaa14820
caaggtatatgacaacagtaccagccactaaacacacatggttaatcactaaaacaattc14880
tccgattaaccaggaaaactatagcagccactagaactatacaggtttctaccagtaatt14940
gcttcactaaaaaatgcccccatgtgtaaattttcaggtggtttgtacagacataaaaac15000
aagggtataggactacttcttgtgctaaaataaaagctggcactaaacagtgtatccagt15060
tcatcaggaaaacagttttagtgattaatcactaaaacaatacccatggtgcaaattatc15120
tgattaaccatgtacataccagtaattgcttcataaaaaatgcccccaatttgctcatgt15180
ttaaggaatatgtatagaactaaaaacaagggtatatgataacaataccagccactaaac15240
acacatggttaatcactaaaacaattctctgacaactcctataaaaagatagcaaccacc15300
agaaataaaccgcccacgacatccctaattgtagtcactaaaaactggtagcagtaattg15360
gttcactaaaaaatgaccataatcacgaaaactataacagtcaccagaactatataggtt15420
tcattagtaattgcttcactaaaaaatgtccccatgtgtaaattttcatgtcgtatgtat15480
agacctaaaaacaagggtatatgactacttgctatgctaaaacaatagctagcactaaac15540
agtgtattcagttaatcatccaaacaattttgtgattaatcactaaaacaatacccatgg15600
agcaaattatatgagaataagatctcgtcgttcctattgtgaagaatatactactacctc15660
cagtttcaaattacaatttcaaattacaagttgtttagaacatccacaaggtaattgcga15720
agaatatactaattgctctagtttcaaattagaagttgtttagagaaaaggtgttcctta15780
atagtctagttttagaagctacatccaactggtaaacataaattgcagaaaccttttatg15840
tggaagcctccgtcattgagtctgtcccctttagctgtaagtagttttctaaatattgtt15900
agtcaggcttagttgtttgagactctgtttccattcgtgaccatgggaactgtgaaatgt15960
gtagaagatgctcatgctcatgcatatgcatcgaattgttttgtaaagtcatcttaatgc16020
tcaaacagtttttttatctgccccagctgtacactgctttctgaattatgtcatttaggc16080
ttagctgtccgagataatttcatttgtgatgaaggtaaccggagcatctgtccttttgtt16140
ttaaacataaatattttgatagcttaacttgtgcgtcattttatcatgtactaacatggt16200
atatagatggcacttagcagtaacattcctgacttatgtgattgtcctgttagaatgctt16260
ctgcaatataatgggcttatgctaatctgtttggaatcccatgatgaataacaattatgg16320
atgttgggcattttgtatttttatgatgtaggcttaacatattttcttctctgctcagcc16380
ctgccggtggagacctctatttatagctataatccagccatatctagtgaactaacagat16440
ctattcctatctctagtgcttatccccttaacagatttattttcctatctctatttcttt16500
gatgttacttgctgcaggtgctatcccccactgatgcattatatgacaatcactcgaaga16560
atcagatgctaattaattgttttatacttgaccattgctaattaagtactccatttttt16619
13
17274
dna
zeamays
13
caaccctttccctctctcaaacggtcacttagaccgagtgaggcttcttccttaatctca60
tgggtcacttagaccccgcaaggatcaccacacaattggtgtctcttgcctcgcttacaa120
agcacttgagagtaagaagtgagaaagaaaagaaagccaagccaagcaaacaagagcaac180
aaagaaacacaaatgatcctttaacaagttctaatgcgctagagttgaatcgagaacttt240
gagtggatcgatcacttgaattgtgtctttgcagtggagtctattgctcttgtattgaat300
gcaatgtgttgaatgcttggatggttagagtggaggtggttgggggtatttatagccctc360
aaccaccaaacaactgttggggaggggttgctgtcgatgggcgcaccggacagtccggtg420
cgccagcaacgtcacccaaccgttagggttcgagcgcagacgactgttggagctttgtct480
tcttgtgccaccgaacagtcaggtgccgcaccggacaggcactgttcactgtccggtgcg540
cctctgacggctgctctaacttctgcgcgcactggtcgcacactgtagcgttcgcaggtg600
tccgttgcagtcgaccattgtgctggaagccgttgctccgtttggtgcatcggacagtct660
ggtggcacaccggacagtctggtgaattatagcggagtgcggcctgagaaacccgaaggt720
ggagagttcggagttgtacggtcctggtgcatcggacactgttcggtgcgccagaccagg780
gcacctttggtttctttgttcctttgcttttgaaccctaactttgatcttttattggttt840
gagttgaacctatatgcacctgtagaatataatctagagcaaactagttagtccaattat900
ttgtgttgggcattcaaccaccaaaattatttataggaaaaggttaaaccctatttccct960
ttcatccgggcccttgcggcggaccgtccgcgacaccagggtgagccttggacaggaaca1020
ctgcaaaaacacaagttaacactacggatcgtccgatggagaagcgagcaccgtccgaga1080
ccaagcacggaccgtccggcctcaggcgcgaatcgcccggtcgttgaaaaaccagaaaaa1140
cccgaaggtgacgggttcggtaaaatgcatttttagcgtccttgcggatcgtcctgggtg1200
cacggtcggaccgtccacgactgctttatctgacatttgacgacgcattaaaagctctat1260
agccgttactcctgaccgttgtgatttcagtcgttgatgtgcaggggtacggaccgtccg1320
cggtcggtagaaaatgagcaacgactaggaagtggttggaggctataaatacaaccccaa1380
ccacctccattcaaatgatccaagcactccactcattcacattcaatacaggagctagca1440
atacattccaagacacactcaaagctttcaatctctcaaagtcccacaatttagacaagt1500
gatcattagtgcttagtgacttgagagagtgtgatctatgtgttatttgtcgctcttgtt1560
gcttggctttcacaattgggctttcttcatctctttctcaaccttctaagtgaattataa1620
agcaagcaagagacacctaattttgtggtgatccttgtggggtcttagtgacccgtgtga1680
ttaagaagaagcactcgaccggtctaagtgaccgactgagagagggaaagggttggaata1740
gacccggactttgtggcctccttaacggggactaggttctttggaatcaaacctcggtaa1800
acaaatcgctgtgtttatttgtgttgattttcactcgatttgtttcccctcccttcctct1860
ctctaaaattcccttgctcatattgttgtgagttggctctcaaagttatctgcattgatt1920
gggcaactacttgcaaggataactatattccgcactccgaattatttctgacattaaccc1980
cgggcataatgtgtgttttaagtgtataattttcatgtttcgcctatttacccccctcta2040
ggcgactttcaaatgttctccttcacttgtgatgtctacaaccataatcagctcaacatt2100
tggactatcacccttgaacacttatgttgaactttaaaagttgtgcactaagcacttgtc2160
caacacttaacacacttgtcagtcctttaattgggttgtcatctaaaccaccaaaaacca2220
caaagagatctttcaccggggtccgtggttcatggccgtactgctcggtctcaagatttt2280
tattataaaatcactagagctctactatttatggttcggtgtgccatcgaaccgtctcca2340
acgggtacatccaacgagcgccagcacaccaacgactagttgacgtggtctacggtccag2400
aggctcatcagacttgtcaggaggctcgtcgactggtctggtgccacacccgtcgacttg2460
ggatcgaacaagaatgatggcaagaggacgaagcgcatcaagaagatcatctactacgac2520
tcctcttacccttcacacaaggacgacgattccacctcctccaagaaaaatacggttaaa2580
caaggttactctaagacatcttttatttattctcgcattccttacaatttcaatgctcat2640
ttgctttctattcatcttggcaagcctgctcgctttgatgggaggactattcttggtgga2700
gccataaaatgcgtagccatatttttttgctccaccctagcatttgggatgtcgtagaca2760
atgtaatgcaattgctaggtagcgatgataaaaattataatactattattgcccaataat2820
ctattcataatagcgcccaagctaaagaggtgcgctaagggatcctttttatagcccaaa2880
gaggtccataggcgttgctccttccttctaaacatcgatgaaattgtgttgtctgcgagg2940
acatcaaaccgggtctgtgcacaccttctccgggatctgatttggtgctttccttaactg3000
attcacagcggaccgatccgatgcaccaccggaccgtctgccacatcagctagtcgttgg3060
ccttcaaactccaccgggaagtagccgttggaggcggtccggtgcgccattggaccgtgc3120
aaagtgaagggtcgatgatttttataaataaaatctcgagacctcaaagttcagctctgt3180
aggcggtccggtgcacatggacctggttcgatgcattaggtcgcctaacactaagttcta3240
tgccctgcggacttgatccggtgcaccaccagacgagtccgatgaggcctagaacaaccc3300
aagtaaggctgttttgagcctaacttcttcaaatccttttggctattcttgggagctttc3360
caacaacttagacaaacataattagcacatattccaattgattaggtgtggagaactcac3420
cttttactttgtcgttcaccatgatttgcattttggcttaatctaagtattcgaaccact3480
tttctcacaggatagagttagagttcaaataaagtgctaaacacatagtattagacacat3540
gcaacttatctaagtaatcaaacctcatgattttacccttttgtccaaagctgcacactt3600
tagccttcattttagttctttaggatctagtactttcaaattgacttcaagtgcttgtgc3660
tcgtactcatatcaaattagttagtccatgttgttgtgctaaacacttaatcactaaaac3720
atgtagaaatggttatctaacacatttttctttcataagtaaaacaggagatttatattg3780
tagatgttattgtttgttgatgaaatttgaaataagagatataagagcaactcgaaaagc3840
ctagctaaatcgatttgtatcggtaaaaatagaaaactgatgattaaaataggatccaac3900
aaactctctttgctcctctctatgctatcctgctcagcatcacgtcgaggtctctagcca3960
tatttgctgagctcacctgcctcgccatcttcattctctcgtgcatcctcaccgtccctg4020
cgccttgccgtcgttgcagctcactgtcccacgccctgctgtcgccatgcctcgccgacc4080
ctgcggctcaccttaccgcgcctcgtcatcgtcgcggctcgccgtctcgtgcctcgccaa4140
ccccgctccttgacgtcctcacggcttacagtccccatcgtcctcgtggctccacgcctc4200
gccatatttgcggttcactgtccacgtgccctgccatcctagagcccgacatcgatatgt4260
caaatgagacaggagaaaaagaaatgaacatgtgattataatcagtgatttgaatattga4320
tagataagatttgaagagtctgttgtgtcatatcacctttttatgaaactctttattttt4380
tagagtttttataaaactctaaatttagctaaaattatatctagtcttttagagttactc4440
taacaagagatgagatagcgagctggcgagctgctggagacagccgagagtagagatagt4500
agaggagactagaaactccattaggcctagcccagagtcagctagggatgccgcccgatc4560
cactacgtactgaaagcgatcccggcccatgaacgctagtgggttgtaattcggcttacc4620
caagtacccagccgttcctccacacctctgcactacccgaaacccacgcccaacggccgt4680
ttcccgcaccccctatccgggaaggaagaaagccagaactcacccctgcttcgtctggcg4740
gcgccgtttcccgcacgcgatgccgtcgacggcggaggatttcccggcgataaggaagct4800
ggggaagctcttccggcttaccgaagtgtacctctggtaagtctcctgtctcccctaccc4860
gcttctcagcgtagggtttgccgtttgcgaggagtacgtctcctcaaactactctctctt4920
cctgcagggacgattcgtatggcgctggacctcacgatggacagaagaacgggcgctcgg4980
cggaggctgctctcgtggtacggtctctgacccgctttggtgtagctcactcttaagctt5040
tctgagttgggggtgcgcttgtgcttcgactagtagatggctaatttcgtcgggctactg5100
gtaatttcttggtatctgcattgtcgagaaagaaggcccgacgcataatttcatgcttgc5160
ccaagagtctacttaacacaaggaattggttttgtggcgtggtttgtgcattgcgcccaa5220
actgtagcctgtacaaaatgttaatcgtcgcgtgcattttaaacaaagttttgtattata5280
cgacaaaataccctcggcacatttgttacagactaccgataagtgcatacctatttctcc5340
tagttctatcaggaaataatcctggacctcgaaatgacagcctcgtctggctagaaccct5400
actaaacattttgagtgatcacttttcattactcattttcttgatgaaagcacattactg5460
acatggaagtttgctacataagacataacacttccttgtagtgctttatttaattattga5520
ccgatgatctttttggaaaattaagctgtattaaacaattgtagcttcggtgatgattgt5580
tggattaagcattagtctgctgcagtcctcttcttgattctgatatgacagttatttgtt5640
gattaaaataatgatggtttgctttacacttcgatctcctttgagaggaaaacatgtgaa5700
ggtgtggactagatcatgtatagaccaacagcattatcttattaaaacacttctaaataa5760
cttagcaatttcataaccatttttacacttctgaggaattcatcttgtcgtgaaagagtc5820
aattaacttagctgcttagcagactgtgtcaagcttattacttgtatgttgtgccctaca5880
aattactatgaggtttataatgtacatagcaatttgacgaccttcaacttttcaggactc5940
tcatacagataaaacatgcaatgaagcatccaatagcactgacaaaggttagatgtattt6000
ttcttgtattctagtatcttccttggtcaattttctttacagaggatgttacaatgtact6060
ctactttttttgtgtggaaacaacccactagagaaaaaaaatcacttcatttgagaaatc6120
ttaagaatctgaactctgaagctcagcatgcttccacaccaccacttttcaggccactgt6180
ctcttcattggagtaaatgacgcttcttttagagagaaagagagggggggggggtctgtt6240
tataattgaatcaagagattttattggtcacctgatttctgttgcatgacgtgggacctg6300
gatagacttcagatttgccttagttgataagttcaccggcactagtgaaagaaaagtata6360
tggtacaggtactcttatgaacaggcaaccacttagcaattcagcatctaatagagaagg6420
accaagtcttcaactaaagtcacaatatacctttagtactattagaggggctgaccctcc6480
ttgtctagtgcttgtgagatcataggagatggggcgtggtggttaagattgtggatcata6540
ttcgccaagctcccagggtcaatgtgattgagggatgtggtatgctacttgtgaaagggt6600
tcaaaaaggccaggatgacattgttcctacattcctggaatggggatgacacccccagga6660
caaggaaggtggcacagttccacgttcctggtgacatgttgtgtatcaaatgggaggcca6720
aatcaccacgggattactctaggaagggaagatgatgttgataatttgagtcattgttgc6780
aacatctgttcatggtttcatgcctcattttataagtcatattgcccacacataacattg6840
taatagtaaaatcaacaccagttattttacgttttcccttgtatgtcaaccgatttctta6900
ctgtgtatatgatctgtctatcaataggccatttctttgttgaagatttggaattggtca6960
atctcatgggttctttggggcttcctgtttcattcagcacaagtaaagtggtcagttgct7020
cacatagtgtgacaccatgttactgttccctaccggttccattgcttaattcctttatgc7080
attgcagaacaagaacacatgcaacaagggaaagaaaaaaggaagacaagcaccgctcaa7140
agcagcaaacactcaaatcaatgatgctgtgaggatatgtatcaatactgaagatagaga7200
aaattctgttgaatcattggatgctatggagcaaacgcactcatgcaatttatttgtgac7260
accactgggtcaaaatgaaccctcccgtgatgacactgacaagaggcttagggaagacag7320
ctcttgtgttgaagaacaagaagagtctggctgtagcaccatctactctgctggcaaagc7380
ccctggctgtgatgctaaaaatcatctcactgaacttggggcttttgagctttctgataa7440
cttggccaactcagcaaaagaagaatactcaattcaagaaaatcaagcttatgaaagtgt7500
gttgctagattctgaagagatgtcaaggaatgactgtgttgatgatgaatctacacattc7560
ctgtgttggcatttatcaggatgaaagagtgtccacaaggggagatcaaacatctgaaga7620
aactctatcagtaccccatgattacaatgatgttggcagagaagctagtctaagtttggc7680
agagccatcatctattgatgagcatgcacaaagctctgccaacaacttttactatgacta7740
tggtgaatggagggttatctgggatccattctataatcggtattatttttacaacatcca7800
gacacaagagtccacatggtgtcctcctgaaggactggaggattttgcatcatattgtag7860
cccagataccactaaagagctagctgaactgggatctcagtgttcaagcatggcaccaca7920
agagaacagtaaaaaccctagtctcgtcctttgcagttgacattacgaatagttatatgc7980
actacgataaaaactttctacaatatgtaacacttgagcatgtggcaatgggtgtaaaca8040
tttaataataaggtagtgaaatccattacacacagtattgaattttgcactacaaatgct8100
gaaggagaaacctaaattgtcaatgctttttggtgacattaattattgccattgatttcc8160
tgcttgtaggtgcttcatttatctgtctccaatttactcatatgctagcttcttgtttgg8220
gactaaaggctttgctgttgttttagtatgtcacacatttctctttaatctcaccatcac8280
agatctggctactcatgtcaatcatttagaagcacaggagcaagatcactgcattcatga8340
tttatctgacattcctgttgaaaagccaatatatcaaaggtagggaataccaaactgtac8400
aatgttgaacaagttattgttttttttgttaattctgttcatctatgcagtatgataact8460
acctctgacaaagcacagcacactgaaaataagtacagcgattcaacaactactgtgtta8520
gagatgaaccaggaagttgctagcaccaaaacgaaaaagagagtaaggagatctcgatcg8580
tgtaaggcgataatatatggcatctgctttctaggagtttgttcctgttacaattttagg8640
ttgcgcatttacacaatagtttcttggtttctttgagcaaatgcagctttgcatgactgc8700
tacattgcctacttatgtctaggtaacttttctttgcaaactgcaaagttatgtctaggt8760
aactatgccttctagaaaacctccttgttagctatgtattagtgagacttgcctaatatt8820
tattttcttgtggtccgttcttgtgctctttgtacatatttgccaataaccattttaatt8880
gttctacagatcattcatgccaagacatggcagggaacgtctctaatgacatcatcaagt8940
actgggctcagcggtattcacttttctcactttttgatagtggtataaagatggatgaag9000
tagggtggttttcagttacgccagagccaattgcaaagcatcatgcatctcgtgtgggtg9060
cgggagtaatgattgattgtttcacaggagttggtggaaatgccatccaatttgccaaaa9120
agtacgtcaatgttatcttgcaattgagttatgtgatggtctaatgtatcatttgcttga9180
acacttcctgtttagtagcaactgttatttttcttatgtcacgagaatgcaatggctata9240
tcaccttaagcagtatgctatgtccactgtccagtttaactaaggcatctgcttccagta9300
atatgcaaggctcttcttacttttgctgttatttaatatatggaagtgtccttacggagg9360
tgttattgtggacattttgagcatgttcatcatgtcacttgagttagtagagccagcctt9420
agttgtttgcagtgtaggtggatttattttatgttatcaatgtttcttctacagtactaa9480
gactattgttccacattaactatgtctccttttccaggtgcaagcatgtaattgcagttg9540
atattgatccacaaaagattgattgcgcgcatcataatgcatccatttatggagtaaatg9600
atcacatagatttcattgtaggtgattttatacatatagctcctcatctgaaggtaatgc9660
ctttttcttggaattattacttttaagtttctcaacacgtcacttctattagctatatgt9720
ttttgtagctgtttgcgagagtgaatttattgttgacattgttctcatttgcccacccat9780
tttaggataggggcttggtactacaaatatcttgatacttcaagtcctacaaaaagaaat9840
ttatgtttcatattttttccatttgaacgtcgagattttatggtcccatggagttctccc9900
tatttttcgatgatgcccatcttttggcagtaccttctttgtgtacacaataaatgggag9960
gatattttctgcagggagaaactgctttcatgtcgcctccttggggtggccctgactatg10020
ccaaagttgatgtttatgatatgaaaagcatgcttattccttgtgatgggttagttcctt10080
gtttctattttaagagagtaatttctttcagtttgcactcactgatgtttacttactttg10140
tgagtaaaacgcaccagagatccattaacctttaaggaggtgttatctatgtccatcaac10200
actcaaactgcatttttgggttcctaaactttttaagtgattcaccggagttccgtaccc10260
cttcgtttatatttgtattttgcagaaacctcactctgttttatttctccttcgcatgta10320
ggggtggtaatggatcacgattcaaacgtttctccacgattcgtttgagcccttaattaa10380
ttttagtacaaaaataaatagaaatagagatagagcctgatcctaatatgatttgatcct10440
caaattttatagcgtagaatttagagcccattaccatttaccacccctattcacatgcct10500
acccctctccatcttctggattgaatgttccaacctaatttacactcgtagtttctttga10560
tctgccaatcaaatccagagcctaattgctataacattagaacgaacacgccatattacc10620
agaatactcgatgcagatatggatagaagcgaggcgctaagcgcagccagccttggcttc10680
ttgctctgcaggccgatcagggcgccagccaaagccaaccatgcgcgcacgtgactgcaa10740
tgctactctctcttcgcctttgccatcgtcgtcgcaggatgttacgttgtgcttatgctg10800
gctcccacgagtgccgccgcccagagcgagctgagcgcacgcagccactgcttgtggttc10860
acgagcgtgagcatgccctcaccacctgctgctgcgcccttgctgcttgcttgcttgccg10920
gtggtgtacattatggacggattaaattgaatggatttacctgttccagaaaaagatctg10980
atcgacgatgggatgctatcttgtatggctccggatcaatgaagattaatggaacaacca11040
atcgaaggctcagagcaggctagttggtgcccggaagactctggccagaagatggaaatg11100
ggtaagcgtgtgaaggaaaaaagaaatagagggggatttctacaaaaaacaaacataatg11160
aagaggtatggatttcaggtgaaccacttaaaaataaaaagggcatacccagtgccgtag11220
gcttcccgcactgtgcggggtcgtctggggaagggtatctttaagcgtcaagtcttaccc11280
gcataatatgcagaggctggggctcgaacccgggacctttcggttatagacggtaggctc11340
taccgccgcaccaagcccgcccttgaaccatttaaaaaatttaggactcaaaaatacagt11400
ttgacagttgatggacctagatgacacctccttaaaaattttaatggacctatggtgcat11460
tttaatcttttgttgatcgactatactcaatgttgaactctttaggtactctctttttaa11520
actcggaaccatgatagcttcaagagtagtgatgtttcttcctcgcaacattgacctaaa11580
ccaattggcggacatgtccttgtctgtggatcccccgtgggcagttgaggtaagcccatt11640
tttgctgattttgtgccaagctgacgtttcctatagatgtcacagtggtctctctctctg11700
caggtcgagaagaacttcctcaacggaaagctgaaagccataacagcttactttgaagaa11760
caggatcgttgaaccaagcatcggcgctggtgatacaaatcatcttgttagctatgactc11820
acgacaattttttgtggtgaccctaaacagaacctttgtgttcggagacagaaagaagcg11880
gtttatcatcttcaccgagcatagataatttatttgcagagatgagtcattggtatcata11940
caaaagcagctcagcttatctcaattcacagcaagtgaaactgtcgaaggaaaactacaa12000
ggctgacagtcgaacgcgtgggagttagcttaattttgccttatgataagcaagcatgct12060
tcctggtttatttcatacagctactagtagtttcagctgcaacagttgtgcgttggtgtg12120
cgtgtgattctcacatatctggtcctgcggatgtgagtgatgcaaatgtatgtgtcatca12180
tcccatgtttgtttgtttgctctcaatctatgtagattgagtgggattaagtgagtttaa12240
atctcagacaagtcaaaaaaaaatgttttcaatctcatccaatccacatatgatagtaat12300
acccgagtaaggcttagatgtaatagttggaataagaaaaacaagtcagccattttgaag12360
ttttgtccttggagttctattaaaaggcattactgataaatctccaacagatttgcagtt12420
gaagcaacatgtgaaacatatttatcatgttaaaacaatttgccttagtattcgattatg12480
ccatgaaatctgacatttccttacacatcccagtttatcattgtcaactgtctttaggaa12540
tgtattgtatctgctgtttttacttgtatatgtatgttattttttgtcgttgtatgtata12600
tgttttattataaacatggccactaaggttgttctattcgttaaaataacacagatctat12660
aaacgactaaacaagcttcttgggataaagaatcatatggaggctggattttcgaggagt12720
ctggtgcactgttttgctaatgatcagaccccccccccccctctaaaaataaagaaaata12780
ctggatttcctgttcatttattacattcatatgtaaatgcttctgtccttttctatatct12840
gggctggactttttgtgtgctcgtcactcaagttggttagtgtggttaattttattatgc12900
tccgtgctctttcctaccgaacttggtctttgttagtatcattatcagtcagttatattt12960
tctcctcttgatgcttcatctaatctatttttgcaaagttgtcatgttatgtactatatg13020
atcttttacaaggtttttgacttttcaaattattgtgtcgtatattatttgtactcagat13080
tgtgcttacaactttagtttatctatactttaaggggtgtttggtttctatgagctaatt13140
tataatcccttcattttattctattttcgtacctaaattgtcaagcacgaaaacgaaaat13200
aaagttttaacttttatatttagcagtttatacactaaaatagaataaaatagatgaagt13260
aaaaattagtcctcagaaaccaaacatctcctaaatgtctagtaatagtcgcctgaactg13320
tagagcgcccaacacgcgccaccctgatttggtgtcttaaaatggcatgtgtatataggt13380
ggaatgggtttgacgagactgtaactactttttcttaattaaattatagatggacttaac13440
ttttctatatgcattttaaatatatttttctatatttttggtgggctgagttacagttta13500
tgtcaatataaattacaacagaccgaatctaaaattttattataaaatgtatgcaccaac13560
cgttgactaaaaagataaaatttggacacctacattttagcaagtcacctgctaatatat13620
atctatactaggtaagtgtccgtgcgttgcaacgaaaacatataataatacgataactta13680
tatataaaatatgtgttatactgttatgagaaaaagtttcacctgtcctatttttatcaa13740
tatgacaacagaggatcaatataaggccttggcatggcttcagaagttcagattaacgaa13800
ttatggagcaagagcaactatttctggtgtttcagtagaaagaatggggatatgtgttta13860
tctctctcatacatgttacacaatgtgctatagaatgacacctctaggccgctgctacaa13920
ctacagagaataaattatggatcatgggtgccctactaagttagctacacgtaaaatctg13980
gggcgattgaccccctaatgcttcatcttggagatctcactaaaatacaccttccgcaca14040
aaccgagccttgataaagctcgcgatcttcgtgtcctgatctttagtgcagacaactgca14100
agggaaattattgtgccggtcagcaagagtggaaaatgtcagcagaaacacaagaatgga14160
aaattatatgatgtgcaagtaggacggcaccctatttaactaaaggtgtgtttggttcag14220
ttttctgaaccaattcgttgccaaaaaatctaaaatctcacacaaacggtacaacatcag14280
aatagatttttaaaagtttatagatttctcaagttcaattcaaaatcaccatctacccca14340
aatttttcagattactattattcatgttacaactatcactcttgtagtatctaccattga14400
tagttgttttaatcaaatatattttagctttctcacaatcctcagctcgaaacagatttt14460
catggctcacagttggattcatattttcataaatctataggtgtgaaccaaatagaccct14520
aaaacatcatgttgtaatgttccaaaaaaatcatcacaaaaaacatgtatgcaacacacc14580
taagtgcaacaacactaacctgcaggagcgttggcgtaggctggactcttgtgtgtctgc14640
aatgtggaaacctgcattgtaaatggttaggtaaatgcttcgcttaaaaatggcagtaaa14700
tgcttcactaaaaatgcttccacgtgctcatgatcaagtaggttttatgttcaatctgca14760
gttctacacaagtgcgattcattttgatactcatttctatttactttcagagctcgtgct14820
aactgtcataaagtgcaatgcatctatgactgccaattgatattgtgctcctgccataaa14880
gtgcactgcatcgtgctaactgtacactaatctgtgaggatctaagaccaatatttgttt14940
acgttttctcctttagtatcctataaaaacaacacgcctagacaacccaaaaaatgtgtc15000
ccaataggaatatcagattctgacgctggcagcaatctcgaatgataatattttttccaa15060
acaagcgaggtccctaaatagaagcggcaacagataaaaactaaagataaaaactaaaga15120
gtacagatgattggcatcacatcgggaatgaaatatgcctaacatatcaatttgcatatt15180
agattatttgctgagaacaataacgaaaacatatttagttgttcatcacaagttacctta15240
gattttgctgttcaaggtcctttgggtcttctttctgctagaacatacaagggtatttca15300
gatttgcaaacaaggaaaagcaagaacttcaatgatacatcattgtaaaaccaagtttcc15360
gatttaaataaagatgatgcttgcggtcaacacattcacaaatgtaaatgtgtgaaatcg15420
ttcaaacataaggcttatgtggtcatgctcaggtagtatgtacagacctaaaaacaaggt15480
atatgacaacagtaccagccactaaacacacatggttaatcactaaaacaattctccgat15540
taaccaggaaaactatagcagccactagaactatacaggtttctaccagtaattgcttca15600
ctaaaaaatgcccccatgtgtaaattttcaggtggtttgtacagacataaaaacaagggt15660
ataggactacttcttgtgctaaaataaaagctggcactaaacagtgtatccagttcatca15720
ggaaaacagttttagtgattaatcactaaaacaatacccatggtgcaaattatctgatta15780
accatgtacataccagtaattgcttcataaaaaatgcccccaatttgctcatgtttaagg15840
aatatgtatagaactaaaaacaagggtatatgataacaataccagccactaaacacacat15900
ggttaatcactaaaacaattctctgacaactcctataaaaagatagcaaccaccagaaat15960
aaaccgcccacgacatccctaattgtagtcactaaaaactggtagcagtaattggttcac16020
taaaaaatgaccataatcacgaaaactataacagtcaccagaactatataggtttcatta16080
gtaattgcttcactaaaaaatgtccccatgtgtaaattttcatgtcgtatgtatagacct16140
aaaaacaagggtatatgactacttgctatgctaaaacaatagctagcactaaacagtgta16200
ttcagttaatcatccaaacaattttgtgattaatcactaaaacaatacccatggagcaaa16260
ttatatgagaataagatctcgtcgttcctattgtgaagaatatactactacctccagttt16320
caaattacaatttcaaattacaagttgtttagaacatccacaaggtaattgcgaagaata16380
tactaattgctctagtttcaaattacaagttgtttagagaaaaggtgttccttaatagtc16440
tagttttagaagctacatccaactggtaaacataaattgcagaaaccttttatgtggaag16500
cctccgtcattgagtctgtcccctttagctgtaagtagttttctaaatattgttagtcag16560
gcttagttgtttgagactctgtttccattcgtgaccatgggaactgtgaaatgtgtagaa16620
gatgctcatgctcatgcatatgcatcgaattgttttgtaaagtcatcttaatgctcaaac16680
agtttttttatctgccccagctgtacactgctttctgaattatgtcatttaggcttagct16740
gtccgagataatttcatttgtgatgaaggtaaccggagcatctgtccttttgttttaaac16800
ataaatattttgatagcttaacttgtgcgtcattttatcatgtactaacatggtatatag16860
atggcacttagcagtaacattcctgacttatgtgattgtcctgttagaatgcttctgcaa16920
tataatgggcttatgctaatctgtttggaatcccatgatgaataacaattatggatgttg16980
ggcattttgtatttttatgatgtaggcttaacatattttcttctctgctcagccctgccg17040
gtggagacctctatttatagctataatccagccatatctagtgaactaacagatctattc17100
ctatctctagtgcttatccccttaacagatttattttcctatctctatttctttgatgtt17160
acttgctgcaggtgctatcccccactgatgcattatatgacaatcactcgaagaatcaga17220
tgctaattaattgttttatacttgaccattgctaattaagtactccatttttta17274
14
1767
dna
artificialsequence
cdna
14
atgggttctttggggcttcctgtttcattcagcacaagtaaagtgaacaagaacacatgc60
aacaagggaaagaaaaaaggaagacaagcaccgctcaaagcagcaaacactcaaatcaat120
gatgctgtgaggatatgtatcaatactgaagatagagaaaattctgttgaatcattggat180
gctatggagcaaacgcactcatgcaatttatttgtgacaccactgggtcaaaatgaaccc240
tcccgtgatgacactgacaagaggcttagggaagacagctcttgtgttgaagaacaagaa300
gagtctggctgtagcaccatctactctgctggcaaagcccctggctgtgatgctaaaaat360
catctcactgaacttggggcttttgagctttctgataacttggccaactcagcaaaagaa420
gaatactcaattcaagaaaatcaagcttatgaaagtgtgttgctagattctgaagagatg480
tcaaggaatgactgtgttgatgatgaatctacacattcctgtgttggcatttatcaggat540
gaaagagtgtccacaaggggagatcaaacatctgaagaaactctatcagtaccccatgat600
tacaatgatgttggcagagaagctagtctaagtttggcagagccatcatctattgatgag660
catgcacaaagctctgccaacaacttttactatgactatggtgaatggagggttatctgg720
gatccattctataatcggtattatttttacaacatccagacacaagagtccacatggtgt780
cctcctgaaggactggaggattttgcatcatattgtagcccagataccactaaagagcta840
gctgaactgggatctcagtgttcaagcatggcaccacaagagaacaatctggctactcat900
gtcaatcatttagaagcacaggagcaagatcactgcattcatgatttatctgacattcct960
gttgaaaagccaatatatcaaagtatgataactacctctgacaaagcacagcacactgaa1020
aataagtacagcgattcaacaactactgtgttagagatgaaccaggaagttgctagcacc1080
aaaacgaaaaagagagtaaggagatctcgatcgtatcattcatgccaagacatggcaggg1140
aacgtctctaatgacatcatcaagtactgggctcagcggtattcacttttctcacttttt1200
gatagtggtataaagatggatgaagtagggtggttttcagttacgccagagccaattgca1260
aagcatcatgcatctcgtgtgggtgcgggagtaatgattgattgtttcacaggagttggt1320
ggaaatgccatccaatttgccaaaaagtgcaagcatgtaattgcagttgatattgatcca1380
caaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatcacatagat1440
ttcattgtaggtgattttatacatatagctcctcatctgaagggagaaactgctttcatg1500
tcgcctccttggggtggccctgactatgccaaagttgatgtttatgatatgaaaagcatg1560
cttattccttgtgatgggtactctctttttaaactcggaaccatgatagcttcaagagta1620
gtgatgtttcttcctcgcaacattgacctaaaccaattggcggacatgtccttgtctgtg1680
gatcccccgtgggcagttgaggtcgagaagaacttcctcaacggaaagctgaaagccata1740
acagcttactttgaagaacaggatcgt1767
15
1767
dna
artificialsequence
cdna
15
atgggttctttggggcttcctgtttcattcagcacaagtaaagtgaacaagaacacatgc60
aacaagggaaagaaaaaaggaagacaagcaccgctcaaagcagcaaacactcaaatcaat120
gatgctgtgaggatatgtatcaatactgaagatagagaaaattctgttgaatcattggat180
gctatggagcaaacgcactcatgcaatttatttgtgacaccactgggtcaaaatgaaccc240
tcccgtgatgacactgacaagaggcttagggaagacagctcttgtgttgaagaacaagaa300
gagtctggctgtagcaccatctactctgctggcaaagcccctggctgtgatgctaaaaat360
catctcactgaacttggggcttttgagctttctgataacttggccaactcagcaaaagaa420
gaatactcaattcaagaaaatcaagcttatgaaagtgtgttgctagattctgaagagatg480
tcaaggaatgactgtgttgatgatgaatctacacattcctgtgttggcatttatcaggat540
gaaagagtgtccacaaggggagatcaaacatctgaagaaactctatcagtaccccatgat600
tacaatgatgttggcagagaagctagtctaagtttggcagagccatcatctattgatgag660
catgcacaaagctctgccaacaacttttactatgactatggtgaatggagggttatctgg720
gatccattctataatcggtattatttttacaacatccagacacaagagtccacatggtgt780
cctcctgaaggactggaggattttgcatcatattgtagcccagataccactaaagagcta840
gctgaactgggatctcagtgttcaagcatggcaccacaagagaacaatctggctactcat900
gtcaatcatttagaagcacaggagcaagatcactgcattcatgatttatctgacattcct960
gttgaaaagccaatatatcaaagtatgataactacctctgacaaagcacagcacactgaa1020
aataagtacagcgattcaacaactactgtgttagagatgaaccaggaagttgctagcacc1080
aaaacgaaaaagagagtaaggagatctcgatcgtatcattcatgccaagacatggcaggg1140
aacgtctctaatgacatcatcaagtactgggctcagcggtattcacttttctcacttttt1200
gatagtggtataaagatggatgaagtagggtggttttcagttacgccagagccaattgca1260
aagcatcatgcatctcgtgtgggtgcgggagtaatgattgattgtttcacaggagttggt1320
ggaaatgccatccaatttgccaaaaagtgcaagcatgtaattgcagttgatattgatcca1380
caaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatcacatagat1440
ttcattgtaggtgattttatacatatagctcctcatctgaagggagaaactgctttcatg1500
tcgcctccttggggtggccctgactatgccaaagttgatgtttatgatatgaaaagcatg1560
cttattccttgtgatgggtactctctttttaaactcggaaccatgatagcttcaagagta1620
gtgatgtttcttcctcgcaacattgacctaaaccaattggcggacatgtccttgtctgtg1680
gatcccccgtgggcagttgaggtcgagaagaacttcctcaacggaaagctgaaagccata1740
acagcttactttgaagaacaggatcgt1767
16
434
prt
zeamays
16
metglyserserglugluhisvalpheleuaspprothrargilecys
151015
alaservalserleuleualahisaspleuileglyargmetleuasn
202530
arggluvalserserargproasnalalysgluvalleupropromet
354045
ilehisarggluilevalargpheglytyrcysgluserserserser
505560
lysserserseraspasnserglugluargaspglucysglyileval
65707580
aspalaleuvalthrthrilethrglnilearglysmetaspleuglu
859095
alaargserleuglnproserilelysalaglyleuleualalysleu
100105110
argglutyrlysseraspleuasnasnvallysmetglyleuserala
115120125
gluarglyslysglnlysleusergluileglnserglyvalgluglu
130135140
alagluserleuileglnlysmetaspleuglualaargserleugln
145150155160
proserilelysalaglyleuleualalysproargasptyrlysser
165170175
aspleuasnasnvallyssergluleulysargileseralaproasn
180185190
alaserglyleuilesertyrlyslysleuleuphehisglyleuasp
195200205
leutrpthralaleuserleuproglnproleuglyargalaalaleu
210215220
trpproprohisargthrilehisglnhisleuglncysglnglnleu
225230235240
thrglyvalalaglyserleualatyrleualaprogluvalleuleu
245250255
glyasntyrserglnlysvalaspvaltrpalaalaglyvalleuleu
260265270
hisvalleuleumetglythrleupropheglnglylysserileglu
275280285
alailepheaspvalilelysthralagluleuaspphehisasnser
290295300
glntrpalaservalserleuleualatyraspleuileglyargmet
305310315320
leuasnarggluvalserserargproaspalagluaspvalleuarg
325330335
hisprotrpvalleuphetyrthraspcysleuglnlysalagluphe
340345350
serasnleutrpaspthrasnlysthralaalaprometilehisarg
355360365
gluilevalargpheglytyrcysgluserserserserlysserser
370375380
seraspasnserglugluargaspglucysglyilevalaspalaleu
385390395400
alathrthrilethrglnvalargilesergluprolysargserarg
405410415
leupheserleuproasnglyleuleuproproserargasnserleu
420425430
argthr
17
143
prt
zeamays
17
metleuasnarggluvalserserargproasnalalysgluvalleu
151015
arglysphelyshisprocysasnleucyspheiletyrmetileleu
202530
asnleuserleuthrpheproasnglypheglnhisargalaprotrp
354045
valleuphetyrthraspcysproglnlysalaglupheserasnile
505560
trpaspthrasnlysthralaalaprometilehisarggluileval
65707580
argpheglytyrcysgluserserserserlysserserseraspasn
859095
serglugluargaspglucysglyilevalaspalaleuvalthrthr
100105110
ilethrglnvalargilesergluprolysargserargleupheser
115120125
leuproasnglyleuleuproproserargasnserleuargthr
130135140
18
162
prt
zeamays
18
metgluglyglyarghisproserproproproargileserarggln
151015
proproprotyrproalacysproserileleuproproleupropro
202530
valasnvalthrasnproglyleuvalproleuvalvalalathrleu
354045
pheaspgluargvalthrgluleuleuservalleualaaspalaala
505560
valglyargproglyargtrpserileglyglualaprotrpserser
65707580
serglyglythrasnglnalavaltyralaargargalaproglyser
859095
serserproproproalaproalaserproproleuproserserarg
100105110
alaaspcysleualaargtrpproglyserargalaleuvalalapro
115120125
leuglythrproalaphevalaspargleuphetrpserasppheser
130135140
glyserileargargglugluglualaglualaleuargaspproile
145150155160
argarg
19
87
prt
zeamays
19
metaspleuglualaargserleuglnproserilelysalaglyleu
151015
leualalysproargasptyrlysseraspleuasnasnvallysser
202530
gluleulysargileseralaproasnalaargpheglyargtrpthr
354045
trplysglnglyalatyrasnleualaleuargvalserserarggly
505560
tyrleuargproleuproglyargleuproglyargsersertrpser
65707580
leuglutrpleuileleuser
85
20
279
prt
zeamays
20
metalahispheaspgluleugluasplysthrthrasptyrvalasp
151015
leuservalglngluphealaleulysglnproglncysglymetala
202530
tyrasntyrtyrglyasnleuargleutyrvalvalalaasnlysala
354045
gluleualaserserilephegluileasplysalaserthrlysarg
505560
ileglyalaargphecysargcysleuprohisthrargmetglugly
65707580
glyarghisproserproproproargileserargglnproglnpro
859095
tyrproalacysproserileleuproglnproproprogluarglys
100105110
lysglnlysleusergluileglnserglyvalgluglualagluser
115120125
leuileglnlysmetaspleuglualaargserleuglnproserile
130135140
lysalaserleuleualalysleuargglutyrlysseraspleuasn
145150155160
asnvallyssergluleulysargileseralaproasnalaarggln
165170175
alathrargglugluleuleugluserglymetalaaspthrleuala
180185190
progluglngluglnleualacysalaalaalaalaleualavalgly
195200205
proalatyrgluargleuglnglualaargasnprosergluglngly
210215220
cysasnhisasplysglnilegluglnalatyraspaspileleuasn
225230235240
serserlyshisthrleualasermetmetgluleuglnglualaleu
245250255
leugluserasnglnalathrlysaspalaasnglyilealaalaleu
260265270
tyrilevalleuvalleumet
275
21
428
prt
zeamays
21
metalasertyrserserargargprocysasnthrcysserthrlys
151015
alametalaglyservalvalglygluprovalvalleuglyglnarg
202530
valthrvalleuthrvalaspglyglyglyvalargglyleuilepro
354045
glythrileleualapheleuglualaargleuglngluleuaspgly
505560
proglualaargleualaasptyrpheasptyrilealaglythrser
65707580
thrglyglyleuilethralametleuthralaproglylysasplys
859095
argproleutyralaalalysaspileasnhisphetyrmetglnasn
100105110
cysproargilepheproglnlysserargleualaalaalametser
115120125
alaleuarglysprolystyrasnglylyscysmetargserleuile
130135140
argserileleuglygluthrargvalsergluthrleuthrasnval
145150155160
ileileproalapheaspileargleuleuglnproileilepheser
165170175
thrtyraspalalysserthrproleulysasnalaleuleuserasp
180185190
valcysileglythrseralaalaprothrtyrleuproalahistyr
195200205
pheglnthrgluaspalaasnglylysgluargglutyrasnleuile
210215220
aspglyglyvalalaalaasnasnprothrmetvalalametthrgln
225230235240
ilethrlyslysmetleualaserlysasplysalaglugluleutyr
245250255
provallysproserasncysargargpheleuvalleuserilegly
260265270
thrglyserthrsergluglnglyleutyrthralaargglncysser
275280285
argtrpglyilecysargtrpleuargasnasnglymetalaproile
290295300
ileaspilephemetalaalaserseraspleuvalaspilehisval
305310315320
alaalametpheglnserleuhisseraspglyasptyrleuargile
325330335
glnaspasnserleuargglyalaalaalathrvalaspalaalathr
340345350
progluasnmetargthrleuvalglyileglygluargmetleuala
355360365
glnargvalserargvalasnvalgluthrglyargtyrgluproval
370375380
thrglygluglyserasnalaaspalaleuglyglyleualaarggln
385390395400
leuserglugluargargthrargleualaargargvalseralaile
405410415
asnproargglyserargcysalasertyraspile
420425
22
401
prt
zeamays
22
metalasertyrserserargargprocysasnthrcysserthrlys
151015
alametalaglyservalvalglygluprovalvalleuglyglnarg
202530
valthrvalleuthrvalaspglyglyglyvalargglyleuilepro
354045
glythrileleualapheleuglualaargleuglngluleuaspgly
505560
proglualaargleualaasptyrpheasptyrilealaglythrser
65707580
thrglyglyleuilethralametleuthralaproglylysasplys
859095
argproleutyralaalalysaspileasnhisphetyrmetglnasn
100105110
cysproargilepheproglnlysserargleualaalaalametser
115120125
alaleuarglysprolystyrasnglylyscysmetargserleuile
130135140
argserileleuglygluthrargalalysserthrproleulysasn
145150155160
alaleuleuseraspvalcysileglythrseralaalaprothrtyr
165170175
leuproalahistyrpheglnthrgluaspalaasnglylysgluarg
180185190
glutyrasnleuileaspglyglyvalalaalaasnasnprothrmet
195200205
valalametthrglnilethrlyslysmetleualaserlysasplys
210215220
alaglugluleutyrprovallysproserasncysargargpheleu
225230235240
valleuserileglythrglyserthrsergluglnglyleutyrthr
245250255
alaargglncysserargtrpglyilecysargtrpleuargasnasn
260265270
glymetalaproileileaspilephemetalaalaserseraspleu
275280285
valaspilehisvalalaalametpheglnserleuhisseraspgly
290295300
asptyrleuargileglnaspasnserleuargglyalaalaalathr
305310315320
valaspalaalathrprogluasnmetargthrleuvalglyilegly
325330335
gluargmetleualaglnargvalserargvalasnvalgluthrgly
340345350
argtyrgluprovalthrglygluglyserasnalaaspalaleugly
355360365
glyleualaargglnleuserglugluargargthrargleualaarg
370375380
argvalseralaileasnproargglyserargcysalasertyrasp
385390395400
ile
23
380
prt
zeamays
23
metalasertyrserserargargprocysasnthrcysserthrlys
151015
alametalaglyservalvalglygluprovalvalleuglyglnarg
202530
valthrvalleuthrvalaspglyglyglyvalargglyleuilepro
354045
glythrileleualapheleuglualaargleuglngluleuaspgly
505560
proglualaargleualaasptyrpheasptyrilealaglythrser
65707580
thrglyglyleuilethralametleuthralaproglylysasplys
859095
argproleutyralaalalysaspileasntyrphetyrmetgluasn
100105110
cysproargilepheproglnlysserargleualaalaalametser
115120125
alaleuarglysprolystyrasnglylyscysmetargserleuile
130135140
argserileleuglygluthrargvalsergluthrleuthrasnval
145150155160
ileileproalapheaspileargleuleuglnproileilepheser
165170175
thrtyraspalalysserthrproleulysasnalaleuleuserasp
180185190
valcysileglythrseralaalaprothrtyrleuproalahistyr
195200205
pheglnthrgluaspalaasnglylysgluargglutyrasnleuile
210215220
aspglyglyvalalaalaasnasnprothrmetvalalametthrgln
225230235240
ilethrlyslysmetleualaserlysasplysalaglugluleutyr
245250255
provalasnproserasncysargargpheleuvalleuserilegly
260265270
thrglyserthrsergluglnglyleutyrthralaargglncysser
275280285
argtrpglyilecysargtrpleuargasnasnglymetalaproile
290295300
ileaspilephemetalaalaserseraspleuvalaspilehisval
305310315320
alaalametpheglnserleuhisseraspglyasptyrleuargile
325330335
glnaspasnserleuargglyalaalaalathrvalaspalaalathr
340345350
progluasnmetargthrleuvalglyileglygluargmetleuala
355360365
glnargvalserargvalasnvalgluthrglyser
370375380
24
589
prt
zeamays
24
metglyserleuglyleuprovalserpheserthrserlysvalasn
151015
lysasnthrcysasnlysglylyslyslysglyargglnalaproleu
202530
lysalaalaasnthrglnileasnaspalavalargilecysileasn
354045
thrgluasparggluasnservalgluserleuaspalametglugln
505560
thrhissercysasnleuphevalthrproleuglyglnasnglupro
65707580
serargaspaspthrasplysargleuarggluaspsersercysval
859095
glugluglnglugluserglycysserthriletyrseralaglylys
100105110
alaproglycysaspalalysasnhisleuthrgluleuglyalaphe
115120125
gluleuseraspasnleualaasnseralalysgluglutyrserile
130135140
glngluasnglnalatyrgluservalleuleuaspsergluglumet
145150155160
serargasnaspcysvalaspaspgluserthrhissercysvalgly
165170175
iletyrglnaspgluargvalserthrargglyaspglnthrserglu
180185190
gluthrleuservalprohisasptyrasnaspvalglyarggluala
195200205
serleuserleualagluproserserileaspgluhisalaglnser
210215220
seralaasnasnphetyrtyrasptyrglyglutrpargvaliletrp
225230235240
aspprophetyrasnargtyrtyrphetyrasnileglnthrglnglu
245250255
serthrtrpcysproprogluglyleugluaspphealasertyrcys
260265270
serproaspthrthrlysgluleualagluleuglyserglncysser
275280285
sermetalaproglngluasnasnleualathrhisvalasnhisleu
290295300
glualaglngluglnasphiscysilehisaspleuseraspilepro
305310315320
valglulysproiletyrglnsermetilethrthrserasplysala
325330335
glnhisthrgluasnlystyrseraspserthrthrthrvalleuglu
340345350
metasnglngluvalalaserthrlysthrlyslysargvalargarg
355360365
serargsertyrhissercysglnaspmetalaglyasnvalserasn
370375380
aspileilelystyrtrpalaglnargtyrserleupheserleuphe
385390395400
aspserglyilelysmetaspgluvalglytrppheservalthrpro
405410415
gluproilealalyshishisalaserargvalglyalaglyvalmet
420425430
ileaspcysphethrglyvalglyglyasnalaileglnphealalys
435440445
lyscyslyshisvalilealavalaspileaspproglnlysileasp
450455460
cysalahishisasnalaseriletyrglyvalasnasphisileasp
465470475480
pheilevalglyasppheilehisilealaprohisleulysglyglu
485490495
thralaphemetserproprotrpglyglyproasptyralalysval
500505510
aspvaltyraspmetlyssermetleuileprocysaspglytyrser
515520525
leuphelysleuglythrmetilealaserargvalvalmetpheleu
530535540
proargasnileaspleuasnglnleualaaspmetserleuserval
545550555560
aspproprotrpalavalgluvalglulysasnpheleuasnglylys
565570575
leulysalailethralatyrpheglugluglnasparg
580585
25
589
prt
zeamays
25
metglyserleuglyleuprovalserpheserthrserlysvalasn
151015
lysasnthrcysasnlysglylyslyslysglyargglnalaproleu
202530
lysalaalaasnthrglnileasnaspalavalargilecysileasn
354045
thrgluasparggluasnservalgluserleuaspalametglugln
505560
thrhissercysasnleuphevalthrproleuglyglnasnglupro
65707580
serargaspaspthrasplysargleuarggluaspsersercysval
859095
glugluglnglugluserglycysserthriletyrseralaglylys
100105110
alaproglycysaspalalysasnhisleuthrgluleuglyalaphe
115120125
gluleuseraspasnleualaasnseralalysgluglutyrserile
130135140
glngluasnglnalatyrgluservalleuleuaspsergluglumet
145150155160
serargasnaspcysvalaspaspgluserthrhissercysvalgly
165170175
iletyrglnaspgluargvalserthrargglyaspglnthrserglu
180185190
gluthrleuservalprohisasptyrasnaspvalglyarggluala
195200205
serleuserleualagluproserserileaspgluhisalaglnser
210215220
seralaasnasnphetyrtyrasptyrglyglutrpargvaliletrp
225230235240
aspprophetyrasnargtyrtyrphetyrasnileglnthrglnglu
245250255
serthrtrpcysproprogluglyleugluaspphealasertyrcys
260265270
serproaspthrthrlysgluleualagluleuglyserglncysser
275280285
sermetalaproglngluasnasnleualathrhisvalasnhisleu
290295300
glualaglngluglnasphiscysilehisaspleuseraspilepro
305310315320
valglulysproiletyrglnsermetilethrthrserasplysala
325330335
glnhisthrgluasnlystyrseraspserthrthrthrvalleuglu
340345350
metasnglngluvalalaserthrlysthrlyslysargvalargarg
355360365
serargsertyrhissercysglnaspmetalaglyasnvalserasn
370375380
aspileilelystyrtrpalaglnargtyrserleupheserleuphe
385390395400
aspserglyilelysmetaspgluvalglytrppheservalthrpro
405410415
gluproilealalyshishisalaserargvalglyalaglyvalmet
420425430
ileaspcysphethrglyvalglyglyasnalaileglnphealalys
435440445
lyscyslyshisvalilealavalaspileaspproglnlysileasp
450455460
cysalahishisasnalaseriletyrglyvalasnasphisileasp
465470475480
pheilevalglyasppheilehisilealaprohisleulysglyglu
485490495
thralaphemetserproprotrpglyglyproasptyralalysval
500505510
aspvaltyraspmetlyssermetleuileprocysaspglytyrser
515520525
leuphelysleuglythrmetilealaserargvalvalmetpheleu
530535540
proargasnileaspleuasnglnleualaaspmetserleuserval
545550555560
aspproprotrpalavalgluvalglulysasnpheleuasnglylys
565570575
leulysalailethralatyrpheglugluglnasparg
580585
26
13229
dna
zeamays
misc_feature
(10689)..(10788)
nisa,c,g,ort
26
ccccgcgcggccaaccctctctaagagggccctggtccttccttttatagtcgtaaggag60
tggatccaggtgtacaacgggggtgtagcagagtgctacgtgtctagcgggggagagcta120
gcgccctaagtacatgccgatgtggcagccggagagatcttggcacccagcgagtgtgat180
gtcgtggccatcggaggagcgacggagcctggcggagggacagctgttggagcggttgag240
tccttgctgacgtcctcctgcttccgtaagagagctgagagccgccgtcgtcacagagct300
tgcggggcgccatcattgcctatctggcggagctagccagataggacaccggtcttgttc360
tctgcggcccgagtcggctcggggcagggtgatgatggcgcttcctgttgacgtgactgg420
cctgcgccctaggtcgggcgacgtggaggctcctccgaagccgaggtcgagtctgtcttc480
catggtcgaggccgagcccgagcccctgggtcgggcgaggcggaggtcgttcggcagagg540
ccagggcggagtccgagccctggggtcgggcgaagcggagttcgtcgtcttctggggctg600
agcccgagtccgagccctgggtcgggcggagcggagttcgccgtcttccgggactttagc660
ccgagtccgagccctgggtcgggcggagcggagttcgccgtcttccggggcttagcccga720
gtccgagccctgggtcgggcggagcggagttcgccgtcttccgggacttaccccgagtcc780
gagccctggggtcgggcggagcttcctatggtgcctttggcagggcctgactgcccgtca840
gtctcactctgtcgagtggcactgcagtcggagtggcgcaggcggcgctgtccttctgcc900
aggccggtcagtggagcggcgaagtgacggcggtcacttcggctctgccggagggcgtgt960
gtcaggataaaggtgtcaggccacctttgcgttaaatgctcctgcgattcggtcggtcgg1020
tgcggcgatttagtcagggttgcttcttagcgaaggcaaggcctcgggcgagccggagat1080
gtgtccgccgttggaggggggcctcgggcgagacggaaatcctccggggtcggctgccct1140
tgtccgaggctaggctcgggcgaggcgtgatcgagtcgctcgaatggactgatccttgac1200
ttaatcgcacccatcgggcctttgcagctttatgctgatgggggttaccagctgagaatt1260
aggcgtcttgagggtacccctaattatggtccccgacaaccacaaacgcccacgtcgtgc1320
gcgtggaggtaaggctatctgcattcatcatactttaaactagttgggtgcccgtgcgtt1380
gcaacagatatcatataaattcatgtgttttgctacgcgacacgagggatcggtattatt1440
agtagtcatttttctattggacctagtgaggtactcgtacgttgctacggagatcatata1500
aatccatgtgttttgctacgcgacacgagggatcggtattattagaagtcattttctatt1560
ggacctagtgaggtacctgtatgtgcccgtacgttaccacgagagttaaaacctagtata1620
aaacataaatacagaatgacaaacatcattatgatatacaaattcgtgtaacaaaatatc1680
actgtagcactaatattcaataaaatgtagcaactcactgctcactaatgatgtgtccag1740
ggtaagtgggtaaagcacggtagacgtcttcttttgacttagtacaatgctcgtgtgttg1800
agacggtgcacaattattcgataaaatgtttgagcacaagtgcacaacgattacataaaa1860
ttgaaaaatacttatattaacaaagtctaattgtctcaaattctttttgccaacaaccaa1920
ttcactgtgttgcgatgatacacaataatatttcatgaatgactcaggcaatccacctaa1980
gtgggtaacccaaaagcacaccaatgtgatgcctacacgtgaacatctaacctctattac2040
tacaatagtcttgttggaaatttacaactcatctagaatttcaaaatactcaaaattatt2100
tagactctctctaaaccggagcatacaaaatagatcaggcagttttaacggaaacatgta2160
tgactctcaccatcctgtgcagcgctgctcgaataccagaaagtcccagctcagccaata2220
ctgcatcaagttctgctagttcctttttttcatctcctttttagacaactgtctttctgt2280
ctctttgggtggcgcagctagggcccacaactgtcttgatagcaggttcggctggggcat2340
ctttcggatcatgctttaggttcatcctcagctccatcatcaacctcaaaatcaagatcc2400
ttgtcatcactctcaatgtcctgcaacatatatagaagtacatctaagaacactatcaga2460
gaggctaactataatgacaatcagccaaacatgctccatccccctcgagatgagtaaatt2520
atacttccacttaatagtttagttttgatgcaaaaattatgacaaaaaacaagtagaaca2580
ataaccagagtttgctatgacaaacatgcagcacctgtcatgtataaccatttccaaatc2640
acttgcttacccaagatatagaaaaacttggaacaagtccctatccaaggtacatatttt2700
ttatttttgtttagcaggtggtcagctctgaaaattttctgtcaaggtgaagtttgcctt2760
atgacttgctcatagcagggtctgctttgaactcctccactgggaatgcaatgtcgtgat2820
cccacagaccgatcaaatgggattcaaatttaggctgtgctatatcgcgctctgaatcta2880
gtgataatgcgtctatccttggtttcttgggagaatgttgcatgatctctgggtagtcat2940
ctaagcttggaggtgaatcagaaagtttctgaccctcagaagctttaattgatccaccaa3000
gcttttgttgctcctcaatgatcatctgcaagtatattccttgtgctttaattgttagtt3060
gcagttgtctttgaacctgaaaacataacttatttagacaagcaaaagaaagttggcaag3120
gaaaatttagattgatcttgaagctaaaaaggactcataaactacaagcaataagacaaa3180
acaggagaactgccaacaatagattattgacaactaagcattactacaaccatgagaaga3240
aaccactgttaggtgtaacgtgacagttaataagtgaatattgagaatgggccaaaatat3300
cagcttgtagaagtagccccaactaagtaggacttctggtcatagtaggattgtcggtaa3360
actaaaatgatgtaaaaaacagaaatctctactacttattaagtaagcaatagtagtctg3420
cctccctcgttctgccgttctgccatccatcgatctggaccgttcatatcgcatcgtgcg3480
tctgcgtctcccacactctatctccctccaccgcacgaccacccaatccctaaccttccc3540
ccacacgttcctctctctctcgcgttgcccgccccctgcccctgccgcgatagaatgcct3600
cgcggcctccaccgccgacgcggcctccaccgcttcccttcctccctttgcctccaccgc3660
ccctcccccaacgtcgtcgccaccgccgctctagggcctcgacgcaggcagcttgggaac3720
cgccctcggcctcggtgtcgcacggccccacgaggaccgaccttgccgcatggcggttgt3780
ggaggcgtgcgaccccgcaaggatcgacatctcgtccaagacagcagccccctcagagcg3840
gacgacgcagcttatctctggtggaaggcacgtagccacgcccccactccctcctcgtgc3900
tcccgccgacgcctgcctcgcgctcctcgtcgaccgtctctccctctcctccgtcgttgc3960
atctcgcctctcgcgcacgcacaaaggtgagtctcccctcccccgactcacccctctagg4020
tctctgattcttctcatcggcgagcaaggacggaggagctggcggaggagctgtcgaata4080
taacctaattatgtcggtctcgggtgggatttggctattttgggcgccatggcctaggtg4140
agcgtacagatctggttctgcattttgttttcttatctgtcaatcatgttttttcatatt4200
actgcaggtgggtctgattaaatattacagacatgttttgcttcgaagccctacacctgt4260
ccatctaattttaggaactgacgattgacacagctttgttgtggcgctatttagtataag4320
ttatagaaatcggatctttggctctcatctgcttgtagttgcaacattgaggtagaacct4380
gggtgggtctgattgctcattgctgtgatatatgtctggaaaagcagagttatagtttgg4440
aaactagcgcgtagtacatgtcggtttctttaagtaattgcttatgctctgttgttttga4500
tttccagatgccccaaatgtgctgaacttaagctatcgcgtgagaacacaactttctggt4560
aagtggcaagctcctccgtttcttattttaccgcattggaagttggaacaccatgatagc4620
tctcaggacgaatgagtatcagctaattttattgttttaacaatagagcataaaacctta4680
tgcatattttaaaggttcatgtatcatttagttatggccttatgaggaagcttcatcatc4740
atatctgtagataccttgattcgtggggatggtaattttttgtattcttgttcttttgat4800
ttttgaaggaaacacatcttataatacgatagcacccattggaagtcctcgcatacagtg4860
cagggtagaattatgcttcatgtgttgccccttcaccacgatatgccaaattgaatgtag4920
tttcatcagttatgctcaattatggatgttcagaggcatccaatttcagtgtgtactgca4980
atacttgggtccacctatagttgatagatgttctgtattttgttttcttataaattagat5040
ttggcttgcattatattgttctcttttggaacagagtcactctccgacagctcatcgccg5100
cctgcttcgaccgcctgtacacaaggtttaagaagcgctctgcgaacttccacggccgcg5160
ccgtgctgcttctagctatttacctttttctatgatgcaaatgtttatatgcatactatg5220
ttacttgagaaacattaaagtacttgatgtacctaaacacattttgtttagtgatgtata5280
gtgatatagcattattgtctatattaatatttatattgctggtagcttctactttaattg5340
atctcaatggggcatttgggtggctagcaattcacattgataatttaaaagtgaatttca5400
ggtgtacatttgatggcctccgatatggtgctgccttcaattctctacaatgcgcgagaa5460
tgctgctcaggagggtattaatggctcaacacagatgacctcctcggagtcatgtttcta5520
attatctacactatgattctccttctgttgataaaatattgttttattgtgctgtgagct5580
aatgataacagtgatggtaagtaaatatggtccatgcatattctcatcatagatggctga5640
aaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaagaactc5700
aaggtacatagttttctcaacagaagaatatgtatctgtttgattccagctgaattgctt5760
actaaactcagtgtgtcactttaaatgatatgggatgaagttgggcaagaccaaagtgaa5820
agtgggagaatacccgaagaacttcttgttggacgaacttggagaaaccaatactaaaac5880
tcagtgtcaaccgcttgcaacaggcaatttaggtcgatgtgctcgcacgctgtgtgacca5940
tgtctgagcactccccacccacccaatcgcttcccacgtcatgccgccacgtcgagaatt6000
tgtacacaactcaggttgcccattttactctgttattgaacctcgcttttctgtagaccc6060
aggtatgctagaagtaggtaatggtagcatgaccttagcagagtatcgctcacattttag6120
catttgggttgtcatgaaggtaagttttcactgagattggaccgatgttgtgtcatcata6180
ttttgagtaaagggtttgaccgaagaattttgaaaaataaaagactgtagttcattgaag6240
gtgataactggtatgcaagtattactattaatgattctccacttactgatattacatttg6300
gataagaggaaggagatatcctaattctaatgggttaactgctagcagttccttatgtac6360
tgttatgcttcaggcccctttattaattgcctcatgtggaaagagggtgtgatgatcttt6420
tgttgtttttgtagcgagaagctcaaagaaggggaagacatctacatctatcagtaagtg6480
atccagtttagtagatgaatcaccgctatgggtgtttttctattcttgaataggcatgtg6540
gttctattgatggttgtttgatgtttggtatattgtgttctgactgtaggggtgacaaat6600
ccttttgcagcatggagtgcagagaaaatttcatggtagacgagatggaaggtgagccag6660
agagtatgtataatcagatcaattttagactttttttcactaaccacttatccatagcac6720
acttgtgtttatttgagatatgtgtcgtcattttgtgttcatcggctattgctatgaaga6780
tgaccacagggttaattcttgtttatgagttcatcctctagaatgtattgggttgtcgcc6840
aagaaaaaacacacgaagacataaattcaaataatttctggtgctgtggagtattcataa6900
ttgtgtgcttttgttgaatccttcatgtaccataattcttactgcttgcaagccctttta6960
gaatgtgactttaggatgtagaatttggtgaaaggagacaagaaattattctctaaacta7020
ctatttttataggcaagggaggctggaagcaccttttttgcatgggagctgcatggagag7080
aacttattggtagaaaacaaactttgaccagattcttgaatgtgccaagggaaaaaggta7140
ttaattgggaccttctatttatacaagggaaaaaggtattttttatacaattgctcttcc7200
ttctcatgtatggcaacttcctgttttgtagagattgagcgcccttcctcactgtttgaa7260
ctttactcatcctcatgaaactgtgatgctacctgtaggagcggtttgttcaaggaggga7320
atgacttttagaccctaataacttgtctacacttaagataaaaattgttgtcattgctat7380
ggttacctcctttagtgtcgtaaactcagaataatcatagatccctttgttgcttacaca7440
ttattcatgttctacaagctattgagattttgactagccgctacattttagtgcaggttt7500
tcataatttcttacctttatatctacattttagatttcctcccttttgacactacatttt7560
agccgctcactcggaagctttccatccctgctggagtactcagcccataaaggtaaatgc7620
atgcctttacctgcctgaaatgcattgtctttcttttcaaccgtgaatgtgaattggatc7680
catcattcatgtgcacacacaaggccggtttgaatttggaatgcactgttgttctgcctg7740
tgatccacttggtggtttttatttgcgttcacatattaaaaaatatattataccctgatt7800
ctgagtcaaactttggttagacctggatttgttgattagtttccgttgttgtgatctgtt7860
gaacaaactataggttaggtgagttatcagattcttagatctgtcctgtacggttgtgat7920
ctgttgaacaaactataggttaggtgagttatcagattcttagatctgtcttgtacgtct7980
gatctctaccatgttacaaacatctgaaagttaataataatcactactacattcacacct8040
atcttgtatggatttgctgttgaaattgcagaatgcttcccaacttgtgcatttttatta8100
catccccacaacttcatcgtatagtccagacatcctttgtttgtgtagcaaaatagatgt8160
gcaattgtttcattgtaacaatgtcctgtatattaacctcggcggctctggtgctttctg8220
caggcttctcattcttatttgcatggcctgtaactgctgcacagcgcgctacggctggcg8280
tagcggcagcaactgctggtatgaacaggatccgtccaccctcattagcgacgatgaagt8340
gacacccaattctaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagt8400
gatgagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggccc8460
ggcatgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcat8520
tgcaagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgtt8580
tataactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttaga8640
caaaatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagca8700
ggtgaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaa8760
caaatgttcggtttggacttctctgggtggttctatgatactttataatttcttttggct8820
tatgtgtcatttcagaagatatgaaaatagctaagcattgtcaataatattagcttcctt8880
gtttcttgttaacctaagatgactgccctatttcacttgttttttctagctgtggtagta8940
ggtcattagagtagttcccttcaatcgtagcactcagccatgttgtaaatggtttacttc9000
tattgtgaatcgtgtttttcttttttaatggtggggttagtggaaaatctggacgtatcc9060
gcgaatggtggagaagcacacaaaacaacaattatatgcaaaaactaaactatgttttct9120
ttctttctatggtatatgaatttagtaccttgataatcacaatgattagagtctgtttgt9180
atttatattaaaacataactgcttaggaggtgtatgaggaggataacgagatgtgcagtg9240
cgcgcaacagtaagcagactaccatgacttggttgctcttcaagtgtgcaagtgtgtcta9300
tggattgtatggttctttggtttttgttgttcgctaaggtgcaaagtaggaagaaaatac9360
ggccctgaccctgcaagtggggaatgtatcgtttttgctggaactgaaatatgcggtgtt9420
cttttttatgcactaatattttgttgaatattttgtataccgttgcaacgcacgggcatc9480
tacctagtaaattaatagttttcaataaaagacctcgagtttctcatgtagtcgcttctg9540
aacctccatttacatctttagtgccttaaaaatcgtgccaatgtttccatatatcaaatc9600
acggtgatcaacaattaaacaaaataactcttaaccaaaaaaatggaagtgtcacgcact9660
catgaactacagatgtcaatagctaacatataaacaggctcaagccttccttcaaggata9720
ccaggttgttctaagcagcatctctatatttagagaggaagacttgtctcgattcatctc9780
ttcctgagaccccactcatgtgggtagttctaccttttttatgaatacagatgcaacatc9840
aacatataaacaggctcatgcctcagatactactctcaattggtctactgaaattctatt9900
gcatttatgacgaatgcagttacagatttaaaagtaaaggacgagtacagctgcatgatg9960
aaaagtaaacagaacggtaactgcaaatttacttcagacatgctgctgctttgatgagat10020
cggaacagggttgttgctcacaatctgtggatttaaagcctgcaacacatgtcacaattt10080
aatcatgaggttcactaagacaacattacacaaaagaggtgttggtactgagaacataaa10140
agatcatgtcatccttgtctgaaaggaagtgcattccaactaagaacaaagatcactgct10200
tctcaatgaagcaaggctacgacatgatgcatatttagctcaaattacccaagcaagtgt10260
ttattaaaataaaagcatgcatcttagacatcagcaataggttatctaagacgtcaatgc10320
atatttagcccagttttagacaacaatgcatatttagctaaaatattaaacatgcaatgc10380
ctctttagctcaaatttcagaccacagtattaaccggtggtgctacacttgaagaacaag10440
cttatctggagaagcagcccacatccagtcctgaaggcaaatcagatgtttgaacaaaat10500
cagtgcttagccaaatctagaaaccacaaccactcgaacgctgactcagatggtagagcc10560
atcacaccaatggtgctaccatcagtaaaataatagaatcaatttgtttttccttactga10620
tacagtacccttcgaataattgtttataagcaattgaacttgctgcccaatgtattatag10680
tcttggtcnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn10740
nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnttagagacatat10800
agagtagtatagataaggataatggacagtagtacttgaagttaatacataaaacttgca10860
gctaataaacaaacatagtactattttagttaacatacaacaattttccagctcttcttt10920
gcacttatagcatttttagtacctacacacaacctgaactgaacaccatcaagaacagta10980
ctgcatttggaagtttgaataaaaatggtgtccaattaacagttttctaatattgtaact11040
taattattgaatggtgcaatagggtgttcacttgttgaattggttgataatcttgtgtgg11100
gctttaaatataacattgttagtgaatgctaaataattctgcactcttggaatggtcaag11160
ctgagaatgagccagagattaactggtgcgaacctgggaagcctgcagggatggaggtcc11220
ttcttgggagtaaggatgggttcttgacgaagaagattaagagctatagcacatgccgaa11280
gtgatcctacaaagccacgtgcattgcaatgcctttcaccactgatagcaggtcatgtaa11340
gatacatcatttaaccatctggatatgtgttaatgttagacaactaatttataaaatttt11400
cgattcaggtgggcaatggaattgtttgttgaatgagagtaatgtttattattggtaaag11460
tatcacaattcagtctgattcaggtgggcatggtaaaatgaagtgcatctgccaaaaagt11520
aaggtaagtaactaagtatcttttggatgtttggaggatcagatgttgttgcagaatgtt11580
tattgactaattgcaagtcaaatgaattaaaggggaaaacagataagcactaagattttt11640
atatgaaagctgtgtgagtcaacctgctaagagctctaaatgtcctgtcccagaagtagt11700
ggcggagccagcccaaatttgaagcctatcgactcagaaaattcatccatgccaaccaaa11760
caaatagaataattaccttatctgttgcacgggtgtgaaggagctggagatgcagatcag11820
gtgtcacaaattctataaccaaattgcataatgcataagcaatcaaacaatacaccccta11880
attctttaggctaaacgaactgcactaagcctatgtttcagatttcaagattactattgg11940
attgtgttcatcacaccaataggttaagtaccaacatgctaatcaataagaacagtaaag12000
gggcattacctacaacaaatgttacgtctaaatagaactaacttggaggcaatgtaagct12060
ctgaatgtccgaaacattgaaacaaagtctgaactctgaaattgtctccctaatcctcac12120
ctcatcccatctgtgtcgatggatggtgggtgcctcgcctgagctgttgtgtggtgccgc12180
ggacgaggtgagcgcttgtgtagaatgaagatggggtcgcagagcgacgtggccgacggc12240
ggaccgatgatggagagggtactttttactttcattaacagatcatcaagcaaaataaaa12300
tagtaatgagcaattgagcccagatcacatttgtaatcatctagcatgtaagcactattt12360
tttaatttgatacccctcattcagagtttttgcacaaacttagaactggaggctgaaagt12420
aagcaaattgacttcatcttttagtttgatgaagatcaaatgaatttaaaagcttgagaa12480
gatgaaaagatctcatatgtatgaccaagagatgatagcaaatggcttcaagccaataaa12540
gtcacgagagatgaaagcaaataggttcacatgcaggatacaacagaaagttatgtatac12600
aggaacaaatgctaatagaactatgcatttatcatattttagaaactgtagcccttgttc12660
atcctacccccaatgagtattcctatcagtacacatgtaggctagcatttttttagttgt12720
aatacttgcttagctacagatcactggctagctgaaaaaaacctaagtagcaacagtgga12780
acagtttgagtatagctgtatgaaaaaggctatctgaataatagcaatatttggattcat12840
gttcatttcaagaacttcaccattgggaaaactccaacggtactaccatttgattgctgc12900
aatgtttgtaatgataaagagatctcacttactcacttgtggtaccatagaaacctgcaa12960
aaaagaggaaaataatataaattggtgtaaaaattaaggaattcatagtacaaatatgat13020
cgactttataggaccacccttttgttaaatttttcagtgtcaacaggaaggtgtgattca13080
ggcctttgtggatcaagtcagcacagaaccttggcttcttcattcggtagaatgatgaca13140
agactaccgaatgatgacaagacttttaaaaatatcctgttgaaaatgcatttaccatgt13200
cagctgtgttaccaagacgggtaattact13229
27
1515
dna
artificialsequence
cdna
27
atggcggttgtggaggcgtgcgaccccgcaaggatcgacatctcgtccaagacagcagcc60
ccctcagagcggacgacgcagcttatctctggtggaaggcacgtagccacgcccccactc120
cctcctcgtgctcccgccgacgcctgcctcgcgctcctcgtcgaccgtctctccctctcc180
tccgtcgttgcatctcgcctctcgcgcacgcacaaagatgccccaaatgtgctgaactta240
agctatcgcgtgagaacacaactttctgcttctactttaattgatctcaatggggcattt300
gggtggctagcaattcacattgataatttaaaagtgaatttcaggtgtacatttgatggc360
ctccgatatggtgctgccttcaattctctacaatgcgcgagaatgctgctcaggaggatg420
gctgaaaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaag480
aactcaagcgagaagctcaaagaaggggaagacatctacatctatcagggtgacaaatcc540
ttttgcagcatggagtgcagagaaaatttcatggtagacgagatggaaggcaagggaggc600
tggaagcaccttttttgcatgggagctgcatggagagaacttattggtagaaaacaaact660
ttgaccagattcttgaatgtgccaagggaaaaaggtattaattgggaccttctatttata720
caagggaaaaaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagtgat780
gagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggcccggc840
atgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcattgc900
aagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgtttat960
aactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttagacaa1020
aatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagcaggt1080
gaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaacaa1140
atgttcggtttggacttctctgggtggttctatgatactttataatttcttttggcttat1200
gtgtcatttcagaagatatgaaaatagctaagcattgtcaataatattagcttccttgtt1260
tcttgttaacctaagatgactgccctatttcacttgttttttctagctgtggtagtaggt1320
cattagagtagttcccttcaatcgtagcactcagccatgttgtaaatggtttacttctat1380
tgtgaatcgtgtttttcttttttaatggtggggttagtggaaaatctggacgtatccgcg1440
aatggtggagaagcacacaaaacaacaattatatgcaaaaactaaactatgttttctttc1500
tttctatggtatatg1515
28
2229
dna
artificialsequence
cdna
28
attgatctcaatggggcatttgggtggctagcaattcacattgataatttaaaagtgaat60
ttcaggtgtacatttgatggcctccgatatggtgctgccttcaattctctacaatgcgcg120
agaatgctgctcaggagggtattaatggctcaacacagatgacctcctcggagtcatgtt180
tctaattatctacactatgattctccttctgttgataaaatattgttttattgtgctgtg240
agctaatgataacagtgatggtaagtaaatatggtccatgcatattctcatcatagatgg300
ctgaaaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaaga360
actcaagttgggcaagaccaaagtgaaagtgggagaatacccgaagaacttcttgttgga420
cgaacttggagaaaccaatactaaaactcagtgtcaaccgcttgcaacaggcaatttagg480
tcgatgtgctcgcacgctgtgtgaccatgtctgagcactccccacccacccaatcgcttc540
ccacgtcatgccgccacgtcgagaatttgtacacaactcaggttgcccattttactctgt600
tattgaacctcgcttttctgtagacccaggtatgctagaagtaggtaatggtagcatgac660
cttagcagagtatcgctcacattttagcatttgggttgtcatgaaggtaagttttcactg720
agattggaccgatgttgtgtcatcatattttgagtaaagggtttgaccgaagaattttga780
aaaataaaagactgtagttcattgaaggtgataactggtatgcaagtattactattaatg840
attctccacttactgatattacatttggataagaggaaggagatatcctaattctaatgg900
gttaactgctagcagttccttatgtactgttatgcttcaggcccctttattaattgcctc960
atgtggaaagagggtgtgatgatcttttgttgtttttgtagcgagaagctcaaagaaggg1020
gaagacatctacatctatcagggtgacaaatccttttgcagcatggagtgcagagaaaat1080
ttcatggtagacgagatggaaggcaagggaggctggaagcaccttttttgcatgggagct1140
gcatggagagaacttattggtagaaaacaaactttgaccagattcttgaatgtgccaagg1200
gaaaaaggtattaattgggaccttctatttatacaagggaaaaaggtattttttatacaa1260
ttgctcttccttctcatgtatggcaacttcctgttttgtagagattgagcgcccttcctc1320
actgtttgaactttactcatcctcatgaaactgtgatgctacctgtaggagcggtttgtt1380
caaggagggaatgacttttagaccctaataacttgtctacacttaagataaaaattgttg1440
tcattgctatggttacctcctttagtgtcgtaaactcagaataatcatagatccctttgt1500
tgcttacacattattcatgttctacaagctattgagattttgactagccgctacatttta1560
gtgcaggttttcataatttcttacctttatatctacattttagatttcctcccttttgac1620
actacattttagccgctcactcggaagctttccatccctgctggagtactcagcccataa1680
aggcttctcattcttatttgcatggcctgtaactgctgcacagcgcgctacggctggcgt1740
agcggcagcaactgctggtatgaacaggatccgtccaccctcattagcgacgatgaagtg1800
acacccaattctaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagtg1860
atgagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggcccg1920
gcatgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcatt1980
gcaagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgttt2040
ataactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttagac2100
aaaatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagcag2160
gtgaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaac2220
aaatgttcg2229
29
579
dna
artificialsequence
cdna
29
aagttttcactgagattggaccgatgttgtgtcatcatattttgagtaaagggtttgacc60
gaagaattttgaaaaataaaagactgtagttcattgaaggtgataactggtatgcaagta120
ttactattaatgattctccacttactgatattacatttggataagaggaaggagatatcc180
taattctaatgggttaactgctagcagttccttatgtactgttatgcttcaggccccttt240
attaattgcctcatgtggaaagagggtgtgatgatcttttgttgtttttgtagcgagaag300
ctcaaagaaggggaagacatctacatctatcagggtgacaaatccttttgcagcatggag360
tgcagagaaaatttcatggtagacgagatggaaggcaagggaggctggaagcaccttttt420
tgcatgggagctgcatggagagaacttattggtagaaaacaaactttgaccagattcttg480
aatgtgccaagggaaaaaggtattaattgggaccttctatttatacaagggaaaaaggta540
ttttttatacaattgctcttccttctcatgtatggcaac579
30
2512
dna
zeamays
30
cccgctacctgttcaccgcgcgccagcgaaacctccgcacgcccactgcccatctgttcc60
ccgtgcgccagcgaaacatccgcacgcccgcggcccgcctgttccccgcgcatcccgctg120
cacgacttctgctaccgcaacggccacccacgcacgcccgcctgttcaccgcgcatcccg180
ctgacctccccttcacgctcgcacacgctccgttcccccaccccaccgcaatccccgacg240
ctataagagcggtaaccaactccatctccctggtgccacgcattgttgagttcttaaggt300
gcgtttcgttgaggacttgttcatttttgttggtcatgtattccattttactgctctacc360
attttgtggaataaagggaggaatgttttcactagaagagttcatcaatcttatgttggt420
ttcttggatcagttttgctctatggctaaatggtcgaattgagcctatttcattataaag480
ttagcgagcgaataattgttcagcctcttcctagaactcattaccagtagaatcagttac540
taactgcttttctttttcttggattagaatggctggggctatctctcaccatgcgctagc600
attttcacaatcccactggtgcagtgcgaagaactctagattcggaaagaggacgggcaa660
tgctcgcctggtttatctaaaaggaagatgtggttcaggcagcagaaaactgggtttgat720
gtgggcctcgagctcgcagtcttctgtcatggagccgacgcacctaccatctgatggcaa780
cagcagccacaccccaaaaaaatcaagtaattttaacgacctcctatggtggttatttgt840
ttttaatttgagaaaactatccatttgacacatttaactttgggcttctcagaatttggg900
ggcatataataagatctgctaatctgttatctctatgtcgttgtaggtgaaagcgctctt960
atattgatttggcatggtgaatccctgtggaacgagaaaaatctatttcctggctgcatc1020
gatgtacccctgacaccgaagggtgttgaggaggccattgaggcaggtaaaaggatatgc1080
aatatcccaatcgatgtgatatatacttcatcactgatttgtgctcagatgaccgcaatg1140
cttgccatgatgcagcatcgacgcaagaaggtttgtgtctttcctttgaaattccagtaa1200
tttcttctagcatttgtatgaacttgccggagaaatcatgctttgctggtgatatatgta1260
tttatagatcctagttatcacgcataatgagagtgaacaagctcacaggtggagtcagat1320
atacagtgaggagacaatgaaacagtccattcctgtcatcacagcttggcaattgaatga1380
acggatgtaatactttctccatactctttgatttgctaattactccctctgtctcaaaat1440
agtattaattttagctcttgatttttatgtctatattcaaatagatgatgataaatctag1500
attctagacacaaatataaaacatatacatcaagtattatatgaatctattaatttacta1560
agaccaattttaatttgggacagagggagtatacgattataatagttgtttgactgtgct1620
tctctttaaatatcccttgacatttctaggtatggtgagctacaaggccttaacaagcaa1680
gaaactgtagatcgatttggcaaagaacaagttcatgagtggcgccgcagttatgatatt1740
cctccgccaaatggagaaagtctagagaagtgtgctgagagagctgttgcttatttcaaa1800
gatcaggcacatctagcaaggccactttacactaattgaaagatacactttttacttggg1860
ttattggtcttgctgcagtattggtatgcatgctaaaggttattcttgaatcgatgaatt1920
cctctactatgggatgcagaaatgcatgtgcttagttttctttctattgtgctagctcat1980
atcaaatttataacctgaattttttatttatgttcgactctaaaaaacagttttttctag2040
ctcgatttgacctatagtaatttttccgtaatagattattccacaacttgtggctggaaa2100
acatgtgatggttgctgcacatgggaattcacttcgttcaattataatgcatctggacaa2160
attaacttctcagaaggtaattcactgtcgtttttgtctttccatcaaaaaggactcggc2220
taaacagaacatgtagcattatgttaagtttgggagtgagcctttcgtcccttcaggtaa2280
taagccttgagctgtctactggcattcccatgctttacatattcaaagagggaaagttta2340
ttcgacgtgggactccagtaggaccttcggaggccagtgtttatgcttataccagggtaa2400
gattctttcccccacatgttctaccataggacgatactccagtttacaaaccttatctgt2460
acagaccaaacgatttgctgagcacattacatttcagaacaaattggcctag2512
31
1005
dna
artificialsequence
cdna
31
atggctggggctatctctcaccatgcgctagcattttcacaatcccactggtgcagtgcg60
aagaactctagattcggaaagaggacgggcaatgctcgcctggtttatctaaaaggaaga120
tgtggttcaggcagcagaaaactgggtttgatgtgggcctcgagctcgcagtcttctgtc180
atggagccgacgcacctaccatctgatggcaacagcagccacaccccaaaaaaatcaagt240
gaaagcgctcttatattgatttggcatggtgaatccctgtggaacgagaaaaatctattt300
cctggctgcatcgatgtacccctgacaccgaagggtgttgaggaggccattgaggcaggt360
aaaaggatatgcaatatcccaatcgatgtgatatatacttcatcactgatttgtgctcag420
atgaccgcaatgcttgccatgatgcagcatcgacgcaagaagatcctagttatcacgcat480
aatgagagtgaacaagctcacaggtggagtcagatatacagtgaggagacaatgaaacag540
tccattcctgtcatcacagcttggcaattgaatgaacggatgtatggtgagctacaaggc600
cttaacaagcaagaaactgtagatcgatttggcaaagaacaagttcatgagtggcgccgc660
agttatgatattcctccgccaaatggagaaagtctagagaagtgtgctgagagagctgtt720
gcttatttcaaagatcagattattccacaacttgtggctggaaaacatgtgatggttgct780
gcacatgggaattcacttcgttcaattataatgcatctggacaaattaacttctcagaag840
gtaataagccttgagctgtctactggcattcccatgctttacatattcaaagagggaaag900
tttattcgacgtgggactccagtaggaccttcggaggccagtgtttatgcttataccagg960
accaaacgatttgctgagcacattacatttcagaacaaattggcc1005
32
394
prt
zeamays
32
metalavalvalglualacysaspproalaargileaspileserser
151015
lysthralaalaprosergluargthrthrglnleuileserglygly
202530
arghisvalalathrproproleuproproargalaproalaaspala
354045
cysleualaleuleuvalaspargleuserleuserservalvalala
505560
serargleuserargthrhislysaspalaproasnvalleuasnleu
65707580
sertyrargvalargthrglnleuseralaserthrleuileaspleu
859095
asnglyalapheglytrpleualailehisileaspasnleulysval
100105110
asnpheargcysthrpheaspglyleuargtyrglyalaalapheasn
115120125
serleuglncysalaargmetleuleuargargmetalaglulysleu
130135140
argvalleuleuargtyrglnserleuhisvalhisthrtyrphelys
145150155160
asnserserglulysleulysgluglygluaspiletyriletyrgln
165170175
glyasplysserphecyssermetglucysarggluasnphemetval
180185190
aspglumetgluglylysglyglytrplyshisleuphecysmetgly
195200205
alaalatrparggluleuileglyarglysglnthrleuthrargphe
210215220
leuasnvalproargglulysglyileasntrpaspleuleupheile
225230235240
glnglylyslysalaargthrtyraspglyalaleualaargaspile
245250255
glyleuseraspgluthrleuleuproleuglypheglnalaargleu
260265270
thralaprothrproargproglymetserleuargargtyrleupro
275280285
valleuvallysarglysleulysilevalhiscyslysmetleupro
290295300
serleutyrvalserserthrleuargleuargalaphephevaltyr
305310315320
asnphephecysglyphealapheasnlysleulysargserthrasp
325330335
glnleuargglnasnalaaspalavalgluileleuarglysthrarg
340345350
pheprohisvalhisglyalaglyglulyslysserprogluthrile
355360365
leuasphisgluthrmetargglyleutrpglnglnmetpheglyleu
370375380
asppheserglytrpphetyraspthrleu
385390
33
128
prt
zeamays
33
mettyrcystyralaserglypropheileasncysleumettrplys
151015
gluglyvalmetilephecyscysphecysserglulysleulysglu
202530
glygluaspiletyriletyrglnglyasplysserphecyssermet
354045
glucysarggluasnphemetvalaspglumetgluglylysglygly
505560
trplyshisleuphecysmetglyalaalatrparggluleuilegly
65707580
arglysglnthrleuthrargpheleuasnvalproargglulysgly
859095
ileasntrpaspleuleupheileglnglylyslysvalphepheile
100105110
glnleuleupheleuleumettyrglyasnpheleuphecysargasp
115120125
34
335
prt
zeamays
34
metalaglyalaileserhishisalaleualapheserglnserhis
151015
trpcysseralalysasnserargpheglylysargthrglyasnala
202530
argleuvaltyrleulysglyargcysglyserglyserarglysleu
354045
glyleumettrpalaserserserglnserservalmetgluprothr
505560
hisleuproseraspglyasnserserhisthrprolyslysserser
65707580
gluseralaleuileleuiletrphisglygluserleutrpasnglu
859095
lysasnleupheproglycysileaspvalproleuthrprolysgly
100105110
valgluglualaileglualaglylysargilecysasnileproile
115120125
aspvaliletyrthrserserleuilecysalaglnmetthralamet
130135140
leualametmetglnhisargarglyslysileleuvalilethrhis
145150155160
asnglusergluglnalahisargtrpserglniletyrsergluglu
165170175
thrmetlysglnserileprovalilethralatrpglnleuasnglu
180185190
argmettyrglygluleuglnglyleuasnlysglngluthrvalasp
195200205
argpheglylysgluglnvalhisglutrpargargsertyraspile
210215220
proproproasnglygluserleuglulyscysalagluargalaval
225230235240
alatyrphelysaspglnileileproglnleuvalalaglylyshis
245250255
valmetvalalaalahisglyasnserleuargserileilemethis
260265270
leuasplysleuthrserglnlysvalileserleugluleuserthr
275280285
glyileprometleutyrilephelysgluglylyspheileargarg
290295300
glythrprovalglyproserglualaservaltyralatyrthrarg
305310315320
thrlysargphealagluhisilethrpheglnasnlysleuala
325330335
35
637
dna
artificialsequence
cdna
35
gctacgtgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttg60
gacgagatgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttca120
aagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgagga180
gcaacaaaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaagg240
atcttgattttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagat300
gccccagccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagaca360
gaaagacagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtat420
tggctgagctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatac480
atgtttccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaat540
aattttgagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagta600
atagaggttagatgttcacgtgtaggcatcacattgg637
36
691
dna
artificialsequence
cdna
36
tatagggagagcggccgccagatcttccggatggctcgagtttttcagcaagatgctacg60
tgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttggacgag120
atgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttcaaagaca180
actgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgaggagcaaca240
aaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaaggatcttg300
attttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagatgcccca360
gccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagacagaaaga420
cagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtattggctg480
agctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatacatgttt540
ccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaataatttt600
gagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagtaatagag660
gttagatgttcacgtgtaggcatcacattgg691
37
2146
dna
artificialsequence
cdna
37
atgagaagaatcagagacctagaggggtgagtcgggggaggggagactcacctttgtgcg60
tgcgcgagaggcgagatgcaacgacggaggagagggagagacggtcgacgaggagcgcga120
ggcaggcgtcggcgggagcacgaggagggagtgggggcgtggctacgtgccttccaccag180
agataagctgcgtcgtccgctctgagggggctgctgtcttggacgagatgtcgatccttg240
cggggtcgcacgcctccacaaccgccatgcggcaaggtcggtcctcgtggggccgtgcga300
caccgaggccgagggcggttcccaagctgcctgcgtcgaggccctagagcggcggtggcg360
acgacgttgggggaggggcggtggaggcaaagggaggaagggaagcggtggaggccgcgt420
cggcggtggaggccgcgaggcattctatcgcggcaggggcagggggcgggcaacgcgaga480
gagagaggaacgtgtgggggaaggttagggattgggtggtcgtgcggtggagggagatag540
agtgtgggagacgcagacgcacgatgcgatatgaacggtccagatcgatggatggcagaa600
cggcagaacgagggaggcagactactattgcttacttaataagtagtagagatttctgtt660
ttttacatcattttagtttaccgacaatcctactatgaccagaagtcctacttagttggg720
gctacttctacaagctgatattttggcccattctcaatattcacttattaactgtcacgt780
tacacctaacagtggtttcttctcatggttgtagtaatgcttagttgtcaataatctatt840
gttggcagttctcctgttttgtcttattgcttgtagtttatgagtcctttttagcttcaa900
gatcaatctaaattttccttgccaactttcttttgcttgtctaaataagttatgttttca960
ggttcaaagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcat1020
tgaggagcaacaaaagcttggtggatcaattaaagcttctgagggtcagaaactttctga1080
ttcacctccaagcttagatgactacccagagatcatgcaacattctcccaagaaaccaag1140
gatagacgcattatcactagattcagagcgcgatatagcacagcctaaatttgaatccca1200
tttgatcggtctgtgggatcacgacattgcattcccagtggaggagttcaaagcagaccc1260
tgctatgagcaagtcataaggcaaacttcaccttgacagaaaattttcagagctgaccac1320
ctgctaaacaaaaataaaaaatatgtaccttggatagggacttgttccaagtttttctat1380
atcttgggtaagcaagtgatttggaaatggttatacatgacaggtgctgcatgtttgtca1440
tagcaaactctggttattgttctacttgttttttgtcataatttttgcatcaaaactaaa1500
ctattaagtggaagtataatttactcatctcgagggggatggagcatgtttggctgattg1560
tcattatagttagcctctctgatagtgttcttagatgtacttctatatatgttgcaggac1620
attgagagtgatgacaaggatcttgattttgaggttgatgatggagctgaggatgaacct1680
aaagcatgatccgaaagatgccccagccgaacctgctatcaagacagttgtgggccctag1740
ctgcgccacccaaagagacagaaagacagttgtctaaaaaggagatgaaaaaaaggaact1800
agcagaacttgatgcagtattggctgagctgggactttctggtattcgagcagcgctgca1860
caggatggtgagagtcatacatgtttccgttaaaactgcctgatctattttgtatgctcc1920
ggtttagagagagtctaaataattttgagtattttgaaattctagatgagttgtaaattt1980
ccaacaagactattgtagtaatagaggttagatgttcacgtgtaggcatcacattggtgt2040
gcttttgggttacccacttaggtggattgcctgagtcattcatgaaatattattgtgtat2100
catcgcaacacagtgaattggttgttggcaaaaagaatttgagaca2146
38
637
dna
artificialsequence
cdna
38
gctacgtgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttg60
gacgagatgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttca120
aagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgagga180
gcaacaaaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaagg240
atcttgattttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagat300
gccccagccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagaca360
gaaagacagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtat420
tggctgagctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatac480
atgtttccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaat540
agttttgagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagta600
atagaggttagatgttcacgtgtaggcatcacattgg637
39
55
prt
zeamays
39
metalaargvalpheglnglnaspalathrcysleuproprogluile
151015
sercysvalvalargsergluglyalaalavalleuaspglumetser
202530
ileleualaglyserhisalaserthrthralametargglnglyser
354045
lysthrthralathrasnasn
5055
40
56
prt
zeamays
40
metthrargileleuileleuargleumetmetgluleuargmetasn
151015
leulyshisaspprolysaspalaproalagluproalailelysthr
202530
valvalalaleualaalaproprolysgluthrgluargglnleuser
354045
lyslysglumetlyslysargasn
5055
41
1327
dna
zeamays
41
tattgttgcctcctcctcatctcatcactagtcactcaaccgcaattgattgaaaattgt60
gttcatcatctcgttggatcgatcataattctttcatttctggcctcgacaagtatcgag120
ctcattaatccatcaatccaatgtgtgttctgtcgaaggcgacaatggtgagctacttat180
cgcggcgtccatttaatggctgcagcacaaaggcgatggacgtgatcgtggtcgacaaga240
ccatcgtgccggggggggaggggggtagagggtgacggtgctgatgatggatggcgatgg300
tatccggggtctcatcccggaaaccattcttgccttcctcgaggcgagggtgcaggatct360
ggacaggctggaggcgaggctcgcagactacttcgactacatcgccaggaccggtgggct420
cgtcatcacgctgctcacttcgcccggcaaggacaagcggcctctctacgttgccaagaa480
catcaaccacttgttcttgcatccattcacatcgccctaatcacatcaatgtatagagga540
ctatgatggatggatgcaagaacaatgacgccagatggaattcacttttgagggaagaca600
tgtgggctgttctccatgtagaagtggttgatgtccttggcaacgtagagagaccgcttg660
tttttgccggtcccgatgaacatggcggtgatgatctcatcggtgctattcctagtgatg720
tagtcgaagtagtccgcgagccttgccttcgtcccgtccaactcctacggcatggcctcg780
aggaaggtgaggatggttcccgagatgagaccccagatggcgcctctgtccaccgttatc840
actgtcaccctcggttcaacacgacgggctcatcgaccatgctcccggccatcgccttcg900
tgctgcatgtgttgcatggacgccttgatgagtagctcaccgttgccgcctttgaccaag960
cacacacccaattgatcgattaatgaactagatactcattgaggccacgaatgaaagaat1020
tatcatcaaaccaacaaatgacggacacaattttcaatcgattatggatgagtgtgagta1080
gtgatgagctgaggaagatgcaatgatagatcgattgtgtacatatataggcactgcgta1140
cgtgctgcccctttttggagtgacaaataggaactagcgcgcgtatttttgcatacaacc1200
actactaaataagagatatatgtaaaatttaacgcaagggatatagggaagagatatttg1260
tccattgcaatgtattttgaagctgtccacatatactatttatgaagaaacggattatgc1320
caagtat1327
42
714
dna
zeamays
42
atcattacataacttatgctatattttcccgagtatgtcctaacatcttccacagtgttt60
ttatgggctccttagaagttccagcccaggggcctgaaactattaaagttccaactgctc120
attatgaatttggtgccaattttttagatccaaagttaatgctcattggaagggtgataa180
cagatggaaggcttaatgctcgcgtgaaatgtgatttgacagacaatctcacgctgaaag240
taaatgcacagcttacccaagaggcacattactcacaaggaatgtttaactttgactaca300
aggttgacgtttctgacaagtcagacgtaacgagggcgtccacaccgcggctccgccgga360
catcgcaacaatctccccgccccagctctcctctccctgcgccgaggccacaatccctgc420
cgccccggctctcctcgtccccaaatcttgcacgcggtcgtaatccccgccgcctcgctc480
tcctcgcccctagatcgccgcctccactatcgctgatataccagaccaagcaggtagagc540
agaccaagatgtcgctcgaggaggccaagctggagatggccacgctgctgcagcagcagg600
cgagcaagtcatgcatggtactaagtcctgcatggtactaatggttgtaatgtagtgatg660
aaatagctagattaaaataacaaaatttatgtatggctaggatcacaaatagat714
43
460
dna
zeamays
43
ccagcccaggggcctgaaactattaaagttccaactgctcattatgaatttggtgccaat60
tttttagatccaaagttaatgctcattggaagggtgataacagatggaaggcttaatgct120
cgcgtgaaatgtgatttgacagacaatctcacgctgaaagtaaatgcacagcttacccaa180
gaggcacattactcacaaggaatgtttaactttgactacaaggacgtaacgagggcgtcc240
acaccgcggctccgccggacatcgcaacaatctccccgccccagctctcctctccctgcg300
ccgaggccacaatccctgccgccccggctctcctcgtccccaaatcttgcacgcggtcgt360
aatccccgccgcctcgctctcctcgcccctagatcgccgcctccactatcgctgatatac420
cagaccaagcaggtagagcagaccaagatgtcgctcgagg460
44
192
prt
zeamays
44
metglyserleugluvalproalaglnglyprogluthrilelysval
151015
prothralahistyrglupheglyalaasnpheleuaspprolysleu
202530
metleuileglyargvalilethraspglyargleuasnalaargval
354045
lyscysaspleuthraspasnleuthrleulysvalasnalaglnleu
505560
thrglnglualahistyrserglnglymetpheasnpheasptyrlys
65707580
valaspvalserasplysseraspvalthrargalaserthrproarg
859095
leuargargthrserglnglnserproargproserserproleupro
100105110
alaproargproglnserleuproproargleuserserserproasn
115120125
leualaargglyargasnproargargleualaleuleualaproarg
130135140
serproproproleuserleuiletyrglnthrlysglnvalglugln
145150155160
thrlysmetserleugluglualalysleuglumetalathrleuleu
165170175
glnglnglnalaserlyssercysmetvalleuserproalatrptyr
180185190
45
127
prt
zeamays
45
metleuileglyargvalilethraspglyargleuasnalaargval
151015
lyscysaspleuthraspasnleuthrleulysvalasnalaglnleu
202530
thrglnglualahistyrserglnglymetpheasnpheasptyrlys
354045
aspvalthrargalaserthrproargleuargargthrserglngln
505560
serproargproserserproleuproalaproargproglnserleu
65707580
proproargleuserserserproasnleualaargglyargasnpro
859095
argargleualaleuleualaproargserproproproleuserleu
100105110
iletyrglnthrlysglnvalgluglnthrlysmetserleuglu
115120125
46
2979
dna
helianthusannuus
46
atgaataacatcaatcttgtaatagtttcgcttgtaatcgcgattgtagccatccaaccc60
cttgcgcaagagcaaaccgatgtaggtgaggcaaatttcgtcactgttcttagcatcgat120
ggtgggggtgttcgtggcattgttcccgccaccttgcttgcttttcttgaatccaaaatt180
caggtactcgaacttaaaatgcacatgtgcatcatattacaagctgtaacttattattga240
aatgtgccgtctcttcggataggaaatagatgggccagatgcacgaattgcggattattt300
tgatgtaatagccggaacaagcacaggagggctgatgacaactatgcttgcagctcctaa360
tgagaaaaatcgtcccatgttcgccgcaaaagacattaccaacttctactttcaacattc420
gcctaggatcttccctaaaatagggtaaactctaactagtttccggatctataagatcat480
cattaaatacaagtttcattttctttttcgaatcaaatacagacacacatttgatgaggc540
gcaaccttatccttctcaaaacgaagcctgcgaaatggggtattctcctacaaagacttt600
tgtaattcatgttctagtgggtgtttggatgtgcgttttaaaactgattattatttacat660
gtagtttctgaagaaaataaaacagttattcaaacactttttgttaataattctactaga720
aaaaaaaaatccttgtcaggaaattaattaaaaaaaagttaccatctattaaagttcttt780
cttactaatcaaaagtttttaaattttattatcatgttattataactaaacatacacatc840
caaacactatctcataccacatgattacacaagtctattatttgaatatgctaacttagt900
attttcatataataagtttttaaaacgccacatccaaaacccttgattcttattttacat960
tgtgtagctaaaacagtgtttatacataaaaacaatcagttatataaatcaaagcattat1020
ttaactaaagtaagctcggttcaaactcgataagagaattaatatatacgagtcgagttc1080
ctgttgaccagatttcgctagtgttaagtttcgagttcaaaattgtatatgaacttgaac1140
ctgtgtatcatcatacttgacatttaaaccataatgttgttgaataaataaagtgatttt1200
attttgtagtcggaccaaattcatgaattcggtagtaaccgtacttggtgaggccaccgg1260
accaaagtatgatggtaaatatcttcgagccatggcaaagatgatgttaaaaaacctcac1320
tattaaagatacgttgacgaatgttgtcatacctgctttcgacattaggcggcttcaacc1380
tgttatcttctcctctgctcaagtaattaaactcgttttttatatttatagcagttctct1440
atttaaaattgattgtgtatcataaaatggtttctgtttgatacgtttagggaaaagagg1500
tcgcgtggaaaaatgctttgctagcagacgtatgcattagtaccgcggcggcaccaacat1560
ttttcccgccatactattttgagactagagacgtcgatggaaccaagcacacttttgatc1620
taatcgatggcggggtagctgcaaacaatccggtagttacatttcaacaatattgagttt1680
gcattttatttttaggacaagtagtcacattagggtgaagggtgtgttcaagctcatccc1740
gaaggtgggagcggtgttcccactcgtacctcatggcgcattttcttctttgttgcagct1800
ccaattttaaaaagccaccccgccttttccattccatagcgccacgtcaactgggaaatg1860
gtgttcccactggtattggagatttggaggcgctacgccacctctgtcatcccgaagcca1920
caccctccacccttaggagtgttatcggttcagttttcggtttatacagtttaaaggttt1980
ttttttggttgaaaccaaaaaccgaactgaactgaacggaattcgggtagttcacaactg2040
aaccaaaaactgaatccatattcggttttctgtttgaccgaataatcattatttgttatt2100
tgattgtgcgaggttaaataagattgagcaaaaattgtaattaattggatttggctaaat2160
gacaatttaaataaccgttatttgttatccgattcaaaccgaacacccaactcgatttgg2220
tttaataccgaaccgacaaccggatacgtaattcagttcgctattaaaaggttcgggttt2280
ggtagggtttaacccatttgaaatcgaatcttcagagaaagggttcgctaatgtcaccca2340
cataaccaaagaaatcttgtttaaatgttaatggcagacacatttggctatcacacatat2400
aaccaaagaagcggtgatggggaaatacaggttctctggcccggaggttttcgacggaag2460
acggatgcttgtgctttcactcggcactggtacgcagacgtacaatgacttatatactgc2520
acaaaaggctgcaaaatgggggttgcttagttggatctttaccaatggtactgcgccaat2580
cctccgcatttttggtgatgccatgtcagatatggtcgacatccatgtgtcaactatatt2640
ccaatcgttgcaagtcgaaaaaaactatctgcgtattcaggtataactaagaacatataa2700
atataatgttgtataggttacatgtttagtaacaaggagtttttttatgggcaggaagat2760
aacttgaaaggggaagcaactgcaatggatatttcatcacctgagaacatgagggcgcta2820
gaggacattggcaagaaattgttgaagaaaccgttgtcgagattggatgtggagacaggc2880
aagcttgaaccagttaaaggagaaggtacgaatgctgatgcattagcacgtttcgccact2940
ttgctttgtgccgaacgaaagcgccgcaatccagcttaa2979
47
1302
dna
artificialsequence
cdna
47
atgaataacatcaatcttgtaatagtttcgcttgtaatcgcgattgtagccatccaaccc60
cttgcgcaagagcaaaccgatgtaggtgaggcaaatttcgtcactgttcttagcatcgat120
ggtgggggtgttcgtggcattgttcccgccaccttgcttgcttttcttgaatccaaaatt180
caggaaatagatgggccagatgcacgaattgcggattattttgatgtaatagccggaaca240
agcacaggagggctgatgacaactatgcttgcagctcctaatgagaaaaatcgtcccatg300
ttcgccgcaaaagacattaccaacttctactttcaacattcgcctaggatcttccctaaa360
ataggacacacatttgatgaggcgcaaccttatccttctcaaaacgaagcctgcgaaatg420
ggtcggaccaaattcatgaattcggtagtaaccgtacttggtgaggccaccggaccaaag480
tatgatggtaaatatcttcgagccatggcaaagatgatgttaaaaaacctcactattaaa540
gatacgttgacgaatattgtcatacctgctttcgacatcaggcggcttcaacctgttatc600
ttctcctctgctcaaggaaaagaggtcgcgtggaaaaatgctttgctagcagacgtatgc660
attagtaccgcggcggcaccaacgtttttcccgccatactattttgagactagagatgtc720
gatggaaccaagcacacttttgatctaatcgatggcggggtagctgcaaacaatccgaca780
catttggctatcacacatataaccaaagaagcggtgatggggaaatacaggttctctggc840
ccggaggttttcgacggcagacggatgcttgtgctttcactcggcactggtacgcagacg900
tacaatgacttatacactgcacaaaaggctgcaaaatgggggttgcttagttggatcttt960
accaatggtactgcgccaatcctccgcatttttggtgatgccatgtcagatatggtcgac1020
atccatgtgtcaactatattccaatcgttgcaagtcgaaaaaaactatctgcgtattcag1080
gaagataacttgaaaggggaagcaactgcaatggatatttcatcacccgagaacatgagg1140
gcgctagaggacattggcaagaaattgttgaagaaaccgttgtcgagattggatgtggag1200
acaggcaagcttgaaccagttaaaggagaaggtacgaatgctgatgcattagcacgtttc1260
gccactttgctttgtgccgaacgaaagcgccgcaatccagct1302
48
433
prt
helianthusannuus
48
metasnasnileasnleuvalilevalserleuvalilealaileval
151015
alaileglnproleualaglngluglnthraspvalglyglualaasn
202530
phevalthrvalleuserileaspglyglyglyvalargglyileval
354045
proalathrleuleualapheleugluserlysileglngluileasp
505560
glyproaspalaargilealaasptyrpheaspvalilealaglythr
65707580
serthrglyglyleumetthrthrmetleualaalaproasnglulys
859095
asnargprometphealaalalysaspilethrasnphetyrphegln
100105110
hisserproargilepheprolysileglyhisthrpheaspgluala
115120125
glnprotyrproserglnasnglualacysglumetglyargthrlys
130135140
phemetasnservalvalthrvalleuglyglualathrglyprolys
145150155160
tyraspglylystyrleuargalametalalysmetmetleulysasn
165170175
leuthrilelysaspthrleuthrasnilevalileproalapheasp
180185190
ileargargleuglnprovalilepheserseralaglnglylysglu
195200205
valalatrplysasnalaleuleualaaspvalcysileserthrala
210215220
alaalaprothrphepheproprotyrtyrphegluthrargaspval
225230235240
aspglythrlyshisthrpheaspleuileaspglyglyvalalaala
245250255
asnasnprothrhisleualailethrhisilethrlysglualaval
260265270
metglylystyrargpheserglyprogluvalpheaspglyargarg
275280285
metleuvalleuserleuglythrglythrglnthrtyrasnaspleu
290295300
tyrthralaglnlysalaalalystrpglyleuleusertrpilephe
305310315320
thrasnglythralaproileleuargilepheglyaspalametser
325330335
aspmetvalaspilehisvalserthrilepheglnserleuglnval
340345350
glulysasntyrleuargileglngluaspasnleulysglygluala
355360365
thralametaspileserserprogluasnmetargalaleugluasp
370375380
ileglylyslysleuleulyslysproleuserargleuaspvalglu
385390395400
thrglylysleugluprovallysglygluglythrasnalaaspala
405410415
leualaargphealathrleuleucysalagluarglysargargasn
420425430
pro
49
1795
dna
artificialsequence
tillingmutantd74n
49
agttcatcactaatcacacttattgtgccctcgacgagtatctatagctagctcattaat60
cgattcgggggtgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgc120
aatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctgggg180
cagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaacc240
atcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggcg300
gactacttcaactacatcgccggaaccagcaccggcggtctcatcaccgccatgctcacc360
gcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaaccacttttacatg420
cagaactgcccgcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgc480
atccatccagcatcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtt540
tgtggcatgcatgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctga600
ggaagccaaagtacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgaga660
cgagggtaagcgagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgc720
agcctatcatcttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacg780
tcgtcgcatgcgaatggctgcctacgtacgccgtgcgctaacatactcagctctttccta840
tctgctgcgccaatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacg900
tgtgcattggcacgtccgccgcgccgacctacctcccggcgcactacttccagactgaag960
acgccaacggcaaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacc1020
cggtaactgactagctaactggaaaacggacgcacagactccatgtccatggcggcccac1080
aaggtcgatgctaattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggt1140
tgcgatgacgcagatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgta1200
cccagtgaagccgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgac1260
gtccgagcagggcctctacacggcgcggcagtgctcccggtggggtatctgccggtggct1320
ccgcaacaacggcatggcccccatcatcgacatcttcatggcggccagctcggacctggt1380
ggacatccacgtcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcat1440
ccaggacaactcgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacat1500
gcggacgctcgtcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgt1560
ggagacagggaggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgg1620
gctcgctaggcagctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccat1680
caacccaagaggctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtca1740
gtcacatgcttgtaaataagtagactttattttaataaaacataaaaatatatat1795
50
1284
dna
artificialsequence
tillingmutantd74n
50
atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60
agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120
ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180
gagctggacggaccggaggcgaggctggcggactacttcaactacatcgccggaaccagc240
accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300
gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360
agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420
cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480
atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540
aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600
ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660
tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720
atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780
tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840
ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900
atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960
gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020
ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080
gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140
tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200
ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260
tctagatgtgcgtcgtacgatatc1284
51
428
prt
artificialsequence
tillingmutantd74n
51
metalasertyrserserargargprocysasnthrcysserthrlys
151015
alametalaglyservalvalglygluprovalvalleuglyglnarg
202530
valthrvalleuthrvalaspglyglyglyvalargglyleuilepro
354045
glythrileleualapheleuglualaargleuglngluleuaspgly
505560
proglualaargleualaasptyrpheasntyrilealaglythrser
65707580
thrglyglyleuilethralametleuthralaproglylysasplys
859095
argproleutyralaalalysaspileasnhisphetyrmetglnasn
100105110
cysproargilepheproglnlysserargleualaalaalametser
115120125
alaleuarglysprolystyrasnglylyscysmetargserleuile
130135140
argserileleuglygluthrargvalsergluthrleuthrasnval
145150155160
ileileproalapheaspileargleuleuglnproileilepheser
165170175
thrtyraspalalysserthrproleulysasnalaleuleuserasp
180185190
valcysileglythrseralaalaprothrtyrleuproalahistyr
195200205
pheglnthrgluaspalaasnglylysgluargglutyrasnleuile
210215220
aspglyglyvalalaalaasnasnprothrmetvalalametthrgln
225230235240
ilethrlyslysmetleualaserlysasplysalaglugluleutyr
245250255
provallysproserasncysargargpheleuvalleuserilegly
260265270
thrglyserthrsergluglnglyleutyrthralaargglncysser
275280285
argtrpglyilecysargtrpleuargasnasnglymetalaproile
290295300
ileaspilephemetalaalaserseraspleuvalaspilehisval
305310315320
alaalametpheglnserleuhisseraspglyasptyrleuargile
325330335
glnaspasnserleuargglyalaalaalathrvalaspalaalathr
340345350
progluasnmetargthrleuvalglyileglygluargmetleuala
355360365
glnargvalserargvalasnvalgluthrglyargtyrgluproval
370375380
thrglygluglyserasnalaaspalaleuglyglyleualaarggln
385390395400
leuserglugluargargthrargleualaargargvalseralaile
405410415
asnproargglyserargcysalasertyraspile
420425
52
1795
dna
artificialsequence
tillingmutantg78r
52
agttcatcactaatcacacttattgtgccctcgacgagtatctatagctagctcattaat60
cgattcgggggtgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgc120
aatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctgggg180
cagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaacc240
atcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggcg300
gactacttcgactacatcgccagaaccagcaccggcggtctcatcaccgccatgctcacc360
gcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaaccacttttacatg420
cagaactgcccgcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgc480
atccatccagcatcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtt540
tgtggcatgcatgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctga600
ggaagccaaagtacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgaga660
cgagggtaagcgagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgc720
agcctatcatcttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacg780
tcgtcgcatgcgaatggctgcctacgtacgccgtgcgctaacatactcagctctttccta840
tctgctgcgccaatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacg900
tgtgcattggcacgtccgccgcgccgacctacctcccggcgcactacttccagactgaag960
acgccaacggcaaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacc1020
cggtaactgactagctaactggaaaacggacgcacagactccatgtccatggcggcccac1080
aaggtcgatgctaattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggt1140
tgcgatgacgcagatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgta1200
cccagtgaagccgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgac1260
gtccgagcagggcctctacacggcgcggcagtgctcccggtggggtatctgccggtggct1320
ccgcaacaacggcatggcccccatcatcgacatcttcatggcggccagctcggacctggt1380
ggacatccacgtcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcat1440
ccaggacaactcgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacat1500
gcggacgctcgtcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgt1560
ggagacagggaggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgg1620
gctcgctaggcagctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccat1680
caacccaagaggctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtca1740
gtcacatgcttgtaaataagtagactttattttaataaaacataaaaatatatat1795
53
1284
dna
artificialsequence
tillingmutantg78r
53
atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60
agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120
ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180
gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccagaaccagc240
accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300
gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360
agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420
cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480
atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540
aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600
ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660
tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720
atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780
tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840
ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900
atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960
gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020
ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080
gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140
tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200
ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260
tctagatgtgcgtcgtacgatatc1284
54
428
prt
artificialsequence
tillingmutantg78r
54
metalasertyrserserargargprocysasnthrcysserthrlys
151015
alametalaglyservalvalglygluprovalvalleuglyglnarg
202530
valthrvalleuthrvalaspglyglyglyvalargglyleuilepro
354045
glythrileleualapheleuglualaargleuglngluleuaspgly
505560
proglualaargleualaasptyrpheasptyrilealaargthrser
65707580
thrglyglyleuilethralametleuthralaproglylysasplys
859095
argproleutyralaalalysaspileasnhisphetyrmetglnasn
100105110
cysproargilepheproglnlysserargleualaalaalametser
115120125
alaleuarglysprolystyrasnglylyscysmetargserleuile
130135140
argserileleuglygluthrargvalsergluthrleuthrasnval
145150155160
ileileproalapheaspileargleuleuglnproileilepheser
165170175
thrtyraspalalysserthrproleulysasnalaleuleuserasp
180185190
valcysileglythrseralaalaprothrtyrleuproalahistyr
195200205
pheglnthrgluaspalaasnglylysgluargglutyrasnleuile
210215220
aspglyglyvalalaalaasnasnprothrmetvalalametthrgln
225230235240
ilethrlyslysmetleualaserlysasplysalaglugluleutyr
245250255
provallysproserasncysargargpheleuvalleuserilegly
260265270
thrglyserthrsergluglnglyleutyrthralaargglncysser
275280285
argtrpglyilecysargtrpleuargasnasnglymetalaproile
290295300
ileaspilephemetalaalaserseraspleuvalaspilehisval
305310315320
alaalametpheglnserleuhisseraspglyasptyrleuargile
325330335
glnaspasnserleuargglyalaalaalathrvalaspalaalathr
340345350
progluasnmetargthrleuvalglyileglygluargmetleuala
355360365
glnargvalserargvalasnvalgluthrglyargtyrgluproval
370375380
thrglygluglyserasnalaaspalaleuglyglyleualaarggln
385390395400
leuserglugluargargthrargleualaargargvalseralaile
405410415
asnproargglyserargcysalasertyraspile
420425
55
13516
dna
zeamays
55
tcttgctatatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatc60
ctgtttcaaatctcttctactacttaagtttctaacgtaggcgtcgacaaaacggattgg120
tgcacggttctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttcccc180
gccgccccggatctcgcgccagccccagccctaccccgcctgcccttccattcttcccca240
gccgccccccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggt300
tcgacgagcgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccag360
gcaggtgggccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgt420
acgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctc480
cttcatcgagggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgc540
ttggaacgcctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgca600
gagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctg660
gtaaatagatgccgcgacacgttctggtttggggatccccttggctaacaggacatacga720
catttggggaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagtt780
ttggtgttccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgg840
gggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaatggacctggagg900
caaggagcctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaat960
ctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcagg1020
ctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgat1080
aggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaat1140
cattcaacgtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcata1200
tacaagtgtcattttgtcatgtcattacacatggttaggatacataggagattctgtttt1260
ttaacacatagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatct1320
tatacattaaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgt1380
gtagatatggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgc1440
gaaagcaacctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgca1500
gcgctggcggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgc1560
ccgaatctcggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccag1620
cttggccagcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggct1680
aaaactgtcacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacct1740
ttccaagttttactttagcagagtttttttggaacgagagcaaagcagcacagtttcaag1800
aatgtggggcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagt1860
atgcttcatcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttatt1920
gagaaaataatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcg1980
gtgctacatatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaa2040
ctccagtttgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttg2100
caatcatgataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatac2160
tttggccagcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttc2220
actcctatgattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatg2280
tttctcattgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggct2340
acaaaggatgccaatggtattgctgctctctatattgttcttgttctaatgtaaaaacta2400
caacacaactctttacttgatcccagaaattccttctgcctcaaatggagacaatgacga2460
gtggtcagaagtacagagattgcagacaaggtaaattttgcaatagaaataactaaccaa2520
ccattagtgcttgaaaaaaactggactggtgactggggcacgtggtttcatcaacatttg2580
gacctcaacggtctaatcagtataacttagaagttggctagctcttgaaaaacactgcat2640
gacactaagcatttgtttattttcagctgcttgcacccctatgatttcaagtaactactt2700
gtctacttgtgataatcacctgaatatgagtatttgaaatgcttatcacgtctcggcaat2760
tgcatttcttttatgcgtaactgaagtctgctctagcttcctaatagagttcatttttta2820
atacagaaaccactttgagatagccacaatatagtaaaagtggcagctaaggtactaaaa2880
acacccatgcaaataagaaaaaaaatgaatcttgtattttaattttgttaaatacctcta2940
tagtttggcgatatattatgttaccatcctgcttgtagcctgtaggtcattttatatgag3000
ccatcaaattgcgatgacagttgccacaaatccagtttcatatgaaggtattagctgtgt3060
aacaagctaactgctgctctctgcccaataagttattcaattggattagtaggttgcatc3120
caaggttattcaattggatcagtaggttgcatccaaggtatactgctgctctctgcccaa3180
taagttattcaattcgatcagtaggttgcctgttcccttcattttattaaaaaatacata3240
ataatataataagtacctgtttgttctaaaaataatacttctgtaaatgaggttattaat3300
tttccttttggtaataatgcaggttgatgatactgaagtcatcagttttttgttgcaaac3360
tgaaataatttctctgtgcttgcgaaccatggagatgggtagtgagctatccaaaactgt3420
atgtagctagccatatattctcattcaaatatcataatttatctcttctgcttaatactg3480
gcaaaggtgtaatagtttttttagtattgatttgtcacctgaagtttatcttgtgcacta3540
ctactttgccatcatcagttatctctagaatactcttatcctgtaccatcttctctctga3600
taagcctaaatttgtacaattcataagcctaaaaggtgacttatataatatatacaagga3660
ccctcaagagttgtttggcaattcagtgactgtcctgggtcctgttttggggagcttctg3720
gtagcttttgcttctccaaaagaaaagctagaagctccccccaaacagagcagcttcttc3780
aagccggtaaaagcttcaaaagctataattatactaaaaacagtgaagctccctcagagc3840
agcttcccagctctccaggagatgcttttggagaagctacagtttccccaaacagggccc3900
tgctctgttgaaccccccttcctgatacatatttgaatatgagtttatagtgtgtgtggg3960
ggggtgtaagtaggggggtaatgggttctaaattttatactataaaaattaaggatcaga4020
ttagaattgagctctatttctattcatttttgaactaaaattaattaagggctcaaatga4080
attatgaagaagcattaggatcatgatccattaccacccctacgtgtaagatgttttttg4140
gtggttgtggttgattttgaattttaaggccgcatatgtctcatggactacacaagctca4200
tattcatctacatttgtagccgtcactaacttagccaaatatgcatatgtggcggctgag4260
agcacctagagggggggggggtgaataggtgatcctataaaaacttgaaacataatgcca4320
caaaacttgattaggagttagcacaataaagccaagtgactagagaggagttcttgcaag4380
acacgataaccacacgaagatcaacacagatagacacaatggtttatcccgttgttcggc4440
caagtccaacacttgcctactccacgttgtggcgtcccaacggacgagggttgcaatcaa4500
cccctctcaagcggtccaaggacccacttgaataccacggtgttttgcttagtttcacta4560
tatcccgcttgcgaggaatctacacaacttggagcctctcgcccttacaatttgatgttc4620
acaaagaagcacgaaagtaaggctgggatgagcaacgcacacaagacacaaaatcagagc4680
acaacacgcacacaagtcacaacacgagctcacaacacaacccaaagagttctctactca4740
aatggagctctagttgctatcacaaagaatcgaatacgcggaattggagtcttggtgctt4800
agaaacgcttagagaatgcttggtgtgttcctccatgcgcctaggggtcccttttatagc4860
cccaaggcagctaggagccgttgagagcattccaggaaggcaattcttgccttctgtcgc4920
ctggcgcaccggacagtccggtgcaccaccggacactgtccggtgcggatttctttcctt4980
ctttagcgaagccgaccgttggagattcagagccgttggcgcaccggacagtgtccggtg5040
cacaccggacagtccggtgcccccttctgaccgttggctctgccacgcgtcgcacgcgga5100
ttacgcggccgaccgttggcccggccgactgttggctcaccggatagtccggtgcaccac5160
cggacagtccggtgatttatagccgtacgccgccgacgaaacccgagagcagccagttcg5220
ccagagccagcctggcgcaccagacactgtccggtgcacccagactacgcagtcttggct5280
gcacagccaagtcttttccaaattggtctttttctgtttctagcacttagacacattaca5340
ttagtatccaaaacaatgtactaagtcttagaaacatacctttactcttgatttgcactt5400
tgtccatcatttggcatagattaacacatgaccacttgtgttggcactcaatctccaaaa5460
tacttagaaatggcccaatggcacatttccctttcaatctccccctttttggtgatttat5520
gccaacacaacaaaaagcaactaaaagaagtgcaacatcaatgcaaatgagaacaaaaaa5580
ttgttttgattcaaatttggcatatttggatcattctttgccaccacttggttttgtttt5640
tgcaaatcaacctcaatttcctatctctaagtcaaacacacttgttgaaacataaagaga5700
gttgttccatgagaaattgatcaaagatttcaaaaactcccccttttttccataatcaaa5760
cattctccccacaagagaccaacttttgacagaagagacaataagagaattttgacaaac5820
caaaaagctctattctactattttcaaaattctcaagtggtagctgatccatttattgct5880
ttggccttattttctccccctttggcatcaagcaccagaacgggataaatcttggccctt5940
aaaaccccattgcctcaccaaaatcttcaattaagagtaaaaaggcaataagagcatgaa6000
gatgaacttggagttagttactctttcatcggagtgcagtggaagtctttcatggtccaa6060
gtccaacatttcctttcaatccacctttgagactaaatcaagcaaactcaagcacacagt6120
tagtctcaaggggtcaagttgtagcacaactccccctaaatatgtgcattacttgcaaat6180
ggacttgtgaggtccggggagtgtttgtacaacttgagcaccatacataaacaacaaaat6240
gcataaaggaacatgatcaaggcataaaacacatgtatgctataaatcaatccaagttcc6300
gcgaatctaagacatttagctcactacgcagcctacaaaaggtcttctcatctagaggct6360
tggtaaagatatcggctagctggttctcggtgctaacatgaaacacttcgatatctccct6420
tttgctggtgatctctcaaaaagtgatgccggatgtctatgtgcttagtgcggctgtgtt6480
caacaggattatccgccatgcggatggcactctcattatcacataggagtgggactttgc6540
tcagattgtagccaaagtccttgagggtttgcctcatccaaagtagttgcgcgcaacact6600
gtcctgcggtaacgtactcggcctcagcggtggatagggcaacggaggtttgtttcttag6660
aattccacgacaccagggaccttcctaagaattgacacgtccccgatgtactctttctat6720
cgaccttacatccaacatagtcggagtctgaatatccaatcaagtcaaaggtagacccct6780
ttggataccagatcccgaagcaaggcgtagcgactaaatatctaagaattcgcttcacaa6840
ccactaagtgacactcctttggatcggattgaaatctagcacacatgcatacacttagca6900
taatatctggtctactagcacataaataaagtaaggaccctatcatagaccggtatgcct6960
tttgatcaacggacttacctcctttgttgaggtcggtgtgtccgtcagttcccattgtag7020
tctttgcgggcttggcgttcttcatcccaaactgctttagcagatcttgcgtgtacttcg7080
tttgggagatgaaagtgccgtccttgagttgcttcacttggaacccaaggaagtagttca7140
actcgcccatcatcgacatctcgaatttctgagtcatcaccctgctaaactcttcacaag7200
acttttggttagtagaaccaaatattatgtcatcgacataaatttggcacacaaaaagat7260
caccatcacaagtcttagtgaataaagttggatcggctttcccaaccttgaaagcattag7320
caagtaaaaagtctctaaggcattcataccatgctcttggggcttgcttaagtccataga7380
gggccttagagagcttacacacgtggtcggggtaccgttcatcctcgaagccagggggtt7440
gctccacgtgcacctcctccttgattagcccgttgaggaaagcactcttcacatccattt7500
ggaacaacctgaaggaatggtgagcggcataggctaacaaaatacgaatagactctagcc7560
tagccacaggagcaaaagtctcctcaaagtccaaacctgcgacttgggcatagccttttg7620
ccacaagtctcgccttgttccttgtcaccaccctgtgctcgtcctgtttgttgcggaaca7680
cccacttggttcccacaacgttttgcttgggacgaggcaccagtgtccaaacttcatttc7740
gcttgaagttgatgagctcttcctgcatggccaacacccagtccggatctagcaaggcct7800
cttctaccctgaaaggctcaatagaagagacaaaagagtaatgctcacaaaatttaacta7860
atctagagcgagtagttactcccttgctaatgtcacccaaaatctggtcgacgggatgat7920
tcctttgaatcgtcgctcgaacttgagttgaaggggcttgaggtgcttcttcctccataa7980
catgatcatcttgtgctcccccttgatcacatgcctcctcttgatgaacctgttcatcgt8040
cttgagttgggggatgtaccaatgttgaggaagaaggttgatcttgctccttttgttcct8100
gtggccgcacatctccaatcgtcatggtgcgtattgcggccgttggaatgtcttcttcat8160
ctacatcattaagatcaacaacttgctctcttggagagccattagtctcatcaaatacaa8220
cgtcgctagagacttcaaccaaactcgatgatttgttgaaaaccctatacgcctttgtat8280
ttgagtcataacctaacaaaaacccttctacagctttgggagcaaacttagaatttctac8340
ctttcttcactagaatgtagcatttactcccaaatacacgaaagtatgaaacattgggtt8400
tgttaccggttaggagctcatacgaagtcttcttgaggaggtgatgaaggtagacccggt8460
ttatggcgtggcaagccgtgttcacggcttccgaccaaaatcgctcgggggtcttgaact8520
ctccaagcatcgtcctcgccatgtcaatgagcgtcctatttttcctctctaccacaccat8580
tttgctgtggtgtgtagggagcggagaactcgtgcttgattccttcctcctcaaggtact8640
cttctacttgaagattcttgaactccgacccgttgtcgctccttatcttcttcaccttga8700
gctcaaactcattttgagctctccttaggaagcgcttgagggtcccttgggtttcagatt8760
tatcctgcaaaaagaatacccaagtgaagcgggaaaaatcatcaactataacaagaccat8820
acttacttcctcctaagcttaggtaggcgacgggcctgaagaggtccatatgtagcaact8880
ccaaaggtcttgatgtggtcatcacatttttggtatgatgagagcttcccacttgtttac8940
ctgcttgacaagctgcataaggtctatctttttcgaaagtaacatttgttagtcctatca9000
cgtgttctccctttagaagcttgtgaaggttcttcatccccacatgtgctaagcggcgat9060
gccacagccagcccatgctagtcttagcaattaagcatgcatctagaccggcctcctctt9120
ttgcaaaatcaactaagtagagtttgccgtctaatacacccttaaaagctaatgaaccat9180
cactccttctaaagacggacacatctacatttgtgaataagcaattatatcccatattac9240
aaagctaactcacagacaataagttatatccaagagactcaactaaaaacacattagaaa9300
tagagtgctcggatgaaatggcaatcttccctagtcctttgaccttgccttgattcccgt9360
caccgaatatgattgagtcttgggaatccttgttcttgacgtaggaggtgaacatcttct9420
tctcccccgtcatgtggtttgtgcatccgctgtcgataatccagcttgatcccccggatg9480
cataaacctgcaaggcaaattaggcttgggtcttaggtacccaactcttgttgggtccta9540
caaggttagtacaaatagccttagggacccaaatgcaagttttatctcccttgcattttg9600
cccctaattttctagcaaccaccttcttatcctttctacaaatatcaaaggaagcattta9660
aagcatgataaattgtagaaggttcattacttgttttcctaggtacatgagcatttctcc9720
taggcacatgatgaatgatatttttcctagccaaatttctatcatgcataatagaagaac9780
ttgaagcaaacattgcatttgaatcataagcatgtgaaatgacatcattgcaacttctat9840
catgatgaacattcctggaatatctcctatcatggtataagaaagcatggttcttttgaa9900
tactatttgccataggggccttccctttctccttgatggagataggagccttatgacttg9960
tcaagttcttggcttccctcttgaagccaagcccatccttaattgaggggtgtctaccaa10020
ccgtgtaggcatcccttgcaaattttagtttatcaaaatcatttttgctagtcttaagtt10080
gagcattaagactagccacttcatcttttagtttagaaatgcaaactaggtgttcactac10140
aagcatcaacattgaaatctttacacctattgcaaatcgtaacatgttcttcacgagagg10200
ttaatttactagctatttctaacttagcactcaaatcatcattaacactttttaggctag10260
agatagattcatggcatgtagacaattcacatgaaagcatttcatttcttttaatttcta10320
aagcaagagaattttgtgcttctacaaacttatcatgttcttcatacaaaagatcctctt10380
gcttttctaataatctattcttatcattcaaggcatcaatcaactcattgatcttatcaa10440
tcttagttctatctaagcccttgaacaaactagcatagtctatttcatcatcgctagatt10500
catcatcactagaagcataagtagactttcgagtgtttaccttcttctcctttgccatta10560
agcatgtgtgatgctcgttgggggaagaggaacgacttgttgaaggccgaggcgacgagt10620
ccttcgttgtcggagtcggacgacgaacaatccgagtcccactccttgccaaggtgtgcc10680
tcacccttagccttcttataagccttcttcttttccctcttcttctcttgttcctggtca10740
ctatcattatcgggacaattagcgataaatgaccaatcttaccacatttgaagcatgagc10800
gcttccccttcgtcttgttgggatgctccttgcgaccctttagtgctgtcttgaatcgct10860
tgatgatgagggccatttcttcttcattaagcccgaccgcctcaacttgcgccaccttgc10920
taggtagcgcctccttgctcctcgttgctttgagagccacagtttgaggctcgtggattg10980
ggccattcaacgcatcatcgacgtatcttgcctccttgatcatcatccgcccgcttacga11040
actttccaagtatttcttcgggcgacatcttggtgtacctaggattttcacgaatattgt11100
ttacaagatgtggatcaaggatagtgaaggaccttagcattaggcggacgacgtcgtgat11160
ccgtccatcgcgtgcttccatagcttcttattttgttgacgagggtcttgagccggttgt11220
acgtttgggttggctcttctcccctgatcattgcgaatctcccaagttcgccctccacca11280
actccatcttggtgagcatggtgacatcgttcccctcatgtgagatcttgagggtgtccc11340
agatctgcttggcgttatccaagccgctcaccttatggtattcatccctgcacaatgatg11400
ctagaagaacagtagtagcttgtgcatttttgtgaatttgctcattgataaatatgggac11460
tatcagtactatcaaattgcattccactctctactatctcccatatgcttggatggagag11520
agaacaagtgactacgcattttgtgactccaaaatccgtagtcctctccatcaaagtgag11580
gaggtttaccaagtggaatggagagtaaatgagcatttgtactttgcggaatacgagaat11640
aatcaaaagaaaagtttgaattgactgttttctttttctcgtagttgtcgtcgtcctttt11700
gggaagaagaggactcgtcgctgtcgtcgtagtagacgatctccttgatgcaccttgttt11760
tcttcttcttcctgtcttttcttttgtggctcgagcccgagtcagtaggcttgtcatctt11820
ttggatcattgacgaaggactccttctccttatcattgaccaccatccccttgcccttag11880
gatccatctcttcgggcgattagtcccttacgtgaagagaacgactcagataccaattga11940
gagcacctagaggggggtgaataggtgatcctataaaaacttgaaacttaatgccacaaa12000
acttgattaggagttagcacaataaagccaagtgactagagagttcttgcaagacacgat12060
aaccacacaaagatcaacacagatagacacagtggtttatctcgtggttcggccaagtcc12120
aacacttgcctactccacgttgtggcgtcccaacggacgagggttgcaatcaacccctct12180
caagcggtccaaggacccacttgaataccacagtgttttgcttagtttcactatatcccg12240
cttgcgaggaatctccacaacttgtagcctctcgcccttacaatttgatgttcacaaaga12300
agcacgaaagtaaggctgggatgagcaacgcacacaagacacaaaatcagagcacaacac12360
gcacacaagtcacaactcgagctcacaacacaacccaaagagttctctactcaaatggag12420
ctctagttgctatcacaaagaatcgaatgcgcggaattggagtcttggtgcttaggaacg12480
cttagagaatgctttgtgtgttcctccatacgcctaggggtcccttttatagccccaagg12540
cagctaggagccgttgagagcattccaggaaggcaattcttgccttctgtcgcctggcgc12600
accggacagtccggtgcaccatcggacactgtccggtgcagatttctttccttttttagc12660
gaagccgaccgtcggagattcagagccgttggcgcactggacactgtccgatggacaccg12720
gacagtccggtgcccccttctgaccgttggctctgccacgcgtcgcgcgcggattacgcg12780
gccgaccgttggctcgaccgactgttggctcactgaacagtccggtgcaccaccggacag12840
tccggtgatttatagccgtacgccgccgacgaaacccgagagcagctagttcgctagagc12900
cagtctggcgcaccagacattgtccggtgcaccaccggacagttcggtgcacccagactg12960
cgcagagtcttggctgcacagccaagtcttttccaaattggtcttttcctgtttctagca13020
cttagacacattacattagtctccaaaacaatgtactaagtattagaaacatacctttac13080
tcttgatttgcactttgtccatcatttggcatagattaacacatgaccacttgtgttggc13140
actcaatctccaaaatacttagaaatggcccaagggcacatttccctttcagcggctagc13200
aacaggtccttggtttcttgggttatttattctctttttatcgtgtttgaatgttttcgt13260
gttcatttgcataacatcttaggtctacatcagtatatgaattgagatcaaatgtgaatt13320
ggaccacacaagctcatattcatctacatttgtagtcgtcactaacttagccaaatatgc13380
atatgtccgcttctgatttcattgtgtcttttcttcaggagtttggggatcaaggagagg13440
actccattatcttgtcaccgcgactgaaggagattagtactcctgaccgccccgctgccc13500
tccgtttcctaggtac13516
56
1026
dna
artificialsequence
cdnasnaret1
56
gcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcgagg60
gccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacgcct120
gcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaagaa180
gcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctgattcagaaaat240
ggacctggaggcaaggagcctacagcctagcattaaggctagtttgcttgcaaagctgag300
ggagtataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa360
tgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagt420
gagctaatgataggacttgactgtgtctacgagactgctcctaacaataaactgaagaaa480
gcaaaagaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgg540
gaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtc600
tcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggcggtcggccccg660
cgtacgagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagc720
agatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatga780
tggagctgcaggaggctttgcttgagagtaatcaggctacaaaggatgccaatgaaattc840
cttctgcctcaaatggagacaatgacgagtggtcagaagtacagagattgcagacaaggt900
aaattttgcaatagaaataactaaccaaccattagtgcttgaaaaaaactggactggtga960
ctggggcacgtggtttcatcaacatttggacctcaacggtctaatcagtataacttagaa1020
gttggc1026
57
874
dna
artificialsequence
cdnasnaret1
57
gcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcgagg60
gccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacgcct120
gcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaagaa180
gcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctgattcagaaaat240
ggacctggaggcaaggagcctacagcctagcattaaggctagtttgcttgcaaagctgag300
ggagtataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa360
tgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagt420
gagctaatgataggacttgactgtgtctacgagactgctcctaacaataaactgaagaaa480
gcaaaagaaatcattcaacgtattcgccgaagagaactctacaaggaagccaggaaccca540
tccgaacaaggttgcaatcatgataagcagatagagcaagcatatgatgatattttgaat600
tcgtcgaagcatactttggccagcatgatggagctgcaggaggctttgcttgagagtaat660
caggctacaaaggatgccaatgaaattccttctgcctcaaatggagacaatgacgagtgg720
tcagaagtacagagattgcagacaaggtaaattttgcaatagaaataactaaccaaccat780
tagtgcttgaaaaaaactggactggtgactggggcacgtggtttcatcaacatttggacc840
tcaacggtctaatcagtataacttagaagttggc874
58
553
dna
zeamays
58
cgatgtgcagtggcctgattagctacaagaagctcttgttccatggactcgatctctgga60
ccgcactatcgttgcctcagcccctaggtcatgctgccctctggcctcctcatcgtacaa120
ttcaccaacatctccaatgtaagtgcagctggttcagtaatgaactcagaagtggcatca180
gaatactccaagagttttttgttctttttgcctggatatataccaagggaaatgcattca240
aaactcctatagatgacgaatcccatctctccctcttttctcggacacggatccccaggt300
ccgtctccgtgctttactcatttgttttttacaagttcagatccacttgcgtactcacac360
agtggacatctgttatgcacatgtgtaaaccagcataagaattaggaattatgctcattt420
tatctaagaagtccttacactcgaaaatgcatgtgttatttagcttgagaataaataaaa480
ttattagcaaggagaaaaaaaataggactaaagaatagagtcacattggtttaaattagt540
acctagaagcaaa553
59
527
dna
artificialsequence
cdnasnaret2
59
gcttctcgatgtgcagtggcctgattagctacaagaagctcttgttccatggactcgatc60
tctggaccgcactatcgttgcctcagcccctaggtcatgctgccctctggcctcctcatc120
gtacaattcaccaacatctccaatgtaagtgcagctggttcagtaatgaactcagaagtg180
gcatcagaatactccaagagttttttgttctttttgcctggatatataccaagggaaatg240
cattcaaaactcctatagatgacgaatcccatctctccctcttttctcggacacggatcc300
ccaggtccgtctccgtgctttactcatttgttttttacaagttcagatccacttgcgtac360
tcacacagtggacatctgttatgcacatgtgtaaaccagcataagaattaggaattatgc420
tcattttatctaagaagtccttacactcgaaaatgcatgtgttatttagcttgagaataa480
ataaaattattagcaaggagaaaaaaaataggactaaagaatagagt527
60
9062
dna
zeamays
60
gttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaatttta60
tgtttttgttttgtgtctgcagattcggaagatggacttggaggcaaggagcctacagcc120
tagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaacaacgt180
caagagtgagctcaagaggatatttgcgcccaatgccaggcaggctacccgggaggagct240
cctagagtttggaatggctgatactctcgctgtgagctaatgctaggacttgactgtgtc300
tacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcgc360
cgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtgtcattttg420
tcatgtcattacacatggttaggatacatacttaagtttctaacgtaggcgtccacacaa480
cggattggtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcaggcac540
ccttccccgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttccact600
cttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgtggcc660
acactgttcgacgagcgagtcacagagctgctgagcgtgctcgctgatgcggcggtgggg720
cgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaaccag780
gcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctcca840
ccacttccttcatcgagggccgactgcttggctcgctggccaggcagccgagcattagtt900
gcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgggtct960
atccgcagagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctga1020
atcgctggtaaatagatgtcgcgacgcgttctgttttggggatccccttggctaacggga1080
catacgacatttggggaatgggtagaaaagcagagattagggatttttcgtttccgtcgg1140
tgcagttttggtgttccaacagagttgcgagatgtttatgtgccttagtcttcaatttgg1200
gggttgggggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaatggac1260
ctggaggcaaggagcctacagcctagcattaaggctggtttgcttgcaaagccgagggat1320
tataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaatgcc1380
aggtaggctacccgggaggagctcgtggagtctagaatggctgatactctcgcagtgagc1440
taatgctaggacttgactgtgtctacgagactgctcctaataataaactgaagaaagcaa1500
aagaaatcattcaacgtattcgccgaagagaactctacaaggtagtatgatgctttaatt1560
gctcatatacaagtgtcattttgtcatgtcattacacatggttaggatacatacttaagt1620
ttctaacgtaggcatccacacaatggattggtgcacggttctgccgatgtatcccacgca1680
cgcgcatggaaggaggcaggcacccttccctgccgccccggatctcgcgccagccatcgc1740
cctaccccgcctgcccttccactcttccccctgaaagtcgcctagagggggggtgaatag1800
ggcgaatctgaaatttacaaacttaagcacaactacaagccgggttaacgttagaaatat1860
aaacgagtccgagagagagggcgcaaaacaaatcatgagcaaataaagagtgagacacga1920
tgatttgttttaccgaggttcggttcttgcaaacctactccccgttgaggtggtcacaaa1980
gaccgggtctctttcaaccctttccctctctcaaacggtcacttagaccgagtgagcttc2040
tcttctcaatcaaacgaaacacaaagttcccgcaaggaccaccacacaattggtgtctct2100
tgccttggttacaattgagtttgatcacaagaagaatgagaaagaaaagaagcgatccaa2160
gcgcaagagctcaaatgaacacaaatgtcgctctctctagtcactatttgatttggagtg2220
attccggacttgggagaggatttgatcttttggagtgtctagaattgaatgctatagctc2280
ttgtaatatgttgaaggtgggaaacttggatgccattgaatgtggggtggttggggtatt2340
tatagccccaaaacaccaaaaaaggccgttggaaggctgctctcgcatggcgcaccggac2400
agtccggtgcgccagccacgtcagcagaccgttggggttcgaccgttggagctctgactt2460
gtggggcctctgggctgtccggtggtgcaccggacaggtcctgtaggatgtctggtgcgc2520
caactgcacgtgctctgtcctctgcgcgcgcaggcgcgcattaaatgcgttgtagtcaac2580
cgttgcgcgcgaagtagccattgctctgctggcacaccggacagtccggtgaattatagc2640
ggagcgccctctgattttcccgaaggtagcgagttcagcttcgagtgccctggtgcaccg2700
gacactgtccggtgcgccaaaccagggtgccttccgggatgtcttttgctctctttgttt2760
gaaccctttcttggtctttttattggcttattgtgaacctttgacacctgtaaaacttat2820
agactagagcaaactagttagtccaattatttgtgttggacaattcaaccaccaaaatca2880
attaggaaataggtgtgagcctaattccctttcaatctccccctttttggtgattgatgc2940
caacacaaaccaaagcaagtatagaagtgcataattgaactagtttgcataatgtaagtg3000
caaaggttacttagaattgaaccaataaatattttcataagttatgcatggattgtttct3060
ttattttcatcattttggaccacgcttgcaccacatgttttgtttttgcaaatccttttg3120
taaatagtcaaaggtaaatgaataagattttgagaagcattttcaaaatttgaaattttc3180
tccccctgtttcaaatgcttttcctttgacttaaacaaaactcccccctcaaaaatccta3240
ctcatagtgttcaagagggttttaagatatcaattttgaaaatgctactttctccccctt3300
ttgaatataataagatatcaattgaaaaattcatcattttaaaaccttttgaaaatgggt3360
ggtggtgcggtccttttgctttgggctaatactttctccccctttggcatgaatcgccaa3420
aaacgaatacttgagtgaaatataagcccctttaactactttctcctgctttggcgaaca3480
taatatgagtgaagattataccaaagttggagagttgcttgaagcgacggtgaaggatga3540
gttatggagtggaggttaagcctttgtcttcgccgaagattccaattccctttcaataca3600
cctatgacttggtttgaaatatacttgaaaacacattagtcatagcacatgaaagagata3660
tgatcaaaggtatattaatgagctatgtatgcaagacatcaaaagaaattcctagaatca3720
agaatatttagctcgtgtctaagtttgttcatctagtggcttggtaaagatatcggctaa3780
ttgttccttagtgttaatataggcaatctcgatatctccctttttttggtgatcccttag3840
gaaatgataccgaatggctatgtgtttagtgcggctatgctcaacgggattatccgccat3900
gcggattgcactctcattatcacatagaagaggaactttggttaatttttaaccatagtc3960
cctaagggtttgcctcatccaaagtaattgtgcgcaacaatggcctgcggcaatatactc4020
ggcttcggcggtagaaagagctacggaattttgcttctttgaagcccaagacaccaggga4080
ccttcccaagaactggcaagtccccgatgtactctttctattaattttacaccccgccca4140
atcggcatccgaataaccaatcaaatcaaatgtggatcccgtaggataccaaagcccaaa4200
cttaggagtatgaactaaatatctcaagattcgttttacggccgtaaggtgagcttcctt4260
agggtcggcttggaatcttgcacacatgcatacggaaagcataatatccggtcgagatgc4320
acataaatagagtaaagagcctatcatcgaccggtataccttttgatcgacggatttacc4380
tcccgtgtcgaggtcgagatgcccattggttcccatgggtgtcttgatgggtttggcatc4440
cttcatcccatacttgtttagaatgtcttgaatgtacttcgtttggctaatgaaggtgcc4500
ctcttagcgttgcttcacttgaaatcacaagaagtacttcaactcccccatcatagacat4560
ctcgaatttctgtgtcatgatcctactaaattcctcacatgtagattcattagtagaccc4620
aaatataatatcatcaacataaatttggcatacaaacaaatcattgtcaagagttttagt4680
aaataaagtaggatcggcctttccgactttgaaaccattagtgataaggaaatctctaag4740
gcattcataccatgctcttggggcttgcttgagcccataaagcgccttagagagtttata4800
tacgtgattagggtactcactatcttcaaagccgggaggttgctcaacatagacctcttc4860
cttgattggtccgttgaggaaggcacttttcacgtccatttgataaagcttgaagccatg4920
gtaagtagcataggcaagtaatatgcgaattgactcaagcctagctacgggtgcataggt4980
ttcaccaaaatccaaaccttcgacttgtgaataacccttggccacaagtcgggctttgtt5040
ccttgtcaccacaccatgctcatcttgtttgttgcggaagacccacttggttcctacaac5100
attttgattaggacgtggaactaagtgacatacctcattcctagtgaagttgttgagctc5160
ctcttgcatcgccaccacccaatctgaatcctgtagtgcttcctctaccctttgtggctc5220
aatagaggaaacaaaagagtaatgttcacaaaaatgagcaacacgagatcgagtagttac5280
ccccttttgaatatcgccgaggatggtgttcacggggtgatctcgttggattgcttggtg5340
gactcttgggtgtggcggtcttggttcttcatcctccttgtcttgatcatttgcatctcc5400
cccttgattattgccgtcatcttgaggtggctcatcttcttgatcttctcctttatcatc5460
ttgagcctcatcctcattttgagttggtggagatgcttgcgtggaggaggatggttgatc5520
ttgtgcatttggaggctctttggattccttaggacacacatccccaatggacatgttcct5580
tagcgcgacgcacggagcctcttcatcacctatctcatcaagatcaacttgctctacttg5640
agagccgttagtttcatcaaacacaatgtcaccagaaacttcaactagtcccgaggactt5700
gttaaagactctatatgcccttgtgtttgaatcataccctagtaaaaagccttctacagc5760
cttaggagcaaatttagattttctacctcttttaacaagaatgaagcatttgctaccaaa5820
gactctaaaatatgaaacattgggcttttaccggttaggagttcatatgatgtcttcttg5880
aggattcggtgtagatataaccggttgatggtgtagcaagcggtgttgaccgcctcgatc5940
caaaaccgatccgaagtcttgtactcatcaagcatggttcttgccatgtccaatagagtt6000
cgattcttcctctccactacaccattttgttgtggggtgtagggagaagagaactcatgc6060
ttgatgccctcctcctcaaggaagccttcgatttgagaattcttgaactccgtcccgttg6120
tcgcttctaatctttttgattcttaagccgaactcattttgagcccatctcaagaatccc6180
tttaaggtctcttgggtatgagatttttcctgcaaaaagaatacccaagtgaagcgagta6240
taatcatccacaataactagatagtacttactcccgccgatgcttatgtaagctatcggg6300
ccgaataggtccatatgtaggagctcgagtggcctgtcagtcatcatgatgttcttgtgt6360
ggatgatgagtaccaacttgcttccctgtctgacatgcgctacaaatcctatctttctca6420
aagtgaacatttgttagtcccaaaatgtgttctccctttagaagcttgtgaagattcttc6480
attccaacatgtgctagtcggtgatgccagagccagcccatgttagtcttagcaattaag6540
caagtgtcgagttcagctctatcaaaatctaccaagtatagctgaccctctaacactccc6600
ttaaatgctactgaatcatcacttcttctaaagacagtaacacctatatctgtaaaaaga6660
cagttgtagcccattttacataattgcgaaactgaaagcaagttgtaatctaaagaatct6720
acaagaaacacattggaaatggaatggtcaggagatatagcaattttacccaatcctttg6780
accaaaccttggtttccatccccgaatgtgatcgctctttggggatcttggtttttctca6840
taggaggagaacattttcttctcccctgtcatatggtttgtgcacccgctatcgatgatc6900
caacttgggcccccggatgcataaacctacaaaacaagtttagttcttgattttaggtac6960
ccaaatggttttgggtcctttgacattagatacaagaactttgggtacccaaacacaagt7020
ctttgatcccttgtgtttgcccccaacatacttggcaactatcttgtcggatttgttagt7080
taaaacataagatgcatcaaaagttttgaatgaaatgttatgatcatttgatgcagcagg7140
agttttcttcttaggcaattttgcacgggttgattgcctagagctagatgtctcaccctt7200
atacataaaagcatgattatggccagagtgagacttcctagaatgaattctcctaatttt7260
gctctcgggataaccggcagggtacaaaatgtaaccctcattatcctgaggcatgggagc7320
cttgcccttaacaaagtttgacaatcttttaggagaggcattaagtttgacattgtttcc7380
cttttggaagccaatgccatccttgatgccagggcgtctcccactatagagcatgcttct7440
agcaaatttaaatttttcattttttaagtcatgctcggcaattttagcatctaattttgc7500
tatatgattattttgttgtttaattaaagccatatgatcatgaatagcatcaatgttaat7560
atctctacatctagtgcaaataatgacatgctcaatggcagatgtagagggtttgcaaga7620
attaagttcaacaatcttagcacgtaaaatatcattgttatttctaagatcagaaatgga7680
agcattgcaaacatctaattctttagccttagcaatcaatttttcattttcaaccctaag7740
gctagcaagagagacattcaattcttcaatcttagcaagcaaattaacattatcatctct7800
aagattgggaattgaaacatcacaaatattagaatcaaccttagcaattagtttagtatt7860
tttatttctaaggatggtaatagtatcatggcaagtgcttagctcactagataatttttc7920
acatttttctacttctagagcataagcatttttaaccttaacatgcttcttattttcctt7980
aattaggaagtcctcttgaaagtccaagagatcatctttctcatgaatagcactaattaa8040
ttcatttagtttttcctgtagttgcatgtttaggttggcaaaaagggtacgcaaattatc8100
ctcctcatcactagcattatcttcatcactagaggatgcatatttagtggaggattttga8160
ttttaccttcttctttttgccgtcctttgccatgaggcacttgtggccgacgttggggaa8220
gaggagccctttggtgacggcgatgttggcggcgtcctcgtcggatgaggagtcggagga8280
actctcgtcggagtcccactcgcggcacacatgggcatcgccgcccttcttcttgtaata8340
cctcctcttttctctcctcttgcccttcttgtcgtcgcccctgtcactatcactagataa8400
aggacatttaacaatgaaatgaccgggcttaccacacttgtagcacacccttttggaaca8460
aggcttgtaatctttccccttcctttgtttgaggatttggtggaagctcttgatgatgag8520
cgccattttctagttgtcgagcttagaggcgtcgatgggttgtctacttgatgtagactc8580
ctctttcttctcctctgtcgccttgaatgcgaccggttgtgcttcgggcgtggagggacc8640
gtcgtgctcgataattttctttgagcctttgatcatcaactcaaagctcacaaagtttcc8700
tattacttcctcgggagtcattagtgtatatctaggattaccacgaattaattgaacttg8760
cgtagggttaaggaacacaagtgatctaagaataaccttaaccatttcatggtcatccca8820
ttttttgctcccgaggttgcgcacttggttcaccaaggtcttgagccggttgtacatatc8880
ttgtggctcctccccttggcgaagccggaagcgaccgagctccccctcgatcgtctcccg8940
cttggtgatcttggttacctcgtctccttcgtgcgcggtctttagcacgtcccagatatc9000
ctttgcactcttcaacccttgcaccttattatactcctctcgacttggatttgaaatgtt9060
gg9062
61
1082
dna
artificialsequence
cdnasnaret3
61
cctgcccttccactcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60
cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120
atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180
ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240
ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300
gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360
atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420
gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480
agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540
aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600
caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660
ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720
aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780
agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840
agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900
tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960
gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020
gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080
ct1082
62
1154
dna
artificialsequence
cdnasnaret3
62
cctgcccttccactcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60
cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120
atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180
ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240
ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300
gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360
atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420
gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480
agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540
aagagtgagctcaagaggatatctgcgcccaatgccagactgctcctaataataaactga600
agaaagcaaaagaaatcattcaacgtattcgccgaagagaactctacaagattcggaaga660
tggacctggaagcaaggagcctacaacctagcattaagagtgagctcaagaggatatctg720
cgcccattgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactc780
tcgcagtgagctaatgctaggacttgactgtgtctacgagactgctcctaacaataaact840
gaagaaagcaaaagaaatcattcaacgtattcgccgaagagaactctacaagatatggta900
gtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctc960
tctggtctcgcgtagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggcc1020
ggccccgcgtacgagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcat1080
gataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcc1140
agcatgatggagct1154
63
107
prt
zeamays
63
metileglyleuaspcysvaltyrgluthralaproasnasnlysleu
151015
lyslysalalysgluileileglnargileargargarggluleutyr
202530
lysglualaargasnprosergluglnglycysasnhisasplysgln
354045
ilegluglnalatyraspaspileleuasnserserlyshisthrleu
505560
alasermetmetgluleuglnglualaleuleugluserasnglnala
65707580
thrlysaspalaasngluileproseralaserasnglyaspasnasp
859095
glutrpsergluvalglnargleuglnthrarg
100105
64
131
prt
zeamays
64
metcysserglyleuilesertyrlyslysleuleuphehisglyleu
151015
aspleutrpthralaleuserleuproglnproleuglyhisalaala
202530
leutrpproprohisargthrilehisglnhisleuglncyslyscys
354045
sertrppheserasngluleuargserglyileargileleuglnglu
505560
phephevalleuphealatrpiletyrthrlysglyasnalaphelys
65707580
thrproileaspaspgluserhisleuserleupheserargthrarg
859095
ileproargservalservalleutyrserphevalphetyrlysphe
100105110
argserthrcysvalleuthrglntrpthrservalmethismetcys
115120125
lysproala
130
65
162
prt
zeamays
65
metgluglyglyarghisproserproproproargileserarggln
151015
proproprotyrproalacysproserileleuproproleupropro
202530
valasnvalthrasnproglyleuvalproleuvalvalalathrleu
354045
pheaspgluargvalthrgluleuleuservalleualaaspalaala
505560
valglyargproglyargtrpserileglyglualaprotrpserser
65707580
serglyglythrasnglnalavaltyralaargargalaproglyser
859095
serserproproproalaproalaserproproleuproserserarg
100105110
alaaspcysleualaargtrpproglyserargalaleuvalalapro
115120125
leuglythrproalaphevalaspargleuphetrpserasppheser
130135140
glyserileargargglugluglualaglualaleuargaspproile
145150155160
argarg