
Method for Producing Haploid Inducers

2023-10-23 21:56:57


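Field of the Invention

The present invention relates to the field of modifying plants by means of molecular-biological methods, marker technology and genetic engineering. It relates to the provision of technical tools such as nucleic acids and vectors, and to methods and uses for producing and identifying non-transgenic and transgenic plant haploid inducers, as well as for improving existing plant haploid inducers.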

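Background of the Invention

In hybrid plant production, two breeding lines serving as parents are generally crossed with one another; owing to the known heterosis effect, some of their progeny show strongly increased yield relative to the parental lines. Such breeding lines can be obtained through multiple selfing steps, but this takes many generations and therefore a great deal of time. For many years, modern plant breeding has increasingly shifted towards generating breeding lines in a much shorter time through haploid induction followed by chromosome doubling. A technical prerequisite for this is a usable haploid induction system that is also efficient enough to be economically viable.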

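For maize (Zea mays), for example, a maternal in vivo induction system is known in which the plants to be induced are pollinated with pollen of an inducer line. Up to 10% of the resulting progeny then contain only the single (haploid) chromosome set of the seed parent. A few such inducer lines are currently available for maize hybrid breeding. However, all of them trace back to the single line "Stock 6" described by Coe (1959). One example of such a known inducer line is RWS (Röber et al., 2005). In the past, several QTL studies were carried out on these lines in order to identify inducer-associated loci. Deimling et al. identified a major QTL on maize chromosome 1 (bin 1.04) as early as 1997. Barret et al. (2008) mapped it more precisely to the range between 66.96 Mb and 68.11 Mb on chromosome 1, Prigge et al. (2012) to the range between 62.9 Mb and 70.8 Mb, and subsequently Dong et al. (2013) to the range between 68.18 Mb and 68.43 Mb, which according to the public annotation contains three genes. All position information refers to the B73 reference genome, version AGPv02. The 5% induction rate achieved by Dong et al. (2014) appears to have demonstrated the functionality of this locus on its own. However, erroneous fine mapping cannot be ruled out because, in the absence of information on flanking markers in the recurrent parent, the QTL could not be delimited unambiguously.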

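Furthermore, WO 2012/030893 discloses an inducer-associated locus on chromosome 1 in maize; however, it differs markedly from the aforementioned locus and is located more specifically at the telomere. There is no overlap between the genomic regions under consideration.

Overall, the molecular and developmentally specific mechanisms of in vivo haploid induction in maize lines derived from "Stock 6" are still largely unknown. It is conceivable, for example, that fertilization occurs but subsequently leads to chromosome elimination, which then allows haploid progeny to arise. Ravi & Chan (2010), for example, have described such a mechanism in a system involving the histone CENH3. On the other hand, fertilization may also fail, with a haploid egg cell developing within a triploid endosperm. Without knowledge of the genes responsible for maternal in vivo haploid induction in "Stock 6"-derived inducer genotypes, targeted improvement of these maize inducers, transfer of the induction genes into non-inducer genotypes, or targeted engineering of in vivo haploid induction capability in maize non-inducer lines is practically impossible.

Moreover, for some crop plants, such as sorghum, rye or sunflower, no efficient (and therefore economical) system for producing haploid and doubled haploid plants is known at all.

There is a need to provide genetic elements, such as genes or regulatory elements, that can be used in transgenic and/or non-transgenic approaches to enable haploid development through in vivo induction, or to improve its efficiency.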

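Summary of the Invention

The present invention was made against the background of the prior art described above; its object is to provide tools and methods that can be used to produce in vivo haploid inducers and/or to provide haploid plants.

According to the invention, this object is achieved by a nucleic acid which, after transcription or expression in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer. The nucleic acid according to the invention can be used as a transgene. Alternatively, an endogenous DNA sequence in the genome of a plant or of a plant haploid inducer that is identical to one of the nucleic acids of the invention can be modified such that, after transcription or expression of said endogenous DNA sequence, the property of a haploid inducer is mediated or the induction capability of a haploid inducer is increased. The nucleic acid of the invention is preferably an isolated nucleic acid, i.e. one removed from its natural or original environment (genetic background). The nucleic acid may be double-stranded or single-stranded, and linear or circular. It may thus be genomic DNA, synthetic DNA, cDNA or a type of RNA (such as lncRNA, siRNA or miRNA), where in RNA the nucleobase uracil is present in place of the nucleobase thymine.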

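In a preferred embodiment of the invention, the nucleic acid of the invention, or the RNA, protein or polypeptide encoded by it, affects pollen tube growth in the plant, the energy metabolism of the plant's pollen and/or the activity of the centromere, preferably in germ cells (e.g. those that develop into pollen).

The nucleic acid of the invention may be characterized in that it, or the RNA, protein or polypeptide it encodes, is suitable or usable for accelerating or promoting pollen tube growth (e.g. in the pollen of a plant) compared with the pollen of a wild-type plant, the nucleic acid or its encoded RNA, protein or polypeptide being used as described below. For example, the nucleic acid of the invention encodes a protein that participates in or influences the transport of macromolecules in the pollen tube. These include, for example, SNAREv proteins, which mediate the transport of, for instance, pectins or phospholipids at the pollen tube tip (Kato et al., 2010). Furthermore, enzymes of the phospholipase class, in particular phospholipase A2 or patatin phospholipase, can promote pollen tube growth (Kim et al., 2011), whereas inositol polyphosphate 5-phosphatases, such as inositol-1,4,5-trisphosphate 5-phosphatase, can inhibit it (Wang et al., 2012). The nucleic acid of the invention can be used as a transgene for the purpose of accelerating pollen tube growth. In that case, compared with a wild-type plant or the corresponding part thereof, it increases in the plant or a part thereof (e.g. by an overexpression approach) the expression rate of a pollen-tube-growth-promoting gene, or the transcription rate of an RNA such as an lncRNA that positively regulates (activates) a growth-promoting gene or negatively regulates (represses) a growth-inhibiting gene; and/or it reduces, by an RNAi or miRNA approach (Fire et al., 1998), the expression rate of a pollen-tube-growth-inhibiting gene. Alternatively, an endogenous DNA sequence in the genome of a plant or of a plant haploid inducer that is identical to the nucleic acid of the invention, or a regulatory sequence of that endogenous DNA sequence, can be modified, for example by mutagenesis or "genome editing". Compared with a non-mutagenized wild-type plant, the modification can increase or decrease the transcription or expression rate of the endogenous DNA sequence in the plant, or the activity or stability of the protein or polypeptide it encodes. For example, relative to a wild-type plant that has been neither mutagenized nor modified by "genome editing", the expression rate of an endogenous growth-promoting gene, or the transcription rate of an endogenous RNA (such as an lncRNA) that positively regulates a growth-promoting gene or negatively regulates a growth-inhibiting gene, can thereby be increased; likewise, the expression rate of an endogenous growth-inhibiting gene, or the transcription rate of an endogenous RNA that negatively regulates a growth-promoting gene or positively regulates a growth-inhibiting gene, can be reduced. In addition, relative to the same reference, the activity or stability of a growth-promoting protein or polypeptide encoded by the endogenous DNA sequence can be increased, or that of a growth-inhibiting protein or polypeptide can be reduced.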

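In a further example, the nucleic acid of the invention may be characterized in that, by using it or the RNA, protein or polypeptide it encodes, the energy metabolism of pollen in the plant can be negatively affected compared with a wild-type plant. This can be achieved, for example, via a phosphoglycerate mutase, a mitochondrial transport protein or a mitochondrial import receptor. For this purpose, the nucleic acid of the invention can be used as a transgene in an overexpression, RNAi or miRNA approach (Fire et al., 1998). Alternatively, an endogenous DNA sequence in the genome of a plant or of a plant haploid inducer that is identical to the nucleic acid of the invention, or a regulatory sequence of that endogenous DNA sequence, can be modified, for example by mutagenesis or "genome editing". Compared with a wild-type plant that has been neither mutagenized nor modified by "genome editing", the modification can increase or decrease the transcription or expression rate of the endogenous DNA sequence in the plant, or the activity or stability of the protein or polypeptide it encodes.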

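In another example, the nucleic acid of the invention may also be characterized in that, by using it or the RNA, protein or polypeptide it encodes, the activity of the centromere in the plant is modified compared with the wild type (in particular during early embryogenesis and preferably in germ cells of the plant that develop into, e.g., pollen), leading for example to elimination of the inducer genome. Centromere activity can be modified through chromatin modification at the DNA or histone level and, in addition, through transcription, RNA interaction or RNA binding. For example, a change in centromere activity can be brought about by a methyltransferase such as an RNA methyltransferase. For this purpose, the nucleic acid of the invention is used as a transgene which, by an overexpression approach, increases in the plant, relative to a wild-type plant, the expression rate of a chromatin-modifying gene or of an RNA (such as an lncRNA) that positively regulates (activates) a chromatin-modifying gene. Alternatively, an endogenous DNA sequence in the genome of a plant or of a plant haploid inducer that is identical to the nucleic acid of the invention, or a regulatory sequence of that endogenous DNA sequence, can be modified, for example by mutagenesis or "genome editing". Compared with a wild-type plant that has been neither mutagenized nor modified by "genome editing", the modification can increase or decrease the transcription or expression rate of the endogenous DNA sequence in the plant, or the activity or stability of the protein or polypeptide it encodes. Relative to the same reference, the expression rate of a chromatin-modifying gene, or of an RNA (such as an lncRNA) that positively regulates such a gene, can thus also be increased in the plant. In addition, the activity or stability of a chromatin-modifying protein encoded by the endogenous DNA sequence can be increased.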

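The uses described above of the nucleic acid of the invention, or of the RNA, protein or polypeptide it encodes, are neither exclusive nor limiting; rather, they are to be understood merely as examples. The person skilled in the art knows from the prior art many additional technical means and methods for achieving the above-described changes in the expression or transcription rate of the nucleic acid of the invention or of the identical endogenous DNA sequence, or the above-described changes in the stability and activity of the protein or polypeptide encoded by them.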

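In a particularly preferred embodiment of the invention, the nucleic acid which, after transcription or translation in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer may comprise a nucleotide sequence which

(i) is selected from SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 42, 43, 46, 47, 49, 50, 52, 53, 55, 56, 57, 58, 59, 60, 61 and/or 62, or is a functional fragment thereof, or

(ii) is complementary to a sequence from (i), or

(iii) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (i), or

(iv) encodes a protein having an amino acid sequence selected from SEQ ID NO: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 51, 54, 63, 64 and/or 65, or a functional part of said protein, or

(v) encodes a homolog, analog or ortholog of a protein according to (iv), or a functional part thereof, or

(vi) hybridizes under stringent conditions with a sequence from (ii).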

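This nucleic acid may encode a protein or a functional part thereof having the function of a SNARE protein (in particular a SNAREv protein), a phospholipase (in particular phospholipase A2 or patatin phospholipase), a methyltransferase (in particular an RNA methyltransferase) or a mitochondrial import receptor (see Table 1). The nucleic acid can be used as described above: in order to mediate the property of a haploid inducer in a plant or to increase the induction capability of a haploid inducer, the expression rate of the nucleic acid, or the activity or stability of the encoded protein or protein part, is for example increased transgenically or endogenously. Because this nucleic acid, or the RNA, protein or polypeptide it encodes, has a positive effect on the haploid induction capability of a plant, the nucleic acid defined here is referred to below as an induction-promoting nucleic acid. Additional methods and uses of the induction-promoting nucleic acid, and of materials comprising it, are disclosed further below.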
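The identity thresholds in item (iii) above (at least 80% up to 99.5%) can be illustrated with a minimal Python sketch. The text does not say how sequence identity is to be computed, so the convention used here (matching positions divided by alignment length, over two sequences that are already aligned to equal length) is an assumption, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned sequences of equal length.

    Assumption: identity = matching columns / alignment length * 100,
    where '-' marks a gap and never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the pair reaches the given identity threshold (in percent)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For real sequences an alignment tool would be run first; this sketch only covers the final percentage step.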

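In another particularly preferred embodiment of the invention, the nucleic acid which, after transcription or translation in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer may be a nucleic acid comprising a nucleotide sequence which

(i) has a sequence selected from SEQ ID NO: 26, 27, 28, 29, 30 and/or 31, or a functional fragment thereof, or

(ii) is complementary to a sequence from (i), or

(iii) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (i), or

(iv) encodes a protein having an amino acid sequence selected from SEQ ID NO: 32, 33 and/or 34, or a functional part of said protein, or

(v) encodes a homolog, analog or ortholog of a protein according to (iv), or a functional part thereof, or

(vi) hybridizes under stringent conditions with a sequence from (ii).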

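Such a nucleic acid may encode a protein or a functional part thereof having the function of an inositol polyphosphate 5-phosphatase (in particular inositol-1,4,5-trisphosphate 5-phosphatase) or of a phosphoglycerate mutase (see Table 1). The nucleic acid can be used as described above: in order to mediate the property of a haploid inducer in a plant or to increase the induction capability of a haploid inducer, the expression rate of the nucleic acid, or the activity or stability of the encoded protein or protein part, is for example reduced transgenically or endogenously. Because this nucleic acid, or the RNA, protein or polypeptide it encodes, has a negative effect on the haploid induction capability of a plant, the nucleic acid defined here is referred to below as an induction-inhibiting nucleic acid. Additional methods and uses of the induction-inhibiting nucleic acid, and of materials comprising it, are disclosed further below.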

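In another particularly preferred embodiment of the invention, the nucleic acid which, after transcription or translation in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer may be a nucleic acid encoding an RNA having a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence that is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25, preferably at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120 or 140, and particularly preferably at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900 or 1000 contiguous nucleotides of the coding sequence of one of the following nucleic acids:

(i) a nucleic acid having, in sense or antisense orientation, a sequence selected from SEQ ID NO: 26, 27, 28, 29, 30 and/or 31, or a functional fragment thereof, or

(ii) a nucleic acid complementary to a sequence from (i), or

(iii) a nucleic acid that is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (i), or

(iv) a nucleic acid encoding a protein having an amino acid sequence selected from SEQ ID NO: 32, 33 and/or 34, or a functional part of said protein, or

(v) a nucleic acid encoding a homolog, analog or ortholog of a protein according to (iv), or a part thereof, or

(vi) a nucleic acid that hybridizes under stringent conditions with a sequence from (ii).

In post-transcriptional gene silencing, as described for example for RNAi and miRNA approaches (Fire et al., 1998), such a nucleic acid can be used to suppress expression of the induction-inhibiting nucleic acid described above. The dsRNA-encoding nucleic acid may also be a nucleic acid encoding a long non-coding RNA (lncRNA). The lncRNA nucleic acid then preferably comprises a nucleotide sequence which

(a) has a sequence selected from SEQ ID NO: 35, 36, 37 and/or 38, or a fragment thereof, or

(b) is complementary to a sequence from (a), or

(c) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (a), or

(d) encodes a polypeptide having the amino acid sequence of SEQ ID NO: 40 or 41, or a part of said polypeptide, or

(e) hybridizes under stringent conditions with a sequence from (b).

This lncRNA, hereinafter referred to as lncRNA1, can be used to regulate the expression or translation of an inositol polyphosphate 5-phosphatase, e.g. inositol-1,4,5-trisphosphate 5-phosphatase. Furthermore, the lncRNA-encoding nucleic acid may preferably comprise a nucleotide sequence which

(w) has the sequence of SEQ ID NO: 39 or a fragment thereof, or

(x) is complementary to a sequence from (w), or

(y) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (w), or

(z) hybridizes under stringent conditions with a sequence from (x).

This lncRNA, hereinafter referred to as lncRNA2, can be used to regulate the expression or translation of a phospholipase, in particular phospholipase A2 or patatin phospholipase.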
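The requirement above that one strand of the double-stranded RNA share at least 14 contiguous nucleotides with the target coding sequence can be checked with a short Python sketch. It covers only the "identical" case (not the looser "homologous" one) and only the sense orientation; the function name is hypothetical, and the example target is the first 60 nucleotides of SEQ ID NO: 1 as printed in the sequence listing, used purely as test data.

```python
def longest_shared_run(strand: str, target: str) -> int:
    """Length of the longest contiguous stretch common to both sequences.

    RNA input is accepted: U is treated as T before comparison.
    Uses the classic longest-common-substring dynamic programme.
    """
    strand = strand.upper().replace("U", "T")
    target = target.upper().replace("U", "T")
    best = 0
    prev = [0] * (len(target) + 1)
    for i in range(1, len(strand) + 1):
        cur = [0] * (len(target) + 1)
        for j in range(1, len(target) + 1):
            if strand[i - 1] == target[j - 1]:
                cur[j] = prev[j - 1] + 1  # extend the run ending at (i-1, j-1)
                best = max(best, cur[j])
        prev = cur
    return best


# Example data: start of SEQ ID NO: 1 from the sequence listing.
TARGET = "atggggagcagtgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtca"
# A 20-nt RNA strand copied from positions 11-30 of the target (T written as U).
STRAND = TARGET[10:30].replace("t", "u")
```

With these inputs, `longest_shared_run(STRAND, TARGET)` equals the strand length (20), which is above the lowest threshold of 14 named in the text.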

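In another particularly preferred embodiment of the invention, the nucleic acid which, after transcription or translation in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer may be a nucleic acid encoding an RNA having a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence that is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25, preferably at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120 or 140, and particularly preferably at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900 or 1000 contiguous nucleotides of an intron sequence of one of the following nucleic acids:

(i) a nucleic acid having, in sense or antisense orientation, a sequence selected from SEQ ID NO: 1, 6, 8, 9, 12, 13, 26, 30, 42, 43, 46, 55, 58 and/or 60, or a functional fragment thereof, or

(ii) a nucleic acid complementary to a sequence from (i), or

(iii) a nucleic acid that is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (i), or

(iv) a nucleic acid encoding a protein having an amino acid sequence selected from SEQ ID NO: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 63, 64 and/or 65, or selected from SEQ ID NO: 32, 33 and/or 34, or a part of said protein, or

(v) a nucleic acid encoding a homolog, analog or ortholog of a protein according to (iv), or a part thereof, or

(vi) a nucleic acid that hybridizes under stringent conditions with a sequence from (ii).

In transcriptional gene silencing, for example in the RdDM approach (Shibuya et al., 2009), such a nucleic acid can be used to activate expression of the induction-promoting nucleic acid described above, or to suppress expression of the induction-inhibiting nucleic acid described above. The dsRNA-encoding nucleic acid may also be a nucleic acid encoding a long non-coding RNA (lncRNA). The lncRNA nucleic acid then preferably comprises a nucleotide sequence which

(a) has a sequence selected from SEQ ID NO: 35, 36, 37 and/or 38, or a fragment thereof, or

(b) is complementary to a sequence from (a), or

(c) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (a), or

(d) encodes a polypeptide having the amino acid sequence of SEQ ID NO: 40 or 41, or a part of said polypeptide, or

(e) hybridizes under stringent conditions with a sequence from (b).

This lncRNA, hereinafter referred to as lncRNA1, can be used to regulate the expression or translation of an inositol polyphosphate 5-phosphatase, e.g. inositol-1,4,5-trisphosphate 5-phosphatase. Furthermore, the lncRNA-encoding nucleic acid may preferably comprise a nucleotide sequence which

(w) has the sequence of SEQ ID NO: 39 or a fragment thereof, or

(x) is complementary to a sequence from (w), or

(y) is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (w), or

(z) hybridizes under stringent conditions with a sequence from (x).

This lncRNA, hereinafter referred to as lncRNA2, can be used to regulate the expression or translation of a phospholipase, in particular phospholipase A2 or patatin phospholipase.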

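Table 1: Sequence listing and numbering of the nucleotide and amino acid sequences. The names of the gene families/protein families correspond to the public gene models. Structural changes in the inducer line may lead to different functions.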

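In another aspect, the invention relates to a vector comprising the nucleic acid of the invention. The vector may be a plasmid, cosmid, phage or expression vector, transformation vector, shuttle vector or cloning vector; it may be double-stranded or single-stranded, linear or circular; and it may transform a prokaryotic or eukaryotic host either by integration into its genome or extrachromosomally. In the vector, the nucleic acid of the invention is preferably operably linked to one or more regulatory sequences that allow transcription and, optionally, expression in a prokaryotic or eukaryotic host cell. A regulatory sequence (preferably DNA) may be homologous or heterologous to the nucleic acid of the invention. For example, the nucleic acid is under the control of a suitable promoter or terminator. Suitable promoters may be constitutive (example: the 35S promoter from cauliflower mosaic virus (Odell et al., 1985)); tissue-specific promoters are particularly suitable (example: pollen-specific promoters, Chen et al. (2010), Zhao et al. (2006) or Twell et al. (1991)), as are development-specific promoters (example: flowering-specific promoters). Suitable promoters may also be synthetic or chimeric promoters that do not occur in nature, are composed of several elements, and contain a minimal promoter together with at least one cis-regulatory element upstream of it that serves as a binding site for a particular transcription factor. Chimeric promoters can be designed according to the desired characteristics and can be induced or repressed by different factors. Examples of such promoters can be found in Gurr & Rushton (2005) or Venter (2007). A suitable terminator is, for example, the nos terminator (Depicker et al., 1982).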

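In addition to the vectors described above, the invention also provides a method comprising inserting a described vector into a host cell. The vector can be introduced, for example, by conjugation, mobilization, biolistic transformation, Agrobacterium-mediated transformation, transfection, transduction, vacuum infiltration or electroporation. Such methods, like methods for preparing the described vectors, are common knowledge to the person skilled in the art (Sambrook et al., 2001).

In another aspect, the invention relates to a host cell comprising the nucleic acid of the invention or the vector of the invention. A host cell within the meaning of the invention may be a prokaryotic cell (e.g. a bacterium) or a eukaryotic cell (e.g. a plant cell or yeast cell). The host cell is preferably an Agrobacterium, such as Agrobacterium tumefaciens or Agrobacterium rhizogenes, or a plant cell, comprising the nucleic acid or the vector of the invention. The person skilled in the art knows many methods for introducing the nucleic acid or vector of the invention into Agrobacterium (such as conjugation or electroporation), as well as methods for introducing them into plant cells, such as various transformation methods (biolistic transformation, Agrobacterium-mediated transformation) (Sambrook et al., 2001).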

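In another aspect, the invention relates to a transgenic plant cell comprising the nucleic acid of the invention as a transgene or comprising the vector of the invention, and to a transgenic plant or part thereof comprising said transgenic plant cell. Such a plant cell or plant is, for example, one that has been transformed (preferably stably transformed) with the nucleic acid or vector of the invention. The transgenic plant of the invention is preferably suitable for use as a haploid inducer. In a preferred embodiment of the transgenic plant, the nucleic acid is operably linked to one or more regulatory sequences that allow transcription and, optionally, expression in the plant cell. A regulatory sequence (preferably DNA) may be homologous or heterologous to the nucleic acid of the invention. The overall construct consisting of the nucleic acid of the invention and the regulatory sequence(s) can then represent the transgene. A part of a plant may be a fertilized or unfertilized seed, an embryo, pollen, a tissue, an organ or a plant cell, where the fertilized or unfertilized seed, the embryo or the pollen is produced on the transgenic plant and has the nucleic acid or vector of the invention integrated into its genome as a transgene. The invention also includes progeny of the transgenic plant that have the nucleic acid or vector of the invention integrated into their genome as a transgene and are suitable for use as a haploid inducer.

In another aspect, the invention relates to a protein or polypeptide encoded by the nucleic acid of the invention. The protein or polypeptide is preferably suitable for mediating the property of a haploid inducer in a plant, or for increasing the induction capability of a haploid inducer. Proteins or polypeptides encoded by the induction-promoting nucleic acid are particularly preferred. The protein or polypeptide of the invention preferably comprises an amino acid sequence selected from SEQ ID NO: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 44, 45, 48, 51, 54, 63, 64 and/or 65, or from SEQ ID NO: 32, 33 and/or 34, or from SEQ ID NO: 40 and/or 41.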

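In another aspect, the invention describes a method for producing a plant suitable for use as a haploid inducer. The method may comprise the following steps:

a) mutagenizing plant cells and subsequently regenerating plants from the mutagenized plant cells, or mutagenizing plants, and

b) identifying a plant from a) that has at least one mutation in an endogenous DNA sequence identical to the nucleic acid of the invention, or in a regulatory sequence of that endogenous DNA sequence (e.g. a promoter, enhancer, terminator or intron), the mutation leading to a change in the transcription or expression rate of the endogenous DNA sequence in the identified plant compared with a non-mutagenized wild-type plant, or to a change in the activity or stability of the protein or polypeptide encoded by the endogenous DNA sequence, wherein the at least one mutation causes the property of a haploid inducer to be mediated, or the induction capability of a haploid inducer to be increased, in the identified plant. The change in transcription or expression rate, or in activity or stability, preferably occurs at least in the pollen or pollen tissue of the identified plant.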

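The endogenous DNA sequence from step b), or the RNA, protein or polypeptide it encodes, preferably affects pollen tube growth in the plant, the energy metabolism of the plant's pollen and/or the activity of the centromere, preferably in germ cells (e.g. those that develop into pollen).

The endogenous DNA sequence of step b) of the method particularly preferably encodes a SNAREv protein; an enzyme of the phospholipase class, in particular phospholipase A2 or patatin phospholipase; an enzyme of the inositol polyphosphate 5-phosphatase class, e.g. inositol-1,4,5-trisphosphate 5-phosphatase; a phosphoglycerate mutase; or a methyltransferase, in particular an RNA methyltransferase. In the case of the SNARE protein, the phospholipase and the methyltransferase, the transcription or expression rate, or the activity or stability, is preferably changed such that it is increased; in the case of the inositol polyphosphate 5-phosphatase and the phosphoglycerate mutase, it is preferably changed such that it is reduced.

Very preferably, step b) of the method consists in identifying a plant from a) which a) has at least one mutation in an endogenous DNA sequence identical to the induction-promoting nucleic acid or to the nucleic acid encoding lncRNA1, or in a regulatory sequence of that endogenous DNA sequence (such as a promoter, enhancer, terminator or intron), wherein the at least one mutation brings about an increase in the transcription or expression of the endogenous DNA sequence or in the activity or stability of the protein or polypeptide it encodes; and/or b) has at least one mutation in an endogenous DNA sequence identical to the induction-inhibiting nucleic acid or to the nucleic acid encoding lncRNA2, or in a regulatory sequence of that endogenous DNA sequence (such as a promoter, enhancer, terminator or intron), wherein the at least one mutation brings about a reduction in the transcription or expression of the endogenous DNA sequence or in the activity or stability of the protein or polypeptide it encodes; wherein the at least one mutation of a) and b) causes the property of a haploid inducer to be mediated, or the induction capability of a haploid inducer to be increased, in the identified plant. The change in transcription or expression rate, or in activity or stability, preferably occurs at least in the pollen or pollen tissue of the identified plant.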

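A mutation means a modification at the DNA level and is therefore a genetic and/or epigenetic change. A genetic change may be, for example, the substitution of at least one nucleobase in the endogenous DNA sequence or in a regulatory sequence of the endogenous DNA sequence. If such a nucleobase substitution occurs in a promoter, for example, it can alter the activity of the promoter: a cis-regulatory element may be modified such that the affinity of a transcription factor for the mutated cis-regulatory element changes relative to the wild-type promoter, so that the activity of the promoter carrying the mutated cis-regulatory element is increased or reduced, depending on whether the transcription factor is a repressor or an inducer and on whether its affinity for the mutated element is strengthened or weakened. If such a nucleobase substitution occurs in the coding region of the endogenous DNA sequence, for example, it can lead to an amino acid substitution in the encoded protein, which in turn can change the activity or stability of the protein compared with the wild-type protein. Further examples of genetic changes are the deletion of nucleotides from, and the addition of nucleotides to, the regulatory sequence and/or the endogenous DNA sequence. Das & Martienssen (1995) showed that genes in maize can be regulated by inserting nucleotides through transposon mutagenesis. An epigenetic change can occur through an altered methylation pattern of the DNA.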

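The person skilled in the art knows how a mutation within the meaning of the invention can be achieved by the mutagenesis process in step a) of the method for producing a plant suitable for use as a haploid inducer. Mutagenesis here includes conventional mutagenesis as well as site-specific mutagenesis or "genome editing". In conventional mutagenesis, the modification at the DNA level is not produced in a targeted manner. The plant cells or plants are exposed to mutagenic conditions, as in TILLING, by exposure to UV light or by the use of chemical substances (Till et al., 2004). Another method of random mutagenesis is mutagenesis by means of transposons. The UniformMu project has produced a large, freely available mutant library; the library and the method are described in McCarty et al. (2005). Site-specific mutagenesis makes it possible to introduce modifications at the DNA level in a targeted manner at predetermined positions in the DNA. For example, TALENs (WO 2010/079430, WO 2011/072246), meganucleases (Silva et al., 2011), homing endonucleases (Chevalier et al., 2002), zinc finger nucleases (Lloyd et al., 2005) or a CRISPR/Cas system (Gaj et al., 2013) can be used for this purpose.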

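The identification of the plant in step b) can be carried out, for example, by means of molecular markers or probes. DNA probes are, for example, primers or primer pairs that can be used in a PCR reaction. TILLING mutants, for example, can be verified or identified by sequencing the target gene in a TILLING population, or by additional methods that detect mismatches in the DNA, such as melting point analysis or the use of mismatch-specific nucleases. The invention likewise includes primers/primer pairs usable for this purpose, for example primers for the phospholipase, the phosphoglycerate mutase, the methyltransferase and the lncRNA of the phospholipase. Mutants produced by transposons can also be verified by using transposon-specific primers and target-gene-specific primers in a PCR across the whole population, followed by sequencing of the PCR products. The invention also covers such primers. A change in the expression rate in pollen, for example, can be determined by RT-PCR; a change in stability can be determined, for example, by examining ubiquitin binding sites and predicting changes in the tertiary structure. Recombinant expression of the wild-type protein and the corresponding mutant protein, followed by biochemical activity tests, is also suitable. Additional means and methods that can be used to identify the plant in step b) are known to the person skilled in the art from the prior art.

The invention also relates to molecular markers that demonstrate the presence or absence of a mutation in the endogenous DNA sequence or in a regulatory sequence of the endogenous DNA sequence. Such markers are, for example, based on SNPs and specific for the mutation (examples: KASPar or TaqMan markers).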

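The invention further relates to a plant that can be or has been produced by the aforementioned method, or to a part of that plant, where a part of the plant may be a fertilized or unfertilized seed, an embryo, pollen, a tissue, an organ or a plant cell, the fertilized or unfertilized seed, the embryo or the pollen being produced on said plant and having the at least one mutation in its genome. The invention likewise includes progeny of the plant that carry the at least one mutation and are suitable for use as a haploid inducer. Two examples of plants that have been produced by the aforementioned method are plants (preferably maize or sunflower) which have at least one mutation in an endogenous DNA sequence comprising a nucleic acid that (i) has a sequence selected from SEQ ID NO: 8, 9 and/or 46 or a functional fragment thereof; or (ii) is complementary to a sequence from (i); or (iii) is at least 80% identical to a sequence from (i); or (iv) encodes a protein having an amino acid sequence selected from SEQ ID NO: 21, 22, 23 and/or 48, or a functional part of said protein; or (v) encodes a homolog, analog or ortholog of a protein according to (iv), or a functional part thereof; or (vi) hybridizes under stringent conditions with a sequence from (ii); or which have at least one mutation in a regulatory sequence of said endogenous DNA sequence. The mutation leads to a change in the transcription or expression rate of the endogenous DNA sequence in the identified plant compared with a non-mutagenized wild-type plant, or to a change in the activity or stability of the protein or polypeptide it encodes, wherein the at least one mutation causes the property of a haploid inducer to be mediated, or the induction capability of a haploid inducer to be increased, in the identified plant. The mutation is preferably a change (e.g. a point mutation) in the coding sequence of SEQ ID NO: 8 or 9 that leads to an amino acid substitution between amino acid positions 74 and 78 of SEQ ID NO: 21, 22 or 23, or a modification in the coding sequence of SEQ ID NO: 46 that leads to an amino acid substitution in the corresponding sequence of SEQ ID NO: 48. This may include the mutations according to SEQ ID NO: 49-54. The TILLING-derived mutation in SEQ ID NO: 49 leads to a substitution of the encoded amino acid at position 74, where aspartic acid is replaced by asparagine (D74N); the mutation in SEQ ID NO: 52 leads to a substitution of the encoded amino acid at position 78, where glycine is replaced by arginine (G78R).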
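The substitution notation used above (D74N, G78R) encodes the wild-type residue, the 1-based position and the new residue. A minimal Python sketch of applying such a substitution to a protein string is given below. The function name is hypothetical, and the protein used is a made-up 80-residue placeholder, not SEQ ID NO: 21, 22 or 23 (those sequences are not reproduced in this section).

```python
import re


def apply_substitution(protein: str, mutation: str) -> str:
    """Apply a point substitution written as <wild-type><position><new>, e.g. 'D74N'.

    Positions are 1-based. A ValueError is raised if the notation is malformed,
    the position is out of range, or the wild-type residue does not match.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a valid substitution: {mutation!r}")
    wild_type, position, new = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= position <= len(protein):
        raise ValueError(f"position {position} outside protein of length {len(protein)}")
    if protein[position - 1] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {protein[position - 1]}"
        )
    return protein[: position - 1] + new + protein[position:]


# Placeholder protein (NOT a real sequence): D at position 74, G at position 78.
TOY_PROTEIN = "A" * 73 + "D" + "AAA" + "G" + "AA"
```

Because the wild-type residue is checked, a label whose letter and position do not belong together (for example a glycine substitution written at position 74 of this placeholder, where the residue is D) is rejected rather than silently applied.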

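In addition, the invention relates to a method for isolating a nucleic acid that mediates the property of a haploid inducer in a plant or increases the induction capability of a haploid inducer, comprising the following steps:

a) producing a plant by the method of the invention described above, or providing a plant that can be or has been produced by that method; and

b) isolating from the genome of the plant from a) a nucleic acid comprising the endogenous DNA sequence with the at least one mutation.

The isolation of the nucleic acid in step b) can be carried out by CTAB extraction or by means of DNA-binding columns; the mutation can be verified by sequencing or with molecular markers such as SNP-based KASPar or TaqMan markers or, for insertion or deletion mutations for example, with markers based on length polymorphisms.

The invention also includes a nucleic acid obtained or obtainable by the aforementioned isolation method, and a vector comprising the isolated nucleic acid.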

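In another aspect, the invention also relates to a method for producing a transgenic plant suitable for use as a haploid inducer. The method may comprise the following steps:

a) providing a nucleic acid as described above which, after transcription or expression in a plant, is suitable for mediating the property of a haploid inducer or for increasing the induction capability of a haploid inducer; or providing an isolated nucleic acid as described above comprising the endogenous DNA sequence with at least one mutation; or providing one of the vectors described above,

b) transforming (preferably stably transforming) plant cells by introducing the nucleic acid or vector from a),

c) regenerating transgenic plants from the transformed plant cells from b), and

d) identifying, among the plants from c), a transgenic plant in which the property of a haploid inducer is mediated or the induction capability of a haploid inducer is increased, preferably by means of an altered expression pattern in the pollen or pollen tissue of the identified plant.

The method for producing a transgenic plant suitable for use as a haploid inducer also includes providing two or more of the nucleic acids described above (or different embodiments of the nucleic acid of the invention, optionally in one or more vectors) and transforming plant cells by introducing the two or more nucleic acids. Alternatively or additionally, besides the nucleic acid of the invention, one or more additional nucleic acids known to be usable for producing haploid inducers (e.g. a manipulated cenh3 gene (Ravi & Chan, 2010)) can be provided or transformed.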

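The expression pattern is preferably altered such that

(i) transcription or expression of the introduced induction-promoting nucleic acid, or of the introduced nucleic acid encoding lncRNA1, is increased in the identified plant compared with a wild-type plant (e.g. one regenerated from isogenic, untransformed plant cells), and/or

(ii) transcription or expression of the introduced induction-inhibiting nucleic acid, or of the introduced nucleic acid encoding lncRNA2, is reduced in the identified plant compared with a wild-type plant (e.g. one regenerated from isogenic, untransformed plant cells), and/or

(iii) through post-transcriptional gene silencing, the expression rate of an endogenous DNA sequence having the same nucleotide sequence as the induction-inhibiting nucleic acid is reduced in the identified plant, compared with a wild-type plant (e.g. one regenerated from isogenic, untransformed plant cells), by the double-stranded RNA encoded by the introduced nucleic acid, the introduced nucleic acid being one described above in connection with post-transcriptional gene silencing, and/or

(iv) through transcriptional gene silencing, the transcription or expression rate of an endogenous DNA sequence having the same nucleotide sequence as the induction-promoting nucleic acid, or of the introduced nucleic acid encoding lncRNA1, is increased in the identified plant, compared with a wild-type plant (e.g. one regenerated from isogenic, untransformed plant cells), by the double-stranded RNA encoded by the introduced nucleic acid (described in detail above in connection with transcriptional gene silencing); and/or the transcription or expression rate of an endogenous DNA sequence having the same nucleotide sequence as the induction-inhibiting nucleic acid, or of the introduced nucleic acid encoding lncRNA2, is reduced in the same way.

The transcription rate can be verified, for example, by qRT-PCR. Altered protein stability can be determined, for example, by Western blotting.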

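The invention further relates to a transgenic plant that can be or has been produced by the aforementioned method, or to a part of that plant. A part of the plant may be a fertilized or unfertilized seed, an embryo, pollen, a tissue, an organ or a plant cell, where the fertilized or unfertilized seed, the embryo or the pollen is produced on the transgenic plant and has the nucleic acid or vector of the invention integrated into its genome as a transgene. The invention likewise includes progeny of the transgenic plant that carry the introduced nucleic acid as a transgene and are suitable for use as a haploid inducer.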

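In another aspect, the invention also relates to a method for producing a haploid plant, comprising the following steps:

a) crossing a non-transgenic or transgenic plant of the invention that is suitable for use as a haploid inducer with a plant of the same genus, preferably of the same species,

b) selecting fertilized haploid seeds or embryos, and

c) regenerating haploid plants from the seeds or embryos from b).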

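The plant suitable for use as a haploid inducer is preferably used as the pollen parent and crossed with a seed parent of the same genus, preferably of the same species. It can also be used as the seed parent and crossed with a pollen parent of the same genus, preferably of the same species. The two crossing partners in step a), seed parent and pollen parent, can therefore also be the same individual; the crossing step then represents a selfing.

The selection of haploid fertilized seeds or embryos may include a step of verifying haploidy and a step of separating the haploid fertilized seeds or embryos from the polyploid ones. Haploidy of the fertilized seeds or embryos can be identified phenotypically or genotypically, for example where the inducer carries an embryo-specific visible marker that is visible in all diploid progeny but not in the induced haploid progeny. The ploidy status can also be determined by flow cytometry. In addition, a completely homozygous pattern of molecular markers is indicative of a haploid plant. The separation can, for example, be carried out automatically on the basis of the haploidy verification data.

The invention further relates to haploid fertilized seeds or embryos produced by the cross in step a) of the method for producing a haploid plant, and to a haploid plant that can be or has been produced by that method, or a part of such a plant, where a part of the plant may be a seed, an embryo, a tissue, an organ or a plant cell. The invention likewise includes progeny of the plant. In addition, the invention includes doubled haploid (diploid) plants or parts thereof that are produced by chromosome doubling of the haploid plant or part thereof.

In another aspect, the invention relates to the use of the nucleic acid or vector of the invention for mediating the property of a haploid inducer in a plant or for increasing the induction capability of a haploid inducer, or to the use of the nucleic acid or vector of the invention for producing a plant or transgenic plant suitable for use as a haploid inducer. The invention also includes the use of a plant of the invention as described above that is suitable for use as a haploid inducer to produce haploid fertilized seeds or embryos, or haploid plants. The explanations given above for the subject matter and methods of the invention also apply to the uses mentioned.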

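In another aspect, the invention also relates to an agent for external application to plants. The agent is provided for external application to a plant and is suitable for mediating the property of a haploid inducer in the plant, or for increasing the induction capability of a haploid inducer plant. It is preferably applied at the time of anther formation, pollen formation or fertilization. The agent comprises an RNA having a double-stranded portion, wherein at least one strand of the double-stranded portion has a nucleotide sequence that is homologous or identical to at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25, preferably at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120 or 140, and particularly preferably at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900 or 1000 contiguous nucleotides of the coding sequence of one of the following nucleic acids:

(i) a nucleic acid having, in sense or antisense orientation, a sequence selected from SEQ ID NO: 26, 27, 28, 29, 30 and/or 31, or a functional fragment thereof, or

(ii) a nucleic acid complementary to a sequence from (i), or

(iii) a nucleic acid that is at least 80%, 82%, 84%, 86% or 88%, preferably at least 90%, 91%, 92%, 93%, 94%, 95% or 96%, or particularly preferably at least 97%, 97.5%, 98%, 98.5%, 99% or 99.5% identical to a sequence from (i), or

(iv) a nucleic acid encoding a protein having an amino acid sequence selected from SEQ ID NO: 32, 33 and/or 34, or a functional part of said protein, or

(v) a nucleic acid encoding a homolog, analog or ortholog of a protein according to (iv), or a part thereof, or

(vi) a nucleic acid that hybridizes under stringent conditions with a sequence from (ii).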

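The double-stranded RNA used to produce the agent of the invention can be generated in vitro by methods known to the person skilled in the art. For example, the double-stranded RNA can be produced synthetically, with the RNA being formed directly in vitro. Starting from double-stranded DNA, the double-stranded RNA can also be synthesized, for example, by forming an mRNA transcript that then folds into a hairpin structure. The agent can be used as a trigger for haploid induction in plants. For example, before or after the plant flowers, the agent can be sprayed onto plant tissue, applied externally to plant tissue by other means well known to the person skilled in the art, or used as a spray or in a mixture with other additives. Additives may be, for example, wetting agents, carrier substances or RNA stabilizers such as liposomes.

Surprisingly, the inventors have found that genes or gene products that affect pollen tube growth, the energy metabolism of pollen and/or centromere activity (preferably in germ cells that develop into, e.g., pollen) are particularly suitable for converting a non-inducer into a haploid inducer. Several gene families/protein families of significant importance could be identified for this purpose. Their use for producing haploid inducers has not previously been described or suggested in the prior art. Because pollen production and the fertilization process (including pollen tube growth) follow broadly applicable principles in monocotyledonous and dicotyledonous plants, the technical teaching of the invention gives the person skilled in the art the possibility of developing haploid inducers even for crop plants for which neither an efficient in vivo haploid induction system nor other, cell-culture-based methods for producing doubled haploid plants previously existed. To this end, using the genetic information obtained from the invention, the skilled person can find homologs, orthologs or analogs of the described gene products by routine work and manipulate them as described herein. The technical teaching of the invention is, however, also suitable for further improving existing inducers with respect to their efficiency (i.e. haploid induction rate), thereby making their economical use possible for the first time. Furthermore, the skilled person can combine this technical teaching with other known mechanisms of haploid induction, for example manipulation of the CENH3 protein (Ravi & Chan, 2010), and thus increase efficiency further.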

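Some of the terms used in this application are explained in detail below:

"B73" is a maize breeding line that is used as a model genotype in maize genetics and was used to produce the first maize reference sequence.

"Mediating the property of a haploid inducer", "mediation of the property of a haploid inducer" or equivalent phrases mean that, through use of the nucleic acid of the invention, a plant is enabled to produce fertilized seeds or embryos with a single (haploid) chromosome set when crossed with a plant of the same genus (preferably the same species) that does not have the property of a haploid inducer. The property of a haploid inducer, defined as the absolute haploid induction rate, means that at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9% or 1%, preferably at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5% or 5%, particularly preferably at least 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14% or 15%, or very particularly preferably at least 20%, 25%, 30%, 35%, 40%, 45% or 50% of the fertilized seeds or embryos have a haploid chromosome set.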
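The absolute haploid induction rate defined above is simply the percentage of fertilized seeds or embryos that are haploid. A minimal Python sketch follows; the function names and the example counts are illustrative assumptions, not data from this document.

```python
def induction_rate(haploid_count: int, total_fertilized: int) -> float:
    """Absolute haploid induction rate in percent."""
    if total_fertilized <= 0:
        raise ValueError("total_fertilized must be positive")
    if not 0 <= haploid_count <= total_fertilized:
        raise ValueError("haploid_count must lie between 0 and total_fertilized")
    return 100.0 * haploid_count / total_fertilized


def has_inducer_property(
    haploid_count: int, total_fertilized: int, threshold_percent: float = 0.1
) -> bool:
    """True if the rate reaches the chosen threshold (0.1% is the lowest value listed)."""
    return induction_rate(haploid_count, total_fertilized) >= threshold_percent
```

For instance, 5 haploids among 1000 fertilized seeds (hypothetical counts) give a rate of 0.5%, which satisfies the 0.1% threshold but not the 1% one.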

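"Increase in the expression rate", "increased expression rate", "activation of expression" or equivalent expressions mean that the expression rate of a nucleotide sequence is increased, compared with the stated reference, by more than 10%, 15%, 20%, 25% or 30%, preferably by more than 40%, 50%, 60%, 70%, 80%, 90% or 100%, or particularly preferably by more than 150%, 200%, 250%, 300%, 500% or 1000%. The increase in the expression rate preferably leads to a phenotypic change in the plant in which the expression rate is increased. The altered phenotype may be the mediation of the property of a haploid inducer, or an increase in the induction capability of a haploid inducer.

"Increase in the transcription rate", "increased transcription rate" or equivalent expressions mean that the transcription rate of a nucleotide sequence is increased, compared with the stated reference, by more than 10%, 15%, 20%, 25% or 30%, preferably by more than 40%, 50%, 60%, 70%, 80%, 90% or 100%, or particularly preferably by more than 150%, 200%, 250%, 300%, 500% or 1000%. The increase in the transcription rate preferably leads to a phenotypic change in the plant in which the transcription rate is increased. The altered phenotype may be the mediation of the property of a haploid inducer, or an increase in the induction capability of a haploid inducer.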

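A "functional" fragment of a nucleotide sequence means a segment of a nucleotide sequence that has the same or an equivalent function as the complete nucleotide sequence from which it is derived. A functional fragment may accordingly have a nucleotide sequence that is identical or homologous to the complete nucleotide sequence over at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98% or 99% of its length. In addition, a "functional fragment" of a nucleotide sequence may also refer to a segment of a nucleotide sequence that alters the function of the overall nucleotide sequence, for example in post-transcriptional or transcriptional gene silencing. A functional fragment of a nucleotide sequence may therefore comprise at least 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25, preferably at least 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 120 or 140, or particularly preferably at least 160, 180, 200, 250, 300, 350, 400, 450, 500, 600, 700, 800, 900 or 1000 contiguous nucleotides of the complete nucleotide sequence.

A "functional part" of a protein means a segment of a protein, or a section of the amino acid sequence encoding the protein, that can perform the same or an equivalent function in a plant cell as the complete protein. Over at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98% or 99% of its length, a functional part of a protein has an amino acid sequence that is identical or similar (allowing for conservative and semi-conservative amino acid substitutions) to that of the protein from which it is derived.

"Haploid inducer" also means an in vivo haploid inducer.

The term "heterologous" means that the introduced polynucleotide originates, for example, from a cell or organism with a different genetic background of the same species or from another species, or is homologous to the prokaryotic or eukaryotic host cell but is then located in a different genetic environment and thus differs from any naturally occurring corresponding polynucleotide. A heterologous polynucleotide may be present in addition to the corresponding endogenous gene.

Within the meaning of the invention, "homologs" are to be understood as proteins of the same phylogenetic origin, "analogs" as proteins that perform the same function but have a different phylogenetic origin, and "orthologs" as proteins from different species that perform the same function.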

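"Hybridizing" or "hybridization" is to be understood as a process in which a single-stranded nucleic acid molecule attaches to a nucleic acid strand that is complementary to the greatest possible extent, i.e. forms base pairs with it. Standard methods for hybridization are described, for example, in Sambrook et al. (2001). This should preferably be understood to mean that at least 60%, more preferably at least 65%, 70%, 75%, 80% or 85%, or particularly preferably 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% of the bases of the nucleic acid molecule form base pairs with the nucleic acid strand that is complementary to the greatest possible extent. Whether such attachment is possible depends on the stringency of the hybridization conditions. The term "stringency" refers to the hybridization conditions. High stringency exists when base pairing is more difficult to achieve; low stringency exists when base pairing is easier to achieve. The stringency of the hybridization conditions depends, for example, on the salt concentration or ionic strength and on the temperature. In general, stringency can be increased by raising the temperature and/or lowering the salt content. "Stringent hybridization conditions" are to be understood as conditions under which hybridization occurs predominantly only between homologous nucleic acid molecules. The term "hybridization conditions" thus refers not only to the conditions prevailing during the actual attachment of the nucleic acids but also to the conditions of the subsequent washing steps. Stringent hybridization conditions are, for example, conditions under which predominantly only those nucleic acid molecules hybridize that have at least 70%, preferably at least 75%, at least 80%, at least 85%, at least 90% or at least 95% sequence identity. Stringent hybridization conditions are, for example, hybridization in 4x SSC at 65 °C followed by repeated washing in 0.1x SSC at 65 °C for a total of about 1 hour. The term "stringent hybridization conditions" as used herein may also mean hybridization at 68 °C in 0.25 M sodium phosphate, pH 7.2, 7% SDS, 1 mM EDTA and 1% BSA for 16 hours, followed by two washes with 2x SSC and 0.1% SDS at 68 °C. Hybridization preferably takes place under stringent conditions.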
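The statement above that lowering the salt content raises stringency can be made concrete with a rough Python sketch. Two things here are assumptions that do not come from this document: the standard composition of 1x SSC (0.15 M NaCl plus 0.015 M trisodium citrate, i.e. 0.195 M sodium ions), and the commonly quoted empirical melting-temperature approximation for long DNA duplexes, Tm = 81.5 + 16.6·log10[Na+] + 0.41·(%GC) − 500/L. The function names are hypothetical, and the result is an estimate only.

```python
import math

# Assumption: 1x SSC = 0.15 M NaCl + 0.015 M trisodium citrate (3 Na+ per citrate).
SODIUM_PER_1X_SSC = 0.15 + 3 * 0.015  # mol/L


def sodium_molar(ssc_fold: float) -> float:
    """Sodium ion concentration (mol/L) of an n-fold SSC buffer."""
    return ssc_fold * SODIUM_PER_1X_SSC


def estimate_tm(ssc_fold: float, gc_percent: float, length_nt: int) -> float:
    """Rough duplex melting temperature in degrees Celsius (empirical approximation)."""
    sodium = sodium_molar(ssc_fold)
    return 81.5 + 16.6 * math.log10(sodium) + 0.41 * gc_percent - 500.0 / length_nt
```

Comparing `estimate_tm(4, 50, 500)` with `estimate_tm(0.1, 50, 500)` shows a markedly lower estimated melting temperature in the 0.1x SSC wash, so at the same 65 °C only better-matched duplexes remain paired, which is the sense in which the wash step is the more stringent one.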

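"Increasing the induction capability of a haploid inducer" or "increase in the induction capability of a haploid inducer" means that the haploid induction rate of a plant having the property of a haploid inducer is increased. The number of fertilized seeds that have a haploid chromosome set and are obtained from a cross of the haploid inducer with a plant of the same genus (preferably the same species) lacking the property of a haploid inducer can thus be at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9% or 1%, preferably at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5% or 5%, and particularly preferably at least 6%, 7%, 8%, 9%, 10%, 15%, 20%, 30% or 50% higher than the number of haploid fertilized seeds obtained without use of the nucleic acid of the invention; that is, the haploid induction rate can be increased by these amounts relative to the previously achieved haploid induction rate.

"Operably linked" means linked in the same nucleic acid molecule such that the linked elements are positioned and oriented relative to one another in a way that allows transcription of the nucleic acid molecule to take place. DNA that is operably linked to a promoter is under the transcriptional control of that promoter.

Plant "organs" are, for example, leaves, shoots, stems, roots, vegetative buds, meristems, embryos, anthers, ovules or fruits. Plant "parts" are combinations of several organs, such as a flower or a seed, or parts of an organ, such as a cross-section of a shoot. Plant "tissues" are, for example, callus tissue, storage tissue, meristematic tissue, leaf tissue, stem tissue, root tissue, plant tumor tissue or reproductive tissue. Plant "cells" are to be understood, for example, as isolated cells with a cell wall or aggregates thereof, or protoplasts.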

Within the meaning of the invention, and unless stated otherwise, a "plant" may be any species of the dicotyledons, monocotyledons and gymnosperms. Examples include barley (Hordeum vulgare), sorghum (Sorghum bicolor), rye (Secale cereale), Triticale, sugarcane (Saccharum officinarum), maize (Zea mays), foxtail millet (Setaria italica), rice (Oryza sativa), Oryza minuta, Oryza australiensis, Oryza alta, wheat (Triticum aestivum), durum wheat (Triticum durum), Hordeum bulbosum, Brachypodium distachyon, Hordeum marinum, Aegilops tauschii, beet (Beta vulgaris), sunflower (Helianthus annuus), Daucus glochidiatus, Daucus pusillus, Daucus muricatus, carrot (Daucus carota), Eucalyptus grandis, Erythranthe guttata, Genlisea aurea, Gossypium sp., Musa sp., Avena sp., Nicotiana sylvestris, tobacco (Nicotiana tabacum), Nicotiana tomentosiformis, tomato (Solanum lycopersicum), potato (Solanum tuberosum), Coffea canephora, grapevine (Vitis vinifera), cucumber (Cucumis sativus), Morus notabilis, Arabidopsis thaliana, Arabidopsis lyrata, Arabidopsis arenosa, Crucihimalaya himalaica, Crucihimalaya wallichii, Cardamine flexuosa, Lepidium virginicum, Capsella bursa-pastoris, Olimarabidopsis pumila, Arabis hirsuta, rapeseed (Brassica napus), Brassica oleracea, Brassica rapa, Brassica juncea, Brassica nigra, radish (Raphanus sativus), Eruca vesicaria sativa, sweet orange (Citrus sinensis), Jatropha curcas, soybean (Glycine max) and Populus trichocarpa. The plant according to the invention is preferably a plant of the genus Zea, in particular of the species Zea mays, or is sorghum.

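"Reducing the expression rate", "reduction in the expression rate", "suppression of expression", "reduced expression rate" or equivalent phrases mean that the expression rate of a nucleotide sequence is reduced, compared with the stated reference, by more than 10%, 15%, 20%, 25% or 30%, preferably by more than 40%, 45%, 50%, 55%, 60% or 65%, or particularly preferably by more than 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96% or 98%. It may, however, also mean that the expression rate of the nucleotide sequence is reduced by 100%. The reduction in the expression rate preferably leads to a phenotypic change in the plant in which the expression rate is reduced. The altered phenotype may be the mediation of the property of a haploid inducer, or an increase in the induction capability of a haploid inducer.

"Reduction in the transcription rate", "reduced transcription rate" or equivalent expressions mean that the transcription rate of a nucleotide sequence is reduced, compared with the stated reference, by more than 10%, 15%, 20%, 25% or 30%, preferably by more than 40%, 45%, 50%, 55%, 60% or 65%, and particularly preferably by more than 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96% or 98%. It may, however, also mean that the transcription rate of the nucleotide sequence is reduced by 100%. The reduction in the transcription rate preferably leads to a phenotypic change in the plant in which the transcription rate is reduced. The altered phenotype may be the mediation of the property of a haploid inducer, or an increase in the induction capability of a haploid inducer.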
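The definitions of increased and reduced expression or transcription rates all compare a measured value with a stated reference and express the difference as a percentage. A minimal Python sketch of that comparison follows; the function names are hypothetical, the example numbers are invented, and the unit of the measurements (for example a normalized qRT-PCR value) is left open.

```python
def percent_change(reference: float, observed: float) -> float:
    """Signed change of `observed` relative to `reference`, in percent."""
    if reference <= 0:
        raise ValueError("reference must be positive")
    return 100.0 * (observed - reference) / reference


def is_increased(reference: float, observed: float, more_than: float = 10.0) -> bool:
    """True if the rate rose by more than `more_than` percent."""
    return percent_change(reference, observed) > more_than


def is_reduced(reference: float, observed: float, more_than: float = 10.0) -> bool:
    """True if the rate fell by more than `more_than` percent (100 means no expression)."""
    return -percent_change(reference, observed) > more_than
```

With a reference of 100 and an observed value of 30 (arbitrary units), the change is −70%, so the reduction exceeds the 65% threshold but not the 70% one, because the definitions require the change to be more than the stated value.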

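In connection with the invention, the term "regulatory sequence" refers to a nucleotide sequence that influences the specificity and/or strength of expression, for example by mediating a defined tissue specificity. Such a regulatory sequence may be located upstream of the transcription start site of a minimal promoter, but also downstream of it, for example in a transcribed but untranslated leader sequence or in an intron.

"Suitable for use as a haploid inducer" means that a plant is able to produce fertilized seeds with a single (haploid) chromosome set when crossed with a plant of the same genus (preferably the same species) that does not have the property of a haploid inducer. The property of a haploid inducer, defined as the absolute haploid induction rate, means that at least 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9% or 1%, preferably at least 1.5%, 2%, 2.5%, 3%, 3.5%, 4%, 4.5% or 5%, particularly preferably at least 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14% or 15%, or very particularly preferably at least 20%, 25%, 30%, 35%, 40%, 45% or 50% of the fertilized seeds have a haploid chromosome set.

Designs and embodiments of the invention are described by way of example with reference to the figures and sequences.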

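Figure 1: Genomic arrangement of the identified genes compared with B73 (AGPv02):

SNAREv1 (GRMZM2G179789): increased expression in RWS pollen;

SNAREv2 (GRMZM2G412426): increased expression in RWS pollen;

ITP (inositol-1,4,5-trisphosphate 5-phosphatase) (GRMZM2G106834): reduced expression in RWS pollen;

PL (patatin phospholipase) (GRMZM2G471240): polymorphisms in the coding sequence;

Mito1 (mitochondrial import receptor): present only in RWS;

Mito2: homologous to Mito1 but shortened; present only in RWS;

PGM (phosphoglycerate mutase) (GRMZM2G062320): absent in RWS;

lncRNA: homolog of PL; absent in RWS;

AC213048: anchor gene for sequence comparison;

MT (RNA methyltransferase) (GRMZM2G347808): polymorphisms in the regulatory region.

The GRMZM names refer to the annotation in AGPv02.

Figure 2: RT-PCR of the genes SNAREv1, RNA methyltransferase and patatin phospholipase in the inducer RWS and three non-inducer controls (NI1, NI2, NI3).

Figure 3: RNA-seq data of RWS pollen, mapped to an artificial reference derived from AGPv02 in which the SNARE and phospholipase loci were replaced by the loci from the RWS BACs. (T1: transcript 1, homolog of SNARE2 but with an altered intron structure. T2: homolog of SNARE1; encodes a protein of 133 aa. T3: homolog of SNARE1/2; RT-PCR fragment from Figure 2.)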

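QTL analysis and identification of candidate genes:

In the maize haploid inducer RWS, which traces back to the inducer Stock 6 (Coe, 1959), a major QTL had been identified and fine-mapped on chromosome 1 (bin 1.04). Building on that work, this QTL was to be verified in RWS and analyzed at the molecular level in order to identify and functionally validate the genes within it. The induction capability of a QTL mapping population from RWS x control 1 (maternal inducer x non-inducer) was tested. This showed that the known QTL is very likely also present in the inducer RWS. However, strong allelic shifts in favor of the non-RWS (control 1) allele were also found.

To characterize the locus at the molecular level, several sequencing approaches at the DNA and RNA level were chosen. Because of structural differences between the inducer and the reference genome B73, the classical, reference-based sequencing approaches were successful only to a small extent. Extensive and complex bioinformatic analyses showed that the structural differences would have to be examined with other techniques (Figure 1).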

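In a sequence capture approach, 3 megabases around the identified QTL were sequenced in three Stock 6-derived inducers as well as in RWS and 5 non-inducer controls, and analyzed for inducer-specific polymorphisms such as presence-absence variation, SNPs and indels. This initially identified 16 candidate genes, 3 of which were confirmed by sequencing and by analysis of expression data: a gene encoding an anther-specific patatin phospholipase A2 with an RWS-inducer-specific haplotype; a phosphoglycerate mutase that is absent in the inducer RWS; and an RNA methyltransferase gene with mutations in its regulatory sequence (Figure 2).

BAC libraries were developed for RWS, EMK (another inducer derived from Stock 6) and control 1, and screened with probes distributed along the identified QTL. For a target range of about 150 kb (which Dong et al. (2013) reported as very likely inducer-associated in the inducer UH400), BACs from RWS, control 1 and EMK were extracted and sequenced. The BAC sequences were annotated and compared with comprehensive transcriptome data generated for RWS, control 1, EMK and B73.

As a result, a deletion in the inducers was confirmed. The maternal inducers examined lack a region of 100 kb on chromosome 1 between 68.26 and 68.36 Mb (AGP version 2 of the B73 reference sequence). In addition, outside the target region the inducers show inversions of gene-like regions and large repetitive sequence segments that cannot be aligned with the B73 reference genome or with control 1.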
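The coordinates quoted in this document (the three published fine-mapping intervals from the background section and the 100 kb deletion reported above) can be compared with a few lines of Python. The sketch below only restates those numbers in kilobases on chromosome 1 of B73 AGPv02; the helper names are hypothetical.

```python
# Intervals in kb on chromosome 1 (B73 AGPv02), as quoted in the text.
FINE_MAPPING_KB = {
    "Barret et al. 2008": (66_960, 68_110),
    "Prigge et al. 2012": (62_900, 70_800),
    "Dong et al. 2013": (68_180, 68_430),
}
DELETION_KB = (68_260, 68_360)


def length_kb(interval: tuple[int, int]) -> int:
    """Length of an interval in kb."""
    return interval[1] - interval[0]


def contains(outer: tuple[int, int], inner: tuple[int, int]) -> bool:
    """True if `inner` lies completely inside `outer`."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def overlaps(a: tuple[int, int], b: tuple[int, int]) -> bool:
    """True if the two intervals share at least one position."""
    return a[0] < b[1] and b[0] < a[1]
```

Run on these values, the deletion spans 100 kb and lies inside the Prigge et al. (2012) and Dong et al. (2013) intervals but outside the Barret et al. (2008) interval, which itself does not overlap the Dong et al. interval.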

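Despite the deletion, the phospholipase identified earlier is still present in the inducers, but shows the aforementioned haplotype, which differs strongly from the controls, as well as marked genetic variation in the promoter region. Because of the deletion, the phosphoglycerate mutase identified above is no longer present.

In addition, a non-coding RNA (lncRNA) was identified within the 100 kb deletion. Like the phospholipase, it is expressed in a pollen-specific manner, and it shows 82% homology to the identified phospholipase. The sequence is self-complementary, i.e. the lncRNA forms a hairpin structure. The very high expression rate, the marked homology to the phospholipase and the low SNP density determined by Sanger sequencing point to a regulatory function of this lncRNA for the phospholipase. In theory, an 88-amino-acid truncated version of the phospholipase protein could also be translated from this transcript.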
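The observation above that the lncRNA sequence is self-complementary, and therefore able to fold into a hairpin, can be illustrated with a small Python check. This is a toy sketch: it only counts how many bases pair perfectly (Watson-Crick, no G-U wobble) when the two ends of a sequence are folded onto each other, and the example sequence is invented, not the lncRNA from the inducer line.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (U is read as T)."""
    seq = seq.upper().replace("U", "T")
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def terminal_stem_length(seq: str) -> int:
    """Number of consecutive base pairs formed between the 5' and 3' ends."""
    seq = seq.upper().replace("U", "T")
    stem = 0
    for i in range(len(seq) // 2):
        if COMPLEMENT.get(seq[i]) != seq[-1 - i]:
            break
        stem += 1
    return stem


# Invented example: 6-nt arm, 4-nt loop, then the reverse complement of the arm.
ARM = "GCGCAT"
TOY_HAIRPIN = ARM + "TTTT" + reverse_complement(ARM)
```

For the toy sequence, the whole 6-nt arm pairs with the opposite end, so `terminal_stem_length(TOY_HAIRPIN)` returns 6; real hairpin prediction would use a dedicated RNA folding tool.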

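To measure differences in the expression levels of the genes identified in this region, RT-PCR and RNA-seq experiments were carried out in addition to measuring polymorphisms at the DNA level. Besides RWP (a subline of RWS) as the inducer, three genetically very different control lines were used. Pollen, anthers without pollen, and embryos 6-7 days after self- or cross-pollination were harvested from these plants. The phospholipase showed slightly increased expression in pollen from RWP. The methyltransferase showed weak expression in RWP pollen and no expression in the pollen of the controls. The lncRNA is expressed in a pollen-specific manner and, as expected, is absent in RWP.

In addition, RNA-seq was performed on pollen of the same material to further verify these results.

The transcriptome data (RNA-seq of RWS pollen RNA) were mapped onto an artificial reference in which the QTL region of B73 had been replaced by the RWS BACs. This analysis showed expression of the phospholipase in pollen. The exon-intron structure of this gene corresponds to that in B73, but there is a deletion at the 5' end that leads to a stop codon and therefore to a shortened protein. In addition, three further RWS-specific transcripts were detected upstream and downstream of the phospholipase. A region with two transcripts lies about 60 kb upstream of the phospholipase. The first transcript is non-coding; the second encodes a protein 192 amino acids long that shows homology to a mitochondrial import receptor (Mito1). In B73, this gene (GRMZM2G174696) lies only 15 megabases upstream of the QTL. About 90 kilobases (kb) downstream of the phospholipase is another transcript, which in turn shows high homology to the transcript encoding the 192-amino-acid protein.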

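To capture inducer-specific expression outside the QTL, the RNA-seq data were evaluated genome-wide. Unexpectedly, new candidate genes were identified outside, but close to, the fine-mapped region described above; they had very likely been missed earlier because of technical limitations of the sequence capture approach. About 400 kb upstream of the phospholipase identified in the fine-mapped region lies a gene complex whose expression in RWS pollen differs markedly from the controls (by a factor of at least 2). This gene complex contains three genes: two are annotated as SNAREv genes, are highly homologous to each other and are overexpressed in RWP, and one is annotated as inositol-1,4,5-trisphosphate 5-phosphatase and shows reduced expression in RWP. The cloned transcripts of these genes differ in part from the public annotation, so that they may also encode proteins with different functions or may act as lncRNAs. BACs from RWS covering this locus could be isolated and sequenced. The sequence was integrated into the artificial reference in AGPv02 in order to re-analyze the RNA-seq data (Figure 3). Apart from a transposase, two RNAs (T1 (SEQ ID NO: 55, 56, 57 and 63) and T3 (SEQ ID NO: 60, 61, 62 and 65)) and an RNA with an ORF of 131 amino acids (T2 (SEQ ID NO: 58, 59 and 64)) are expressed at this locus. Apart from the transposase, all transcripts lie within or between the two SNAREv genes. Although they presumably do not have SNARE function themselves, they may be involved in regulating the homologous genes. The sequence capture data for this region show marked structural differences between the inducers, the controls and the reference genome. BAC sequencing confirmed that the inositol-1,4,5-trisphosphate 5-phosphatase gene is absent at the genomic level in the inducer, as is an lncRNA from B73 that shares its transcription start site with the inositol-1,4,5-trisphosphate 5-phosphatase but is read from the opposite strand. Isolation of cDNA from one of the SNARE genes (GRMZM2G179789) also points to complex structural changes in the inducer, since part of the cDNA corresponds to the plus strand and part to the minus strand of the reference.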
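The criterion used above, expression in inducer pollen differing from the controls by a factor of at least 2, amounts to a simple fold-change filter. The Python sketch below shows one way to express it; the function name, the gene labels and the expression values are hypothetical placeholders rather than data from the RNA-seq experiment, and genes with zero expression in either sample (presence-absence cases) are deliberately skipped because a ratio is undefined for them.

```python
def differentially_expressed(
    inducer: dict[str, float], control: dict[str, float], factor: float = 2.0
) -> dict[str, float]:
    """Return genes whose inducer/control expression ratio is >= factor or <= 1/factor."""
    hits: dict[str, float] = {}
    for gene, control_value in control.items():
        inducer_value = inducer.get(gene)
        if inducer_value is None or inducer_value <= 0 or control_value <= 0:
            continue  # presence-absence cases need separate handling
        ratio = inducer_value / control_value
        if ratio >= factor or ratio <= 1.0 / factor:
            hits[gene] = ratio
    return hits


# Hypothetical normalized expression values, for illustration only.
CONTROL = {"geneA": 10.0, "geneB": 10.0, "geneC": 10.0}
INDUCER = {"geneA": 25.0, "geneB": 4.0, "geneC": 12.0}
```

With these placeholder values, `geneA` (ratio 2.5) and `geneB` (ratio 0.4) pass the filter, while `geneC` (ratio 1.2) does not.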

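Gene function

In summary, 7 genes could thus be identified that may be important for in vivo haploid induction, or for in vivo haploid induction capability, in maize.

Four of these genes are particularly important for pollen tube growth:

The two SNAREv genes encode proteins known to be involved in vesicle transport (literature). In the model plant Arabidopsis thaliana, SNAREv proteins have been demonstrated at the pollen tube tip, where they take part in the transport of phospholipids and pectins (literature). The overexpression of SNAREv proteins observed in the maize inducers examined would lead to increased pollen tube growth.

In the model plant tobacco (Nicotiana tabacum), it could be shown that phospholipase A2 also markedly affects pollen tube growth: inhibition of phospholipase A2 leads to inhibition of pollen tube growth (Kim et al., 2011). In the maize inducers examined, the absence of the identified lncRNA with marked homology to the phospholipase may lead to a reduced expression or translation rate of the phospholipase gene, which would promote the speed of pollen tube growth.

In knockout mutants of inositol polyphosphate 5-phosphatase in Arabidopsis thaliana, pollen tubes were shown to grow uninhibited. In the maize inducers examined, the reduced expression level of inositol-1,4,5-trisphosphate 5-phosphatase is therefore likewise likely to lead to accelerated pollen tube growth. The lncRNA identified here in association with inositol-1,4,5-trisphosphate 5-phosphatase has a regulatory effect on the expression rate.

Compared with non-inducers, the maize inducers examined thus show modified regulation/expression rates of these four genes. This perturbation leads to markedly faster pollen tube growth, which is also promoted by a possibly increased energy metabolism resulting from the expression or regulation of mitochondrial transport proteins. This can cause the transport of the germ cells in the pollen tube to become uncoupled from tube growth. As a result, incomplete or incorrect fertilization and subsequent chromosome elimination may occur.

Active centromeres are known to play a key role in chromosome distribution and are characterized and modified by chromatin modification at the DNA or histone level (and, in addition, by transcription, RNA interaction and RNA binding). Altered regulation of the methyltransferase gene may affect the activity of the inducer centromeres during early embryogenesis, ultimately leading to elimination of the inducer genome at an early stage of seed development.

In the inducers examined, the phosphoglycerate mutase gene was shown to be no longer present. The absence of this gene may negatively affect the energy metabolism of the pollen and thus influence fertilization. In addition, the energy metabolism may be affected by mitochondrial membrane proteins.

Any of these genes, alone or in any combination, may be responsible for the haploid induction effect.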

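Producing new in vivo haploid inducers

To develop new inducers in other crop types or in maize non-inducer genotypes, or to increase the induction capability of inducer genotypes, the procedure is as follows:

Identifying the corresponding genes in other crop types or in maize non-inducer genotypes: in monocotyledonous plants such as maize, rice, wheat, rye or barley, the pollen-specific patatin phospholipase is strongly conserved, so its homologs are easy to identify. In contrast, the regulatory lncRNA is absent in most monocotyledons. Where it does exist, however, it can likewise be found on the basis of its marked homology, as is the case for the one present in the maize lines examined. In dicotyledonous plants, other types of phospholipase perform the corresponding task in pollen tube growth. To identify these, an RNA library of pollen or pollen tubes is created and screened for the specific phospholipases of the invention. A patatin phospholipase strongly expressed in pollen has already been identified by RNA-seq of sunflower pollen (SEQ ID NO: 46-48).

The SNAREv genes and the methyltransferase gene do not need to be pollen-specific. For example, one of the identified SNAREv genes (SNAREv1) is not expressed in a pollen-specific manner in maize either; SNAREv1 is not expressed at all in wild-type pollen. In annotated genomes, homologous genes and functional regions of SNAREv proteins can be identified by BLASTP. In unannotated genomes, RNA-seq data will be needed to annotate and select SNARE genes.

To serve as candidate genes, homologous inositol-1,4,5-trisphosphate 5-phosphatases or phosphoglycerate mutases must be expressed in pollen. They can be identified as described above, by BLASTP followed by RT-PCR in pollen, or by annotation of pollen RNA-seq data.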

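Manipulation of the candidate genes

A potential inducer, or an increased induction capability, can be achieved by transgenic expression of the above-mentioned phospholipase and/or SNARE and/or methyltransferase and/or phosphoglycerate mutase and/or lncRNA and/or mitochondrial import receptor. For this purpose, the corresponding genes, including their promoters, can be cloned from the inducer RWS. These genes can be cloned into suitable transformation vectors and transformed into the desired plants.

The activity of pollen-expressed inositol-1,4,5-trisphosphate 5-phosphatases can additionally or exclusively be reduced, for example by RNAi. For this purpose, hairpin constructs are produced, for example, which contain suitable promoters and terminators that allow the hairpin to be transcribed at or before the time of pollen formation. These hairpin constructs can be cloned into suitable transformation vectors and transformed into the desired plants.

Alternatively or additionally, plants carrying mutations (e.g. in the identified genes) that stabilize the phospholipase and/or SNARE and/or methyltransferase, increase their expression or increase their activity can be produced by TILLING, transposon mutagenesis or other mutagenesis methods, or by "genome editing". Structural analysis of the secondary and tertiary structure of mutated proteins may be useful here, for example where the mutated protein shows a more compact structure and therefore offers fewer points of attack for proteases. Regions of the protein that play a role in ubiquitin interaction may also be considered. Mutants affecting the active center can be tested directly for their activity. To verify the function of the phospholipase, the induction capability of several TILLING mutants has been examined. The substitutions D74N (aspartic acid at position 74 replaced by asparagine) and G78R (glycine at position 78 replaced by arginine) resulted in a maternal induction rate of 0.2-0.4%. To manipulate the inositol-1,4,5-trisphosphate 5-phosphatase or the phosphoglycerate mutase alternatively or additionally, knockout mutants, or additional mutants that reduce the activity of these genes, must be sought.

Stock 6-derived inducers can also be improved. This is possible by the transgenic approaches described above and by introducing mutations into the identified candidate genes. Furthermore, additional copies of these genes in the genome can be manipulated by transgenic or non-transgenic approaches, provided they are expressed in pollen.

Testing induction capability: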

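To test the induction capability of a potential inducer, the following options exist, for example:

1. Pollinating a line that carries a visible recessive marker (e.g. for maize, glossy (Bordes et al., 1997) or liguleless (Sylvester et al., 1990)). Progeny expressing the trait are tested for haploidy by flow cytometry.

2. Pollinating a line that is genetically different from the inducer, ideally differing at several markers. These markers are used to identify homozygous plants, which are then tested for haploidy by flow cytometry.

Both options were used to test induction capability.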
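Because induction rates in such tests can be well below 1% (the TILLING mutants mentioned above reached 0.2-0.4%), the number of progeny scored strongly affects how precisely a rate is known. The Python sketch below attaches a Wilson score interval to an observed rate. Choosing the Wilson interval is an assumption made for illustration (the document does not prescribe any statistical treatment), the function name is hypothetical, and the counts in the usage note are invented.

```python
import math


def wilson_interval(haploids: int, scored: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval (as proportions) for a haploid induction rate.

    `z` = 1.96 corresponds to an approximate 95% confidence level.
    """
    if scored <= 0 or not 0 <= haploids <= scored:
        raise ValueError("need 0 <= haploids <= scored and scored > 0")
    p = haploids / scored
    denom = 1.0 + z * z / scored
    center = (p + z * z / (2 * scored)) / denom
    half_width = z * math.sqrt(p * (1 - p) / scored + z * z / (4 * scored * scored)) / denom
    return max(0.0, center - half_width), min(1.0, center + half_width)
```

For example, 3 confirmed haploids among 1000 scored progeny (invented counts) correspond to an observed rate of 0.3%, with an interval of roughly 0.1% to 0.9%, which shows why large progeny numbers are needed to distinguish low induction rates from each other.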

References

Barret, P., Brinkmann, M., & Beckert, M. (2008). A major locus expressed in the male gametophyte with incomplete penetrance is responsible for in situ gynogenesis in maize. Theoretical and Applied Genetics, 117(4), 581-594.

Bordes, J., de Vaulx, R. D., Lapierre, A., & Pollacsek, M. (1997). Haplodiploidization of maize (Zea mays L) through induced gynogenesis assisted by glossy markers and its use in breeding. Agronomie, 17(5), 291-297.

Chen, L., Tu, Z., Hussain, J., Cong, L., Yan, Y., Jin, L., ... & He, G. (2010). Isolation and heterologous transformation analysis of a pollen-specific promoter from wheat (Triticum aestivum L.). Molecular Biology Reports, 37(2), 737-744.

Chevalier, B. S., Kortemme, T., Chadsey, M. S., Baker, D., Monnat Jr, R. J., & Stoddard, B. L. (2002). Design, activity, and structure of a highly specific artificial endonuclease. Molecular Cell, 10(4), 895-905.

Coe, E. H. (1959). A line of maize with high haploid frequency. American Naturalist, 381-382.

Das, L., & Martienssen, R. (1995). Site-selected transposon mutagenesis at the hcf106 locus in maize. The Plant Cell Online, 7(3), 287-294.

Deimling, S., Röber, F. K., & Geiger, H. H. (1997). Methodik und Genetik der in-vivo-Haploideninduktion bei Mais [Methods and genetics of in vivo haploid induction in maize]. Presentation Pflanzenzüchtung, 38: 203-224.

Depicker, A., Stachel, S., Dhaese, P., Zambryski, P., & Goodman, H. M. (1982). Nopaline synthase: transcript mapping and DNA sequence. Journal of Molecular and Applied Genetics, 1(6), 561-573.

Dong, X., Xu, X., Li, L., Liu, C., Tian, X., Li, W., & Chen, S. (2014). Marker-assisted selection and evaluation of high oil in vivo haploid inducers in maize. Molecular Breeding, 1-12.

Dong, X., Xu, X., Miao, J., Li, L., Zhang, D., Mi, X., ... & Chen, S. (2013). Fine mapping of qhir1 influencing in vivo haploid induction in maize. Theoretical and Applied Genetics, 126(7), 1713-1720.

Fire, A., Xu, S., Montgomery, M. K., Kostas, S. A., Driver, S. E., & Mello, C. C. (1998). Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature, 391(6669), 806-811.

Gaj, T., Gersbach, C. A., & Barbas III, C. F. (2013). ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering. Trends in Biotechnology, 31(7), 397-405.

Gurr, S. J., & Rushton, P. J. (2005). Engineering plants with increased disease resistance: what are we going to express? Trends in Biotechnology, 23(6), 275-282.

Kato, N., He, H., & Steger, A. P. (2010). A systems model of vesicle trafficking in Arabidopsis pollen tubes. Plant Physiology, 152(2), 590-601.

Kim, H. J., Ok, S. H., Bahn, S. C., Jang, J., Oh, S. A., Park, S. K., ... & Shin, J. S. (2011). Endoplasmic reticulum– and Golgi-localized phospholipase A2 plays critical roles in Arabidopsis pollen development and germination. The Plant Cell Online, 23(1), 94-110.

Lloyd, A., Plaisier, C. L., Carroll, D., & Drews, G. N. (2005). Targeted mutagenesis using zinc-finger nucleases in Arabidopsis. Proceedings of the National Academy of Sciences of the United States of America, 102(6), 2232-2237.

McCarty, D. R., Mark Settles, A., Suzuki, M., Tan, B. C., Latshaw, S., Porch, T., ... & Curtis Hannah, L. (2005). Steady-state transposon mutagenesis in inbred maize. The Plant Journal, 44(1), 52-61.

Odell, J. T., Nagy, F., & Chua, N. H. (1985). Identification of DNA sequences required for activity of the cauliflower mosaic virus 35S promoter.

Prigge, V., Xu, X., Li, L., Babu, R., Chen, S., Atlin, G. N., & Melchinger, A. E. (2012). New insights into the genetics of in vivo induction of maternal haploids, the backbone of doubled haploid technology in maize. Genetics, 190(2), 781-793.

Ravi, M., & Chan, S. W. (2010). Haploid plants produced by centromere-mediated genome elimination. Nature, 464(7288), 615-618.

Röber, F. K., Gordillo, G. A., & Geiger, H. H. (2005). In vivo haploid induction in maize - performance of new inducers and significance of doubled haploid lines in hybrid breeding. Maydica, 50(3/4), 275.

sambrook,j.,russell,d.w.,&russell,d.w.(2001).molecularcloning:alaboratorymanual(3-volumeset)(vol.999).coldspringharbor,newyork:coldspringharborlaboratorypress.

shibuya,k.,fukushima,s.,&takatsuji,h.(2009).rna-directeddnamethylationinducestranscriptionalactivationinplants.proceedingsofthenationalacademyofsciences,106(5),1660-1665.

silva,g.,poirot,l.,galetto,r.,smith,j.,montoya,g.,&duchateau,p.(2011).meganucleasesandothertoolsfortargetedgenomeengineering:perspectivesandchallengesforgenetherapy.currentgenetherapy,11(1),11.

Sylvester, A. W., Cande, W. Z., & Freeling, M. (1990). Division and differentiation during normal and liguleless-1 maize leaf development. Development, 110(3), 985-1000.

Till, B. J., Reynolds, S. H., Weil, C., Springer, N., Burtner, C., Young, K., ... & Henikoff, S. (2004). Discovery of induced point mutations in maize genes by TILLING. BMC Plant Biology, 4(1), 12.

Twell, D., Yamaguchi, J., Wing, R. A., Ushiba, J., & McCormick, S. (1991). Promoter analysis of genes that are coordinately expressed during pollen development reveals pollen-specific enhancer sequences and shared regulatory elements. Genes & Development, 5(3), 496-507.

Venter, M. (2007). Synthetic promoters: genetic control through cis engineering. Trends in Plant Science, 12(3), 118-124.

Wang, Y., Chu, Y. J., & Xue, H. W. (2012). Inositol polyphosphate 5-phosphatase-controlled Ins(1,4,5)P3/Ca2+ is crucial for maintaining pollen dormancy and regulating early germination of pollen. Development, 139(12), 2221-2233.

Zhao, Y., Zhao, Q., Ao, G., & Yu, J. (2006). Characterization and functional analysis of a pollen-specific gene st901 in Solanum tuberosum. Planta, 224(2), 405-412.

WO/2010/079430 (Bonas et al.) Modular DNA-binding domains and methods of use.

WO/2011/072246 (Regents of the University of Minnesota) TAL effector-mediated DNA modification.

WO2012/030893 (Monsanto Technology LLC) Molecular markers associated with haploid induction in Zea mays.

Sequence Listing

KWS SAAT SE

Haploid inducer

KWS0220PCT

DE102015004187.8

2014-11-12

65

PatentIn version 3.5

1

47944

DNA

Zea mays

misc_feature

(12578)..(12677)

n is a, c, g, or t

misc_feature

(22322)..(22421)

n is a, c, g, or t

misc_feature

(42238)..(42337)

n is a, c, g, or t
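
The misc_feature ranges above mark runs of undetermined bases (n), and each sequence row in this listing carries 60 bases followed directly by a running position counter. The following is a minimal sketch, not part of the original listing, of how such rows could be read and how the n runs could be compared with the stated ranges. The function names `parse_rows` and `n_runs` are hypothetical, and the demonstration rows are synthetic; they are only shaped like the rows around the first misc_feature and do not reproduce the listing's data.

```python
import re

# A row is a run of bases optionally followed by a fused position counter.
ROW = re.compile(r"([acgtn]+)(\d*)")


def parse_rows(rows, start=1):
    """Concatenate sequence rows and check any trailing position counter.

    `start` is the 1-based position of the first base in `rows`
    (an assumption the caller must supply for a fragment).
    """
    parts = []
    length = 0
    for raw in rows:
        row = "".join(raw.split()).lower()  # drop all whitespace
        if not row:
            continue  # skip blank separator lines
        match = ROW.fullmatch(row)
        if match is None:
            raise ValueError(f"unexpected row: {raw!r}")
        bases, counter = match.groups()
        parts.append(bases)
        length += len(bases)
        expected = start - 1 + length
        if counter and int(counter) != expected:
            raise ValueError(f"counter {counter} != position {expected}")
    return "".join(parts)


def n_runs(seq, start=1):
    """Return 1-based inclusive (first, last) positions of each run of n."""
    return [(m.start() + start, m.end() + start - 1)
            for m in re.finditer(r"n+", seq)]


if __name__ == "__main__":
    # Synthetic rows: 37 known bases, then 100 n, then 43 known bases.
    rows = [
        "acgtacgtac" * 3 + "acgtacg" + "n" * 23 + "12600",
        "",
        "n" * 60 + "12660",
        "",
        "n" * 17 + "acgt" * 10 + "acg" + "12720",
    ]
    seq = parse_rows(rows, start=12541)
    print(n_runs(seq, start=12541))  # [(12578, 12677)]
```

Checking the counter on every row is a deliberate choice: a row that lost or gained a base during extraction is reported immediately instead of silently shifting every later coordinate.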

1

atggggagcagtgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtca60

cttcttgctcatgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaat120

gccaaagaagttctccgtaagttcaagcacccttgtaacttgtgctttatatatatgatt180

ctcaatttatcattgacttttcctaatggctttcaacacagggcaccatgggtcttattc240

tacactgattgcccgcagaaagctgaattctctaacatatgggatactaacaaaactgca300

gctcccatgattcatcgggagatagtcaggtttggttactgtgagtcttcatcttcaaaa360

tcctcaagtgacaactctgaagagcgagatgaatgcggtatagttgatgcactggtgaca420

acaataacacaggtgaggatctcagagcccaagaggagtcggctgttcagcctacccaac480

gggttgttgccgccaagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtgg540

ctttctaacttgacctacctagctcccatccccatgcatgtataaacgacatttggggaa600

tgggtagaaaagcagagattagggattttcgtttccgtcggtgcagttttggtgttccaa660

tggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaat720

tttatgtttttgttttgtgtctgcagattcggaagatggacttggaggcaaggagcctac780

agcctagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaaca840

acgtcaagagtgagctcaagaggatatttgcgcccaatgccaggcaggctacccgggagg900

agctcctagagtttggaatggctgatactctcgctgtgagctaatgctaggacttgactg960

tgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtat1020

tcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtgtcat1080

tttgtcatgtcattacacatggttaggatacatacttaagtttctaacgtaggcgtccac1140

acaacggattggtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcag1200

gcacccttccccgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttc1260

cactcttcccccgccgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgt1320

ggccacactgttcgacgagcgagtcatagagctgctgagcgtgctcgctgatgcggcggt1380

ggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaa1440

ccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtc1500

tccaccacttccttcatcgagggccgactgcttggctcgctggccaggcagccgagcatt1560

agttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgg1620

gtctatccgcagagaggaagaagcagaagctctccgagatccaatccggcgttgaggaag1680

ctgaatcgctggtaaatagatgtcgcgacgcgttctgttttggggatccccttggctaac1740

gggacatacgacatttggggaatgggtagaaaagcagagattagggatttttcgtttccg1800

tcggtgcagttttggtgttccaacagagttgcgagatgtttatgtgccttagtcttcaat1860

ttgggggttgggggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaat1920

ggacctggaggcaaggagcctacagcctagcattaaggctggtttgcttgcaaagccgag1980

ggattataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa2040

tgccaggtaggctacccgggaggagctcgtggagtctagaatggctgatactctcgcagt2100

gagctaatgctaggacttgactgtgtctacgagactgctcctaataataaactgaagaaa2160

gcaaaagaaatcattcaacgtattcgccgaagagaactctacaaggtagtatgatgcttt2220

aattgctcatatacaagtgtcattttgtcatgtcattacacatggttaggatacatactt2280

aagtttctaacgtaggcatccacacaatggattggtgcacggttctgccgatgtatccca2340

cgcacgcgcatggaaggaggcaggcacccttccctgccgccccggatctcgcgccagcca2400

tcgccctaccccgcctgcccttccactcttccccctgaaagtcgcatagagggggggtga2460

atagggcgaatctgaaatttacaaacttaagcacaactacaagccgggttaacgttagaa2520

atataaacgagtccgagagagagggcgcaaaacaaatcatgagcaaataaagagtgagac2580

acgatgatttgttttaccgaggttcggttcttgcaaacctactccccgttgaggtggtca2640

caaagaccgggtctctttcaaccctttccctctctcaaacggtcacttagaccgagtgag2700

cttctcttctcaatcaaacggaacacaaagttcccgcaaggaccaccacacaattggtgt2760

ctcttgccttggttacaattgagtttgatcacaagaagaatgagaaagaaaagaagcgat2820

ccaagcgcaagagctcaaatgaacacaaatgtcgctctctctagtcactatttgatttgg2880

agtgattccggacttgggagaggatttgatcttttggagtgtctagaattgaatgctata2940

gctcttgtaatatgttgaaggtgggaaacttggatgccattgaatgtggggtggttgggg3000

tatttatagccccaaaacaccaaaaaaggccgttggaaggctgctctcgcatggcgcacc3060

ggacagtccggtgcgccagccacgtcagcagaccgttggggttcgaccgttggagctctg3120

acttgtggggcctctgggctgtccggtggtgcaccagacaggtcctgtaggatgtctggt3180

gcgccaactgcacgtgctctgtcctctgcgcgcgcaggcgcgcattaaatgcgttgtagt3240

caaccgttgcgcgcgaagtagccattgctctgctggcacaccggacagtccggtgaatta3300

tagcggagcgccctctgattttcccgaaggtagcgagttcagcttcgagtgccctggtgc3360

accggacactgtccggtgcgccaaaccagggtgccttccgggatgtcttttgctctcttt3420

gtttgaaccctttcttggtctttttattggcttattgtgaacctttgacacctgtaaaac3480

ttatagactagagcaaactagttagtccaattatttgtgttggacaattcaaccaccaaa3540

atcaattaggaaataggtgtgagcctaattccctttcaatctccccctttttggtgattg3600

atgccaacacaaaccaaagcaagtatagaagtgcataattgaactagtttgcataatgta3660

agtgcaaaggttacttagaattgaaccaataaatattttcataagttatgcatggattgt3720

ttctttattttcatcattttggaccacgcttgcaccacatgttttgtttttgcaaatcct3780

tttgtaaatagtcaaaggtaaatgaataagattttgagaagcattttcaaaatttgaaat3840

tttctccccctgtttcaaatgcttttcctttgacttaaacaaaactcccccctcaaaaat3900

cctactcatagtgttcaagagggttttaagatatcaattttgaaaatgctactttctccc3960

ccttttgaatataataagatatcaattgaaaaattcatcattttaaaaccttttgaaaat4020

gggtggtggtgcggtccttttgctttgggctaatactttctccccctttggcatgaatcg4080

ccaaaaacgaatacttgagtgaaatataagcccctttaactactttctcctgctttggcg4140

aacataatatgagtgaagattataccaaagttggagagttgcttgaagcgatggtgaagg4200

atgagttatggagtggaggttaagcctttgtcttcgccgaagattccaattccctttcaa4260

tacacctatgacttggttgaaaatatacttgaaaacacattagtcatagcacatgaaaga4320

gatatgatcaaaggtatattaatgagctatgtatgcaagacatcaaaagaaattcctaga4380

atcaagaatatttagctcgtgtctaagtttgttcatctagtggcttggtaaagatatcag4440

ctaattgttccttagtgttaatataggcaatctcgatatctccctttttttggtgatccc4500

ttaggaaatgataccgaatggctatgtgtttagtgcggctatgctcaacgggattatccg4560

ccatgcggattgcactctcattatcacatagaagaggaactttggttaatttttaaccat4620

agtccctaagggtttgcctcatccaaagtaattgtgcgcaacaatggcctgcggcaatat4680

actcggcttcggcggtagaaagagctacggaattttgcttctttgaagcccaagacacca4740

gggaccttcccaagaactggcaagtcctcgatgtactctttctattaattttacaccccg4800

cccaatcggcatccgaataaccaatcaaaatcaaaatgtggatcccgtaggataccaaag4860

cccaaacttaggagtatgaactaaatatctcaagattcgttttacggccgtaaggtgagc4920

ttccttagggtcggcttggaatcttgcacacatgcatacggaaaagcataatatccggtc4980

gagatgcacataaatagagtaaagagcctatcatcgaccggtatacctttttgatcgacg5040

gatttacctcccgtgtcgaggtcgagatgcccattggttcccatgggtgtcttgatgggt5100

ttggcatccttcatcccatacttgtttagaatgtcttgaatgtacttcgtttggctaatg5160

aaggtgccctcttagcgttgcttcacttgaaatcacaagaagtacttcaactcccccatc5220

atagacatctcgaatttctgtgtcatgatcctactaaattcctcacatgtagattcatta5280

gtagacccaaatataatatcatcaacataaatttggcatacaaacaaatcattgtcaaga5340

gttttagtaaataaagtaggatcggcctttccgactttgaaaccattagtgataaggaaa5400

tctctaaggcattcataccatgctcttggggcttgcttgagcccataaagcgccttagag5460

agtttatatacgtgattagggtactcactatcttcaaagccgggaggttgctcaacatag5520

acctcttccttgattggtccgttgaggaaggcacttttcacgtccatttgataaagcttg5580

aagccatggtaagtagcataggcaagtaatatgcgaattgactcaagcctagctacgggt5640

gcataggtttcaccaaaatccaaaccttcgacttgtgaataacccttggccacaagtcgg5700

gctttgttccttgtcaccacaccatgctcatcttgtttgttgcggaagacccacttggtt5760

cctacaacattttggttaggacgtggaactaaatgtcatacctcattcctagtgaagttg5820

ttgagctcctcttgcatcgccaccacccaatccgaatcttgaagtgcttcctctaaccta5880

tgtggctcaatagaggaaacaaaagagtaatgttcacaaaaatgagcaacccgagatcta5940

gtagttacccccttatgaatgtcgccgaggatggtgtcgacggggtggtctcgttggatt6000

gcttggtggactcttgggtgtggcgggcgttgctcgtcctccttgtcttgatcatttgca6060

tctcccccttgatctatgccgtcatctagaggtggctcatttgattgatcttcttcttca6120

tcaacttgagcttcatcctcattttgagtcggtggagatgcttgcatggaggaggacggt6180

tgatcttgtgtatttggaggctcttcggattccttaggacacacatccccaatggacatg6240

ttccttagcgcgatgcatggagcctcttcatcacctatctcatcaagatcaacttgctct6300

acttgagagccgttagtttcatcaaacacaacgtcacatgaggcttcaactagtccagtg6360

gacttgttaaagaccctatatgcccttgtgtttgagtcataaccaagtaaaaagccttct6420

acagtcttaggagcaaatttagattttctacctcttttaacaagaataaagcatttgcta6480

ccaaaaactctaaaatatgaaatattgggctttttaccggtgaggagttcgtatgatgtc6540

ttcttgaggattcggtgtagatataaccggttgatggcgtagcaagcggtgttgactgcc6600

tcggcccaaaaccgatccgaagttttgtactcatcaagcatggttcttgccatgtccaat6660

agagttcgattcttcctctccactacaccattttgttgtggcgtgtagggtgaagagaac6720

tcatgcttgatgccctcctcctcaagaaagccttcgatttgagagttcttgaactccgtc6780

ccgttgtcgcttctaattttcttgatccttaagccgaactcattttgagcccgtcttaag6840

aatccctttaaggtctcttgggtttgagatttttcctgtaaaaagaatacccaagtgaag6900

cgagaataatcatccacaataactagacagtacttactcccgccgatgcttatgtaagcg6960

atcgggccgaatagatccatgtgtaggagctccagtggcctgtcggtcgtcattatgttc7020

ttatgcggatgatgagagccaacttgcttccctgcttggcatgcgctacaaatcctgtct7080

ttctcaaaataaacatttgtcaatcctaaaatgtgttctccctttagaagcttgtgaaga7140

ttcttcatcccaacatgtgcaagtcggcgatgccagagccaacccatgttagtcttagca7200

attaagcaagtgtcgagttcagctctatcgaaatctaccaagtatagctgaccctctaac7260

actcccttaaatgctattgaatcatcacttcttctaaagacagtgacaccaacatcagta7320

aaaagacagttgtagcccatttgacacaattgggatacagaaagcaaattgtaatctaaa7380

gaatctacaagaaaaacattggaaatagaatggtcaggagatatagcaattttacccaat7440

cctttgaccaaaccttgatttccatccccaaatgtgatcgctctttggggatcttggttt7500

ttctcatatgaggagaacatctttttctccccagtcatgtggtttgtgcacccgctgtcg7560

agtatccagcttgagcccccggatgcataaacctacaaaataattttagttcttgatttt7620

aggtacccaaatggttttgggtcctttggcattagacacaagaactttgggtacccaaac7680

acaagtcttggagcccttgtgcttgcccccaacatatttggcaactaccttgccggattt7740

gttagtcaacacataagatgcatcaaaagttttgaatgaaatgtcatgatcatttgatgc7800

actaggagttttctttctaggcaacttggcacgggttggttgcctagagctagatgtctc7860

acccttatacataaaagcataattaggaccagagtgagacttcctagaatgaattctcct7920

aattttgttctcgggataaccggcagggtataaaatgtaaccctcgttatcctgaggcat7980

gggagccttgcccttaacaaagttggacaatcttttaggaggggcactaattttgacatt8040

gtttcccctttggaagccaatgccatctttaatgcccgggcgtctcccattataaagcat8100

gccacgagcaaatttaaatttctcattttctaagttgtgctcggcaattttagcatctag8160

ttttgctatatgatcattttgttgtttaattaaggtcatatgatcatgaatagcattaac8220

atcaacatctctacatctagtacaaatagatacatgctcaacagtagatgtagagggttt8280

gcaagaattaagttcaacaatcttagcatgaagaatatcatttttatccctaagatcgga8340

aattgtagttttgcaaacatcaaaatctttagccttagcaattaaattttcatttttctg8400

ttctaaggctagcaagagaaatgtttaattcttcaatcctagcaagcaaatcatcattat8460

tatctttaggattgggaattgaaacattacaaacatgtgaatcaaccttagcatttaaac8520

tagtattttcatgtctaaggttgtcaatcatctcatggcaagtgcttagctcactagata8580

gtttttgacatttttctacttctagggcgtaagcatttttaaccttaacatgtttcttgt8640

tttccttaataagacaatcctcttgggaatccaaaaggtcatctttttcatgaatagcac8700

taattaattcatttaatttttccttttgttccatgttaagattagcaaaaagggtacgca8760

agttatcctcctcatcactagcattttcatcactagaggtttcatatttagtggaggatc8820

ttgattttaccttcttccttttgccgtcctttgccatgaggcacttgtggccgacgttgg8880

ggaagaggagtcccttggtgacggcgatgttggcggcgtcctcgtcgtcggaggagtcgc8940

ttgagctttcgtcggaatcccactcccgacaaacatgggcatcgccgcccttcttcttgt9000

agtacttcttcttctcctttcttctccccttcttgtcgtcgccacggtcactgtcactag9060

atatgggacatttagcaataaaatgaccgggcttaccacatttgtagcaaaccttcttgg9120

agcgggacttgtagtctttccccctcctttgtttgaggatttggcggaagctcttgatga9180

cgagcgccatttcctcattgtcgagcttggaggcgtctattggttgtcgacttggtgtag9240

cctcctccttcttttcttccgttgccttgaatgcgacgggttgagcttcggatgtggtgg9300

gttcgtcaagctcattgatctttctcgagccttcgatcatgcactcaaaacttacaaaat9360

gcccgataacttcctcgggggtcattttagtatatctaggattaccacgaattaattgaa9420

cttgagtagggttaaggaaaataagtgatcttagaataaccttaaccacctcgtggtcgt9480

cccactttatgctcccgaggttgcgcacttggttcaccaaggtcttgagccggttgtaca9540

tgtgttgtggctcctcccctttgcgaagccgaaaccgaccgagctccccctcgatcgttt9600

cccgcttggtgatctttgtgagctcgtctccctcgtgcgcggttttgagcacatcccaaa9660

cctccttggcgctcttcaacccttgaactttgttatactcctctctacttaaagaggcga9720

ggagtattgttgttgcttgagagttgaagtgctcgatttgggccacctcatcctcatcat9780

aatccttatcccctacggatggtacctgtgcaccaaactcaacaacatcccatatacttt9840

tgtggagtgaggttagatgaaatcgcattaaatcactccacctagcataatcttcaccat9900

caaaagttggtggtttgcctaacgggacggaaagtaaaggtgcatgtttagaaatgcgag9960

ggtagtgtaggggaatcttactaaacttcttacgctcttggcgtttagaagttacggagg10020

gcgcgtcggagccggaggttgatgttgatgaagtgtcggtctcgtagtagaccactttcc10080

tcatcctcttttgtttgtccccactccgatgaggcttgtgggaagaagatttttccttct10140

tctctttgtggtgagaagaagatttcttctccttccctttgttggaggagctcttcttct10200

tctccctccgtttggtgcgggactcttccaatgaagtgctctcgttgcttgtagtgggct10260

tttcgccggtctccatctccttcttggcgtgatctcccgacatcacttcgagcggttagg10320

ctctaacgaagcaccgggctctgataccaattgatagtcgcctagagggggggtgaatag10380

ggcgaaactgaaatttacaaatataaacacaactacaagccgggttagcgttagtaatga10440

agaaacgagtccgcgagagagggcgcaaaacaaatcgcaagcaaatgaagagtgtgacac10500

gtggatttgttttaccgaggttcggttctcgcaaacctactccccgttgaggaggccaca10560

aaggcccgggtctctttcaacccttccctctctcaaacggtccctcggaccgagtgagct10620

tctcttctctaatcaaagttgggaacaaaacttcccaacaagggccaccacacaattggt10680

gcctcttgccttgattacaatgggtttttgatcacaagaacaagtgcgaaagaaaagaag10740

caatccaagcgcaagagctcaaaaagaacacggcaaatctctctctctaatcactaaagc10800

cttttgtggaattggagaggatttgatctcttttggtgtgtctagaattgaatgctagag10860

ctcttgtagtagttgagaagtggaaaacttggatgcaatgaatggtggggtggttggggt10920

atttatagccccaaccaccaaacttgaccgttggctgggttgtctgttcgatggcgcacc10980

ggacagtccggtgcacaccggacagtccggtgcccctgccacgtcatcactgccgttgga11040

ttctagccgttgaagcttctgacttgtgggcccgcctgggtgtccggtgcacaccggaca11100

tctactgttccttgtccggtgtgccggagtgggcgcgcctgacatctgcgcgcgcagagc11160

gcgcattaaatgcgcggcagagagccgttggcgcggaaatagccgttgctctcgagtcgc11220

accggacagtccggtgcacaccggacagtccggtgaattatagtggacgggccgatggct11280

tttcccgaagctggcgagttcctgaggccgacctcccttggcgcaccggacactgtccgg11340

tgtacaccggacagtccggtgaattatagcggagtcgcctctggcaattctcgaaggggg11400

cgagttggagcttgagtcctctggtgcaccggacgctgtccggtgtacaccggacagtcc11460

ggtgctctcagaccagagggccttcggttcccactatgctcctttgttgaatccaaaaac11520

ttggtccttttattggctgagtgtgaaccttttacacctgtgtaatctataaacttgtgc11580

aaacttagttagtccaattgtttgtgttgggcaattcaaccaccaaaattaattagggac11640

taggtgtaagcctaattccctttcagttttcccgggcggtcatccatagaacaggtcctt11700

acggagaggcactcgagaaaccgctcgagcccccttgaagaccacaagcacaacatcata11760

ataagagaagggaaaacagcgtatcatagataatctcatcatgttcattgattagagtta11820

agcaatagcataaagctaaacagtaataatccaacccaaataggtgaacaaggacatgga11880

taacaaaagctagtcaatccttaggcataaatgtgtaaagcgggaggtgaattaaataat11940

gaataggacatagataggtcaagggacacttgcctccaccaaccgactgctgctcagggg12000

cttctcctgcgggttcctcgggctcttcaaccggatcgttctctatgcgagcgcaaacat12060

acacacatccacatatttaataccaaagaacagtacaccatacaatagaatgcaataagt12120

aaacagacgttccacgcgggctcgcgagtacggttaagagagaaagaggaaaagacagtc12180

gagaaacgatcacgttgcatgattataaattagccactagcttaatggaaggaaatttaa12240

tgtagacactatgtttagcgtaaagtaaagtcatgtttcatgtctaattattataagcag12300

gtggagacaaataaaaggatagccgcgcggcgagacgcgcgacaaagctctctaaaacaa12360

attaagaagttaacgactcgtcgcgcgactgagcacgcagcgagacacttcgccttagtt12420

aagaggagacgttaagcgtcgcgcgacgaagcgcacgacggcatacgtcgactaaactga12480

gtccaaagtggaacgtcgcgtcaattcccacgcggcgttacaccttaaacaacctgaaac12540

aaaatgaacgaatcaagcctgatcatccgccccccccnnnnnnnnnnnnnnnnnnnnnnn12600

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn12660

nnnnnnnnnnnnnnnnnaccgttcaacaaacaccgcccggggttccaaccgaccaggctc12720

cagatcgcccgaccgatcccccgtcccggcctcgcccccccccttcttcgctcgcccgcg12780

ctcgcggtgattgctcgttaagaagcgtgttgcgtcgcgcgcttcgccgcgcgacggatt12840

tatctaaaattcagattctatcctgtgttgcgtcgtgcgtttcgtcgcgcgacgatccat12900

tttgtttcaggttgtttaaggtgtaacgccgcgtgtgtattcacgcgacgttccactttg12960

gactcagtttagtcgacgtatgccgtcgtgcgcttcgtcgcgcgacgcttaacgtctcct13020

cttaactaaggcgaagtgtctcgctgcgtgctcagtcgcgcgacgagtcgttaacttctt13080

aatttgttttagagagctttgtcgcgcgtctcgccgcgcggctatccttttatttgtctc13140

cacctgcttataataattagatatgaaacatgactttactttacgctaaacatagtgtct13200

acattaaatttccttccattaagctagtggctaatttataatcatgcaacgtgatcgttt13260

ctcgactgtcttttcctctttctctcttaaccgtactcgcgagcccgcgtggaacgtctg13320

tttacttattgcattctattgtatggtgtactgttctttggtattaaatatgtggatgtg13380

tgtatgtttgcgctcgcatagagaacgatccggttgaagagcccgaggaacccgcaggag13440

aagcccctgagcagcagtcggttggtggaggcaagtgtcccttgacctatctatgtccta13500

ttcattatttaattcacctcccgctttacacatttatgcctaaggattgactagcttttg13560

ttatccatgtccttgttcacctatttgggttggattattactgtttagctttatgctatt13620

gcttaactctaatcaatgaacatgatgagattatctatgatacgctgttttcccttctct13680

tattatgatgttgtgcttgtggtcttcaagggggctcgagcggtttctcgagtgcctctc13740

cgtaaggacctgttctatggatgaccgcccgggaaaacagtgcaaccatgagggtggaat13800

ggggtgcccttagctgaataattagaggatccggggtgtagttcacttagccgtcgtgcc13860

gtcaatggggctcggtgtatgcggctcgctctgccaagtttgggttcgccccttggggag13920

gagtgcggtgcatttaggaaacctaacgggtggctacagtcccggggaatctttgtaaag13980

gctatgtagtgatgccctgctgggtcaccttggtagtgatcaatggagagtcatgatctc14040

cgggtagaatgggaatcacggcttgtgggtaaagtgcacaacctctgcagagtgtttgaa14100

aactgatatatcagccgtgctcacggttatgagcggccaagggagctccagtgattagtg14160

gtacttgatcagagatactttggtacaggtggttatgagatcgatgattctggttatgac14220

tatgatgctggtaagtggtactctttccgtttggaaaggagtacgtttgggttaataact14280

tgggttaatgctaaaacttggctttctattagtaaataataatctgaccaactaaaagca14340

actgcttgacttatccccacataaagctagtccactacagccaaacaggatacttgctga14400

gtatgttgatgtgtactcacccttgctctacacaccaaaccccccccccatccccaggtt14460

gtcagcattgcaaccactgctcagtcgaagatgaagctgtggaaggagacttccaggagt14520

tccaagattacgatgagttctaggtgtgggttagcggcaacccccagtcggctgcctgtg14580

aaggccgcggttatctacgtttcttttccgcactttgatttattgtaagaactatatgga14640

cgtctcagacgtatgatgtaatcgactatttcccttagtaatactattttgagcactgtg14700

tgatgatgtccatgttatgtaactgctgtgtacgtgaataactgatcctggcacgtacat14760

ggttcgcattcggtttgccttctaaaaccgggtgtgacataagtggtatcaaagccgtgc14820

tgactgtaggaccgctaacctagaatagaatggtcgctctaaggactatagacctctgtc14880

tctgccttgactttgatatcccttcaaaagttggtcataccgaccaaacctatgttctac14940

tatatattataccttgctgaaaatcatgttttattccagtccttcatttacttatgattc15000

attatttgctggtcatattaattctgttctcacctttttgcttgcgatgtcttttgtaga15060

tggctcgacttagacacactgcacgaaagtcagtcatccccttcttaccctcccgccttg15120

ctgagcgtccgcttcgccgtcccgtggccggacagtccagccacttggagagactacacc15180

accgcctgcgtgaggagcaggagcgtcgacgacaggagcagcagagctcttccttctcgc15240

tccaccaggagatagagtctgtgaggagctgctcccctgtgcttcctctggagccgcccc15300

ctccaccaccactgggcgccccagcttctggagtagctgctggaggagacccagacgatg15360

gagatggcgacgacagctcgagccacgacaccgacttctctgctaaccttgagccggaag15420

gatgggttactcgacccatcactcgcgacgctgctcgcgggtgtcacttccacgatgcgc15480

tcgacaccctgctacgtcgggcatttaaccagcatacttggtctgtcgagtatcgctgtg15540

tggtctaccagcacagtcgcggggtctacccggaccgctgggaggcaacctgcttggtgc15600

gctgcccggagaacagtctccagggtgcggaggcctgctcagagcactattctatctctg15660

agcgggactcagctgaggcagccatgcaagatgctgcacggcgtgcgctttcgcactact15720

gctcggttttcggtggggcagctgacggtcttgacctgaagtattacccccgccgtccat15780

ctggcagcacaggaggcgtgattgtctcacctgtcggtgagggcaatcctaggttgagca15840

gcacagtcaacctagccgccgtgctaaacacggagctggaccatgcattagacgagctga15900

gtagggctcgtgctgagatcgccctgctgcgggctgagcgcgcggaacgtcgtcacctgg15960

atggtggttcccccgctcccgtcgggactcagcacccgtaccgctcacctcagcgtggac16020

accagtcttatggcaatcccgcctgcaagaccaagataactctagaaccatatatcgtta16080

gagttggatcttgtaattaatacgaaatatatacatagaagcttcagtcttagcgttagt16140

ctcggtcttagttagtcttagttaaacagggtagtttgctatatcctgtgcatttatgtt16200

tgtcatgatgaactatgtttggtttggatctttgtaatgattgtcaccagagtgtgggta16260

ccccctgcattttggtttacctattatgttaatagagttagttatatagttgggaaacct16320

tttattccactctcctctttatctgagaaactgtgtggtctgtgttggagatcagtgaag16380

atgctcatctgttcagtgctgttgaagaattctattctcttttcttatgctgcaagattt16440

gccagatcagtcctgatgtgtggttgcattctgcagatgtcagagaacaggcgcagagga16500

ggaaggcgtgctcagcaggagcgagccgctcaacaggaggaggtgccccagcagcagcac16560

ctgccgcccccgcccccgatgtccatcgagcagatgtttctgatgcagactcaggcagtt16620

caagccatcggtcagactctggccgccattcagcagcagcagcagcagcaggccccacct16680

caacctcagatgcctcagatgcccagagacaagcgtgctgaattcatgagaggtcatcca16740

ccaacgttcgctcattcttctgaccctatggatgctgaagattggctgcgcactgtggag16800

cgggagttgcataccgctcagtgcgatgacagggagaaagtcctgtatggtccccgtctg16860

ttgagaggagcagcccagtcatggtgggagtcttacctcgccacccatgcccatcctgac16920

gccatcacctgggaagagttcagaggtagctttcgtcagtaccatgttcctgcaggtctg16980

atgacagtgaagaaggaggagttcctggccctcaaggaagggccattgtctgtcagtgag17040

taccgagacaggtttctgcaattgtctcgctatgctcctgaagatgtcaacaccgacgcc17100

aagcgacagtaccgtttcctgagaggcttggttgaccctctgcagtaccaactgatgaat17160

cacaccttcccgacattccagcacctgattgacagagcaatcatgacagaaggaagcgta17220

aggagatggaagatcgtaagcgcaagatcagtggaccccagcctggaagcagcaatcgtc17280

ctcgtttctcaggcaatcaacctcagcagttcaggcagaaccagcgtccacctcagcagc17340

agcagcaattccaaaggcagtatcctcagcaccagtaccagaaccgtcagagcaatcagt17400

caggaggtcagtttcagaggcagaatcagcaagcacctcgtcttcctgccccagcaaatc17460

agcagaacagtcaggcagcaccagctcaggttggaaacagagcatgtttccactgtggag17520

agcaaggccactgggtgatgcaatgtccgaagaaggcagcccagcagcagtcaggcccca17580

atgccccagcgaagcagaatgtgcctcagcctggagcaggcaatcgctctcagccgcgct17640

ataatcatggaaggctgaaccacttggaggctgaagcagtgcaggagacccccggcatga17700

tagtaggtatgttcccagtcgactcccatattgcagaagtgttatttgatactggagcaa17760

cgcattctttcattactgcatcatgggtagaagcacataaccttccaattactaccatgt17820

caacccccattcaaattgactcagccggtggtagaattcgagccgatagcatttgtttga17880

atataagtgtggaaataagggggatagcgtttcccgccaaccttatagtaatgggtactc17940

aggcaatagatgtcatcctagggatgaattggctagataagtatcaggcagttatcagtt18000

gtgataaaaggaccatcaagttggtgtccccactaggagaggaagtggtgaccgagttag18060

tcccgcctgagccaaagaaaggaagttgttatcagatagctgttgatagcagtgaagcag18120

acccaatcgagaggatcaaggttgtgtccgagttcccagatgtgtttccaaaggacttac18180

cgggtatgccaccagagcggaaagttgagtttgctatagagcttcttcccggaaccgccc18240

ctatctttaagagagcttacagaatatctggaccagagttggttgagcttaagaagcaga18300

ttgatgagctgtcagagaaaggttacattcggccaagcacctcgccttgggccgcccctg18360

tcctatttgtggagaagaaagatggcaccaagaggatgtgtatcgattatcgagctttga18420

atgaagtcacgatcaagaacaagtatcccttgcccagaatagaagatttgttcgaccagt18480

tgagaggagccagtgtgttctccaagattgatctgaggtcaggttatcatcagctcagga18540

tccgaccttcggacattccgaagacggcattcatttccaagtatggtttgtatgagttca18600

cagtgatgtcttttggtttgaccaatgcgccagcgttcttcatgaacttgatgaacagtg18660

tattcatggattatctcgataagtttgtggtggtattcattgatgacattctggtttatt18720

ctcaaagcgaagaagagcacgcagatcatttgaggttggtattgcagagattgcgagagc18780

atcagttgtatgcaaagttgagcaagtgtgagttctggatcagtgaggtcctgttcttgg18840

gtcacataatcaacaaagaaggattggttgtggatccgaagaaagtggcagacattttga18900

actggaaagcgccaacagatgctagaggaatcaagagtttcattggaatggccggatact18960

atcggcgattcattgaagggttttcgaagatcgcaaaaccaatgacagcgttgctaggca19020

acaaagttgagttcaagtggacccagaaatgtcaagaggcctttgaagcgctgaaagaga19080

agttgactacagcgcctgtcctagtcttgcctgatgtgcacaagcccttctcagtgtatt19140

gtgatgcttgttacacaggtttgggatgtgtgttgatgcaagagggaagagttgtggctt19200

actcgtcccgacaactgaaggttcatgagaagaattacccaatccatgatctagagttgg19260

cagcagtggttcacgcactgaagtcatggaggcactatctgtatggacagaaatgcgatg19320

tttacacagatcacaagagtctgaagtacatattcactcagtcagagttgaacatgaggc19380

aacgaagatggttagagttgatcaaagattatgagttggagattcattaccatccaggca19440

aagcaaacgtagtggcagatgctttgagcagaaagagtcaagtcaatctgatggtcgctc19500

gtccgatgccttatgagttggccaaagagtttgacaagttgagtctcggttttctgaata19560

attcgcgaggagtcaaagttgagttggaacctaccttggagcgcgaaatcaaagaagcgc19620

agaagaatgatgagaaaatcagcgagatccggcgactgattctagatggcaaaggcaaag19680

aatttcgagaagatgcagaaggcgtgatatggttcaaagaccgcttgtgtgttcctaatg19740

tccagtctattcgggagttgattctcaaggaagctcatgagacgtcctattcgattcacc19800

ctggcagtgagaagatgtatcaggatctgaaaaagaaattctggtggtacggaatgaaga19860

gggagatcgcagagcatgtggctaggtgcgatagttgccgaagaattaaggcagagcacc19920

agagacctgctggattgttgcaaccattgcagatccctcagtggaaatgggacgaaatcg19980

gtatggatttcatagtcggattgcctcgcactcgagccggctacgattccatctgggtag20040

tagtggaccgcttgaccaagtcagcccacttcatacctgtcaagaccaactacagcagtg20100

cagtattggcagaattgtatatgtctcggatcatttgtcttcatggtgtgccaaagaaga20160

tagtgtcagacagaggaacgcagttcacctctcatttctggcagcagttgcatgaagctt20220

tgggcacgcatctgaatttcagttcagcttatcatccacagacagatggccagaccgaaa20280

ggaccaaccaaattcttgaagatatgttgagagcctgtgcgttgcaagatcagtccggat20340

gggataagagattgccttatgcagagttttcctgtaacaacagttaccaggccagcttga20400

agatgtcaccatttcaggcgctctatggaaggagttgtagaactccgttgcaatgggatc20460

agcctggagaaaagcaagtgtttgggccagacattttgcttgaagccgaagagaacatca20520

agatggtccgagagaatctgaagatagcgcaatcgaggcagcgaagctatgcagacacaa20580

gaagaagagagctgagtttcgaagtcggagactttgtctatctgaaagtgtcaccaatca20640

gaggagtcagaaggttcggagtgaaaggcaagctagcaccccgctacattggtccgtatc20700

agattcttgcaagacgtggagaagtggcctatcagctcagtttgccagagaatttgtctg20760

ctgtgcatgatgtctttcatgtgtctcagttgaagaagtgcttgcgtgtgccagaagagc20820

agttgccagtagaaggtcttgaagtccaggaggacttgacctacgttgagaagccagtgc20880

aaatccttgaggttgcagaccgagtcacctggaggaagaccatcagaatgtgcaaagtca20940

gatgggatcatcactctgaggaagaagcaacctgggagcgtgaagatgatctgatggcca21000

agtaccctgagctctttgctagccaaccctgaatctcgagggcgagattcttttaagggg21060

gataggtttgtaacgccctgaatttgggggtagaatttttcttcttttctctcaccaaat21120

tcgggcgttactctcttttctctttccccgtttgctccttcttcccaatttcaaaccagt21180

atagcggcaggtgtccgtgtcatgtataaaccaaaacctaagtgtcatgggtgttgcatc21240

atgccgaagcacatttctttgtctgatgttgagtgttcgtctcgttccgttccggatttc21300

ggttcgcgatttaattccgtttagtggtccgcgctcgtcgcgggttttcgatccgcgaag21360

tggcccgacccatcccaacctagtccagcccagcccagccggcccggcccgccccggcct21420

gcgcgcccctggcgcccaaacccccccatgcgccccccctcctctctctctctcatttgg21480

atctcccgcgcaacaacctctcctctccctcttccacctctctctccccgtggtgcccta21540

ggatttggagacggcgatcaccggattttggaccccgaggtgagctcccctcccctcccc21600

ttctcttctctctctctctccctcctcttcttctccccacgcgcgccccccttctccccc21660

tgctcacgcgcgcccctgcccgcccccgctcaccggcggcgcggcgccccccgcccctgc21720

ccggccccgcgcggcggcgcccgccctcccccggccgcggcggcgctcgtccgccctcac21780

ccccggccgcggcgcccgcccctccccctcgcgcgcggaggcggcgcccgccctcaccac21840

gacccgcagcgaccccgccccgtccccgcctctccccccgcgcgcggcggcgcccgcccc21900

ccgcccctccccgcggcggcgctcgcccgcccgcccctgcccctccccgcggcggcgccc21960

gcccgcccctgctcgcgcgcagcgcggccccggcgcgcccccggcatggcccggcgcggc22020

cccggtggcccctgctcgccggcgcgaccccggcgtggcccccagcccccggcgcgtccc22080

cggcgcggccccggccggctcggccgcccctggccggttcaaccgccccccctggccggt22140

tcaaccgccctggccccagctcgcccgcccgttcccccgtcccggcctcgcccgaccgta22200

tcttcgctcgcccgcgctcgcggtgattgctcgttaattagcgtgttgcgtcgcacgcta22260

cgccgcgcgacggatttatctaaaattcagattctatcctgtgatacgtagtacaactga22320

cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn22380

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnntgggtcggcgctgagatca22440

gcttgattcgtttttggttatacatgacacggacacctgccgctatactggtttgaaatt22500

gggaagaaggagcaaacggggaaagagaaaagagagtaacgcccgaatttggtgagagaa22560

aagaagaaaaaattctacccccaaattcagggcgttacatcaggctacaaaggatgccaa22620

tggtattgctgctctctatattgttcttgttctaatgtaaaaactacaacacaactcttt22680

acttgatcccagaaattccttctgcctcaaatggagacaatgacgagtggtcataagtac22740

agagattgcagacaaggtaaattttgcaatagaaataactaaccaaccattagtgcttga22800

aaaaaactggactggtgactggggcacgtggtttcatcaacatttggacctcaacggtct22860

aatcagtataacttagaagttggctagctcttgaaaaacactgcatgacactaagcattt22920

gtttattttcagctgcttacacccctatgatttcaagtaactacttgtctacttgtgata22980

atcacctgaatatgattatttgaaatgcttatcatgtctcgtcaattgcatttcttttat23040

gtgtacctgaagtctgctcttgcttcctaatagagttcgttttttaatacagaaaccact23100

ctgagatagccacaatatagtaaaagtggcagctaaggtactaaaaacacccatgcaaat23160

aagaaaaaaatgaatcttgtattttaattttgttaaatacctctatagtttggcgatata23220

ttatgttaccatcctgcttatagcctgtaggtcattttatatgagccatcaaattgcgat23280

gacagttgccacaaatccagtttcatatgaaggtattagctgtgtaacaagctaattgtt23340

gctctctgcccaataagttattcaattggattagtaggttgcatccaaggttattcaatt23400

ggatcagtaggttgcatccaaggtatactgctgctctctgcccaataagttattcaattc23460

gatcagtaggttgcatgttcccttcattttattaaaaaatacataataatataataagta23520

cttgtttgttctaaaaataatacttctgtaaatgaggatattaattttccttttggtaat23580

aatgcaggttgatgatactgaagtcatcagttttttgctgcaaactgaaataattcctct23640

gtgcttgcgaaccatggagatgggtagtgagctatccaaaattgtatgtagctagccaaa23700

tattctcattcaaatatcataatttatctcttctgcttaatactggcaaaggtgtaatag23760

tttttttagtattgatttgtcacctgaagtttatcttgtgcactactactttgccatcat23820

cagttatctctagaatactcttgtcctgtaccattttctctctgataagcctaaatttgt23880

acaattcataagcctaaaaggtgacttatataatatatacaaggaccctcaagagttgtt23940

tggcaattcagtgactgtcctgggtcctgttttggggagcttctggtagcttttgcttct24000

ccaaaagaaaagctagaagctccccccaaacagagcagcttcttcaagccggtaaaagct24060

tcaaaagctgtaattatactaaaaacagtgaagctccctcagagcagcttcccagctctc24120

taggagatgcttttggagaagctacagtttccccaaacagggccctgctctgttgaaccc24180

cccccttcctgatacatatttgaatatgagtttatagtgtgtgtgggggtgtaagtaggg24240

gggtaatgggttctaaattttatactataaaaattaaggatcggattagaattgagctct24300

atttctattcatttttgaactaaaattaattaagggctcaaatgaattatgaagaagcat24360

taggatcatgatccattaccacccctacgtgtaagatgttttttggtggttgtggttgat24420

tttgaattttaaggccgcatatgtctcatggaccacacaagctcatattcatctacattt24480

gtagccgtcactaacttagccaaatatgcatatgtggcggctagcaacaggtccttggtt24540

tcttgggttatttattctctttttatcgtgtttgaatgttttcgtgttcatttgcataac24600

atcttaggtctacattagtatatgaattgagatcaaatgtgaattggaccacacaagctc24660

atattcatctacatttgtagtcgtcactaacttagccaaatatgcatatgtccgcttctg24720

atttcattgtgtcttttcttcaggagtttggggatcaaggagaggactccattatcttgt24780

caccgcgactgaaggagattagtactcctgaccgccccactgccctccgtttcctaggta24840

cacgcataacagccattggtatgaatacatgttttatacgtgaatggagttccagtttaa24900

tttaaagattcaagttcactacaacaagattttacagtactgagcccatttgactttcct24960

tgagaaatagtgaaagggaattaggcttacacctagttcctaaataattttggtggttga25020

attgcccaacacaaataattggactaactagtttgctctagtgtacaagttatacaggtg25080

ccaaggttcacaacaagccaattaaaaagaccaaagttgggttcaaaatagagagccaaa25140

ggcatcccgaaaggctccctggtttggcgcaccggactgtccggtggcgcaccggacagt25200

gtccggtgcaccaggggacctcgcgcagaactcctcagcctcgggaatttttcggagccg25260

ccgcgctataattcaccggactgtccggtgtacaccggacagtgtccggtgctccaagaa25320

aacgcggctccggaacttggcagcctcaggaaatcagaacggctgctccgctataattca25380

ccagacatgtccggtgtacaccggactgtccggtgcaactgcggagcaacggctacttcg25440

cgccaacggtcacctgcaggcgcattcaatgcgcgccagaagcgcgcagaagtcaggcac25500

acccatgctggcgcaccggacactctacagtacatgtccggtgcgccaccggacatcaag25560

gcgggcccagaagacagaactccaacggtcaaattccaacgactttggtgacgtggctgg25620

cgcaccggactgtccggtgcaccatatgacagacagcctccaccaacggtcatgtttggt25680

ggttggggctataaataccccaaccaccccaccattcattgcatccaagttttccagctt25740

ccaaccactatataagagctagcattcattgcaaagcacaccaaaagagatcaagtcctc25800

tcccaactccacacaaagccttagtgattagagagagtgatttgtagtgttcatttgagc25860

tcttgcgcttggatcgcttcttttctttggcattctttcttgtgatcaaacactcacttg25920

taattgaggcaagagacaccaattgtgtggtggtccttgcgggaagtttgattcccaagt25980

gatttgagaagagaagctcactcggtccaagggaccgtttgagagagggaagggttgaaa26040

gagacccggcctttgtggcctcctcaacggggagtaggtttgagagaaccgaacctcggt26100

aaaacaaatccacgcgtctcacttcattattcgcttgcgatttgttttcacgccctctct26160

cggactcgttcttatttctaacgctaacccggcttgtagttgtgtttatatttgtaaatt26220

tcagtttcgccctattcaccccccctctaggcgactatcaattggtatcagagcccggtg26280

cttcattagagcctaaccgctcgaagtgatgtcgggagatcacgccaagaaggagatgga26340

gaccggcgaaaagcccactacaagccacgggagcacttcatcggaagagtcccgcaccaa26400

gaggaaggagaagaaagactcctccaaagggaaggagaagaagaaggactcctccaaagg26460

aaaggagaagaaatcttcttcacacaaagaaaagaaggagaagtcttcctcccacgagcc26520

gcaacggagtggggacaagaaaaagaggatgaggaaagtggtctactacgagaccgattc26580

ttcatcgacatccacctctggctccgacgcggcgtccgtcacttctaaacgccaagagcg26640

taagaagtatagtaagattcccctacgctaccctcgcatttctaaacatacacctttact26700

ttccgtcccattaggcaaaccaccaacttttgatggtgaagattatgctaggtggagtga26760

tttaatgcgatttcatctaacctcactccacaaaagtatatgggatgttgttgagtttgg26820

tgcacatgtaccatccgtaggggatgaagactatgatgaggatgaggtgacccaaatcga26880

gcacttcaactcccaagccacaaccatactcctcgcctctctaagtagagaggaatacaa26940

caaggtgcaagggttgaagaatgcgaaagaaatttgggatctactcaagaccgcgcacga27000

gggtgatgaactcaccaagattaccaagcaggaaacgatcgagggggagctcagtcgctt27060

ccgtcttcgccaaggggaggagccacaagatatgtacaaccggctcaaaaccttggtgaa27120

ccaagtgcgcaacctcgggagcaagaaatgggatgaccacgaggtggttaaggttattct27180

tagatcactcatcttccttaaccccactcaagttcaattaattcgtggtaatcctagata27240

tacactaatgacccccgaggaagttattgggaattttgtgagctttgaatgtatgatcaa27300

gggctcaaagaagatcaacgagcttgatgatccctccacgtccgaagcacaaccggtggc27360

tttcaaggcgacggaggagaagaaggaggagtctacaccaagtagacaaccaattgacgc27420

ttcaaagctcgacaacgaggagatggctttaatcatcaaaagctttcgccaaatcctcaa27480

gcaacggaaggggaaggattacaaatcccgttcaaagaaagtttgctacaagtgtggtaa27540

gcccggtcactttattgctaaatgtccattatcaagtgacagtgacagggataatgacaa27600

gaagggcaagaggagagaaaagaagaggtaccacaagaagaggggcggtgatgcccacgt27660

atgccgcgagtgggactccaacgagagctccaccgactcctccgacgacgaggacgtcgc27720

caacatcgccgacaccaagggactcctcttccccaacgtcggccacaagtgcctcatggc27780

aaaggacggcaaaaacaagaaggataaatctaaatcctccactagatatgaatcctctag27840

tgatgaaaatgttagtgatgaggaagataacttgcgatctctttttgccaacctcaacat27900

gcaacaaaaagagaaacttaatgaattgattagtgtcattcatgaaaaggatgatctctt27960

ggacacccaagaggacttccttattaaagaaaataagaagcatgttaaggttaaaaatgc28020

ttatgctctaaaagtagaaaaatgtgaaaaattgtctagtgagctaagcacttgccatga28080

gactataaacaaccttagaaatgagaatgctaatttgttagctaaggttgattctcatat28140

ttgtaatgtttcaagttccaatcctagagataataatgatgatttatttgctaggattaa28200

agatttgaacatttcacttgctagccttagaaatgaaaatgaaaaattgcttgctaaggc28260

taaagattttgatgtttgcaatgttactatttctaaccttagaagtgaaaacgacatatt28320

acatgctaaggttgtagaattaaaatcttgcaaacctcctacatctatagttgagcatgt28380

atctatttgtactagatgtagagatattgatgttgatgctattcatgatcacatgacttt28440

aattaaacaacaaaatgatcatatagcaaaactagatgctaaaattgccgagcataactt28500

agaaaatgaaaaatttaaatttgctagaagtatgctctatagtgggagacgccctggcat28560

caaggatggcattggcttccaaaggggagacaatgtcaaacttaatgcccctcctaaaag28620

attatctaattttgtaaagggcaaggctcccatgcctcaggataacgagggttacatttt28680

gtaccctgccggttatcccgagagcaaaattaggaggattcactctaggaagtctcactc28740

tggccctaaccatgctttcatgtacaagggtgagacatctagctctaggcaaccaaccca28800

tgttaagttgcctaagaagaaaactcctagtgcatcaaatgaacatagcatttcatttaa28860

gacttttgatgcatcttatgttttgactaacaaatccggcaaagtagttgccaagtttgt28920

tgggggcaaacacaagggctccaagacttgtgtttgggtacccaaagttcttgtttctaa28980

tgccaaaggacccaaaaccgtttgggtacctaaagtcaagaactaaaattgttttgtagg29040

tttatgcatccggaggctcaagttggatactcgacagcgggtgcacaaaccatatgacag29100

gggagaagaagatgttctcctcctacgagaaaaaccaggatccccaacgagctatcacat29160

tcggggatggaaatcaaggtttggtcaaaggtcttggtaaaatagctatatctcctgacc29220

attctatttccaatgtttttcttgtagattcattagattacaatttgctttctgtatctc29280

aattatgcaaaatgggctacaactgtcttttcactgatataggtgtcactgtctttagaa29340

gaagtgatgattcaatagcatttaagggagtgttggagggtcagctatacttagtagatt29400

ttgatagagctgaactcgacacttgcttaattgctaagactaacatgggctggctctggc29460

atcgccgactagcacatgttgggatgaagaatcttcataagcttctaaagggagagcaca29520

ttttaggattaaccaatgttcattttgagaatgacagggtttgtagcgcatgccaggcag29580

gaaagcaagttggagcccatcatccacacaagaacatcatgacgaccgacaggccgcttg29640

agctactccacatggatctattcggcccgattgcttacctaagcatcggcgggagtaagt29700

attgtcttgtgatagtggatgattattctcgcttcacttgggtgttctttttgcaggaaa29760

aatctcaaacccaagagaccttaaaaggattcttgagacgggctcaaaatgagttcgcct29820

taaggatcaagaaaataagaagcgacaacggaacggagttcaagaactctcaaattgaag29880

gcttccttgaggaggagggcatcaagcatgagttctcttctccctacacgtcacaacaaa29940

atggtgtagtagagaggaagaatcgaactctattggacatggcaagaaccatgcttgatg30000

agtacaagactttggatcggttttgggctgaggcggtcaacaccgcctgctacgccatca30060

accggttatatctacaccgaatcctcaagaagacatcttatgaactcctaaccggtaaaa30120

agcccaatatttcatattttagagtctttggtagcaaatgttttattcttgttaaaagag30180

gtagaaaatctaaatttgctcctaagactgtagaaggctttttactaggatatgattcaa30240

acacaagggcatatagagtctttaacaagtccactggacaagttgaagtttcttgtgacg30300

ttgtgtttgatgagactaacggctctcaagtagagcaagttgatcttgatgaaataggta30360

atgaagaggctccatgcatcgcgctaaggaacatgtccattggggatgtgtgtcctaagg30420

aatccgaagagcctccaaatgcacaagatcaactatcctcctccacgcaagcatctccac30480

cgactcaaaatgaggatgaagctcaagttgatgaagtagaagatcaagcaaatgagacac30540

ctcaagatgacgacaatgatcaagggggagatgcaaatgatcaagacaaggaggatgaag30600

agcataggccgccacacccaagagtccaccaagcaatccaacgagatcaccccgtcgaca30660

ccatcctcggcgacattcataagggggtaactactagatctcgtattgcacatttttgtg30720

agcattactcttttgtttcctctattgagccacacagggtagaggaagcactccaagatt30780

cggattgggtggtggcgatgcaagaggagctcaacaacttcactaggaatgaggtatggc30840

atttagttccacgtcctaatcaaaatgttgtaggaaccaaatgggtcttccgcaacaagc30900

aagatgagcatggtgtggtgacaaggaacaaagctcgacttgtggccaaaggatactccc30960

aagtcgaaggtttggatttcggtgaaacctatgcacccgtagctaggcttgagtcaattc31020

gtatattattggcctatgatacttaccatggctttaagctttatcaaatggacgtgaaaa31080

gtgccttcctcaatggaccaatcaaggaagaggtctatgttgagcaacctcccggctttg31140

aagacagtgagtaccctaaccatgtctataagctctctaaggcgctttatgggctcaagc31200

aagccccaagagcatggtatgaatgccttagagatttccttattgctaatggcttcaaag31260

tcggaaaagccgatcctacactctttactaaaactcttgaaaatgacttgtttatatgcc31320

aaatttatgttgatgatattatatttggatctactaacgagtccacttgtgaagagttta31380

gtaggatcatgacacagaaattcgagatgtctatgatgggggagttgaagtattttctag31440

gattccaagtcaagcaactccaagagggcaccttcattagccaaacaaaatacactcaag31500

atattctaagcaagtttggaatgaaggatgccaagcccatcaagacacccatgggaacta31560

atgggcatctcgacctcgacacgggaggtaagtccgtggatcaaaagctataccggtcga31620

tgataggttctttactctatttatgtgcatctcgaccggacattatgctttccgtatgca31680

tgtgtgcaagattccaagccgaccctaaggaagcccaccttacggccgtaaaacgaatct31740

tgagatatctggcttatactcctaagtttgggctttggtatcctaggggatccacatttg31800

atttgattggttattcggatgccgattgggcagggtgcaaaatcaataggaagagcacat31860

ccgggacttgccagttcttgggaagatccttgggtgtcttgggcttcaaagaagcaaaat31920

tcggtcgctctttccaccgccgaagccgagtatattgcccgcaggccactgttgcgcgca31980

actgctttggatgaggcaaaccctgcgggactatggttacaaactaaccaaggtcccttt32040

gctatgtgataatgagagtgcaatcaaaatggtcgacaatcccgtcgagcatagccgcac32100

taagcacatagccattcggtatcactttttgagggatcaccaacaaaagggagatatcga32160

gatttcatacattaatactaacgatcaattagctgatatctttaccaagcctcttgatga32220

acaatcttttaacaaacttaggcatgagctcaatattcttgattctaggaacttcttttg32280

ttaaattgcacacattgttcttttatatacctttgatcatatctcttttatatgctatga32340

ctaatgtgttttcaagtctatttcaaaccaagtcataggtatattgaaagggaattggag32400

tcttcggcgaagacaaaggcttccactccgtacctcatccttcgccatcacttcaagcaa32460

ctctccgttctcgggggagataagcatgagcatcaaagaaaaggactttgggggagaaat32520

gagcccaaagccaaaggaccggacttcgtctttggtataatcttaactcatttatttatg32580

accaaaagggaaaatagcacttcgagggctctaatgattccgtttttggcgattcatgcc32640

aaaaagggggagaaatgagcccaaagcaaaaggaccgcaccaccaccaatttcaaaaact32700

tagtgttgaatatttttcaatttgtatcttattttcaattggtatcttattgtgttcaaa32760

agggggagaaagtagtattttaaaatgatatatcaaaaaccctcttgaatactaagagga32820

ggatctcttttagggggagttttgtttaagtcaaaggaaaagcatttgaaacagggggag32880

aaaatttcaaatcttgagaatgctttgcaaaaatcctattcatttacctttgactatttg32940

caaaagaactttgaaaaggatttacaaaataatttgcaaaaacaaaactcgtggtgcaag33000

cgtggtccaaaatgttatataaagaaagaaacaatccatgcatatcttgtaagtattcat33060

attggctcaattccaagcaacctttacacttacattatgcaaactagttcaattatacac33120

ttctatatttgctttggtttgtgttggcatcaatcaccaaaaagggggagattgaaaggg33180

aattaggcttacacctagtccctaattaattttggtggttgaattgcccaacacaaacaa33240

ttggactaactaagtttgcacaagtttatagattacacaggtgtaaaaggttcacactca33300

gccaataaaaggaccaagtttttggattcaacaaaggagcatagtgggaaccgaaggccc33360

tctggtctgagagcaccggactgtccggtgtacaccggacagtgtccggtgcaccagagg33420

actcaagctccaactcgcccccttcgggaattgccagaggcgactccgctataattcacc33480

ggactgtccggtgtacaccggacagtgtccggtgcgccaagggaggtcggcctcaggaac33540

tcgccagcttcgggaaaagccatcggcccgtccactataattcaccggactgtctggtgt33600

gcaccggactgtccggtgcgactcgagagcaacggctatttccgcgccaacggctctctg33660

ccgcgcatttaatgcgcgctctgcgcgcgcagatgtcaggcgcgcccactccggcacacc33720

ggacaaggaacagtagatgtccggtgtgcaccggacacccaggcgggcccacaagtcaga33780

agcttcaacggctagaatccaacggcagtgatgacgtggcaggggcaccggactgtccgg33840

tgtgcaccggactgtccggtgcgtcatcgaacagacaacccagccaacggtcaagtttgg33900

tggttggggctataaatacccccaaccaccccaccattcattgcatccaagttttccact33960

tctcaactactacaagagctctagcattcaattctagacacaccaaaagagatcaaatcc34020

tctccaattccacaaaaggctttagtgattagagagagagatttgccgtgttctttttga34080

gctcttgcgcttggattgcttcttttctttcgcacttgttcttgtgatcaaaaacccatt34140

gtaatcaaggcaagaggcaccaattgtgtggtggcccttgttgggaagttttgttcccaa34200

ctttgattagagaagagaagctcactcggtccgagggaccgtttgagagagggaagggtt34260

gaaagagacccggcctttgtggcctcctcaacggggagtaggtttgcgagaaccgaacct34320

cggtaaaacaaatccacgtgtcacactcttcatttgcttgcgatttgttttgcgccctct34380

ctcgcggactcgtttcttcattactaacgctaacccggcttgtagttgtgtttatatttg34440

taaatttcagtttcgccctattcaccccccctctaggcgactatcaaaaacagtgcaacc34500

atgagggtggaatggggtgcccttagctgaataattagaggatccggggtgtagttcact34560

tagccatcgtgccgtcaatggggctcggtgtatgcggctcgctctgccaagtttgggttc34620

gccccttggggaggagtgcggtgcatttaggaaacctaacgggtggctacagtcccgggg34680

aatctttgtaaaggctacgtagtgatgccctgctgggtcaccttggtagtgatcaatgga34740

gagtcatgatctccgggcagaatgggaatcacggcttgtgggtaaagtgcacaacctctg34800

cagagtgtttgaaaactgatatatcagccgtgctcacggttatgagcagccaagggagct34860

ccagtgattagtggtacttgatcagagatactttggtacaggtggttatgagatcgatga34920

ttctggttatgactatgatgctggtaagtggtactctttccgtttggaaaggagtacgtt34980

tgggttaataacttgggttaatgctaaaacttggctttctattagtaaataataatctga35040

ccaactaaaagcaactgcttgacttatccccacataaagctagtccactacagccaaaca35100

ggatacttgctgagtatgttgatgtgtactcacccttgctctacacaccaaacccccccc35160

ccatccccaggttgtcagcattgcaaccactgctcagtcgaagatgaagctgtggaagga35220

gacttccaggagttccaagattacgacgagttctaggtgtgggttagcggcaacccccag35280

tcggctgcctgtgaaggccgcggttatctacgtttcttttccgcactttgatttattgta35340

agaactatatggacgtctcagacgtatgatgtaatcgactatttcccttagtaatactat35400

tttgagcactgtgtgatgatgtccatgttatgtaactgctgtgtacgtgaataactgatc35460

ctggcacgtacatggttcgcattcggtttgccttctaaaaccgggtgtgacacctgatta35520

ctctcaagcaaagcctataggtagtttaagaggttgagtacaatgagaaacatttcaatc35580

attatttgcaaaagaaacattttgatcatattaaggaaaatcataggagtgaaagaaaaa35640

caatgtgtgcaaataactgaacctcctgcagctccatcatgctggccaaagtatgcttcg35700

acgaattcaaaatatcatcatatgcttgctctatctgcttatcatgattgcaaccttgtt35760

cggatgggttcctggcttcctgcaacagcacagtagaaaacaaactggagtttagaattc35820

aacttgagacttcagagttgaaacaaaacctgtattcacatatgtagcaccgatgcatat35880

acaaatacctttatcagaaacatgaagatatgtgaatacattattttctcaataaaagtc35940

atgatagaattcatgcaaatttttttagaaataaataaatgatgaggcatactacaattc36000

taaaagcagtagcagtgcaacacgaacgaacattcaaattgccccacattcttgaaactg36060

tgctgctttgctctcgttccaaaaaaactccgctaaagtaaaacttggaaaggtctggtt36120

ttgcgtgagccaccaacaccaaccaaactttacgtcactatgacagtttcagcctttcgg36180

tcccggcgacagccatggcggacgcgggggtgacagggggtgctggccaagctgggtgag36240

ctgacggaggaggaggcgacgacgctgctgcgcgtggacgccgagatacgggcattgtgg36300

cagaagctggcctacctgcaggcgctcgtacgcggggccggccgccagcgccgcgaccgc36360

gcaagcgagctgctcctgctctggctacgcgagaccagagaggttgctttcgcgggtggt36420

tctgccatacatagcggcgatggctcttcctcccaggactaccatatctacacataacaa36480

tcatgctctcatgaagttgtgatgtaataggtcacacgattttaatgtataagattgtga36540

agagtaaattaattcaaatgaattcatggacatgggacaactatgtgttaaaaacagaat36600

ctcctatgtatcctaaccatgtgtaatgacatgacaaaatgacacttgtatatgagcaat36660

taaagcatcatactaccttgtagagttctcttcggcgaatacgttgaatgatttcttttg36720

ctttcttcagtttattgttaggagcagtctcgtagacacagtcaagtcctagcattagct36780

cactgcgagagtatcagccattccagactccaggagctcctcccgggtagcctgcctggc36840

aatgggcgcagatatcctcttgagctcactcttaatgctaggttgtaggctccttgcttc36900

caggtccatcttccgaatctgcagacacaaaacaaaaacataaaattacttttcccccaa36960

cccccaaattgaagactaaggcacataaacatctcgcaactccgttggaacaccaaaact37020

gcaccgacggaaacgaaaaatccctaatctctgcttttctacccattccccaaatgtcgt37080

atgtcccgttagccaaggggatccccaaaacagaacgtgtcacgacatctatttaccagc37140

gattcagcttcctcaacgccggattggatctcagagagcttctgcttcttcctctctgcg37200

gacagacccaccgaaatcagaccaaaacaaacggtcaacaaaagaaggctttccaagcgg37260

cgctactgacgctcggctgcctggccggcgagcctcgcagtcggccctcgatgaaggagg37320

tggtggagacgctggagcgggtggaggcgatgaagagccgggcacgcggcgcgcgtacac37380

cgcccggttcctgcccccccccccccccgacgacgaccatggcgcttcgccgatggccca37440

cttgcctggtcgccccgccgtcgcgtcagcgagcgcgctcagcagctccgtgacccgctc37500

gtcgaaccgcgtggccacggcgagcggcacgaggcctgggttcgtgacgttgaccggggg37560

gcggcgggggtgaaagggaattaggctcacacctatttcctaattgattttggtggttga37620

attgtctaacacaaataattggactaactagtttgctctagtctataagttttacaggtg37680

ccaaaggttcataataagccaataaaaagaccaagaaagggttcaaacaaaaagagcaaa37740

agacatcccggaaggcaccctggtctggcgcaccggactgtccggtgtgccaccggacag37800

tgtccggtgcaccagggcactcgaagctgaactcgctaccttcgggaaaatcagagggcg37860

ctccgctataattcaccagactgtccggtgaagcaccggactgtccggtgtgccagcgga37920

gcaacggctacttcgcgcgcaacggtcgactgcaacgcattcaatgcgcgcctgcgcgcg37980

cagagggcagagcactcacagttggcgcaccggacagtctacaggacctgtccggtgcac38040

caccggacagcccagaggccccacaagtcagagctccaacgatcgaaccccaacgatctg38100

ctgacgtggctggcgcaccggactgtccggtgcgccatgcgaccgcagccttccaacggc38160

catttttggtggtttagggctataaataccccaaccaccccacattcaatggcatccaag38220

tttcccaccttcaacacattacaagagctataacattcaattctagacactccaaaagat38280

caaatcctctcccaagtccggaatcactccaaatcaaatagtgactagagagagcgacat38340

ttgtgttcatttgagctcttgcgcttggatcgcttcttttctttctcattcttcttgtga38400

tcaaactcaattgtaaccaaggcaagagacaccaattgtgtggtggtccttgcaggaact38460

ttgtgttccgtttgattgagaagagaagctcactcggtctaagtgaccgtttgagagagg38520

gaaagggttgaaagagacccggtctttgtgaccacctcaatggggagtaggtttgcaaga38580

accgaacctcggtaaaacaaatcatcgtgtctcgctctttatatttctaacgttaacccg38640

gcttgtagttgtgcttaagtttgtaaatttcagattcgccctattcaccccccctctagg38700

cgactttcaattggtatcggagccggtgcttcattagagcctaactgctcgaagtgatgt38760

cgggagcatccgccatgagggatctcgggaccggcgacaagaccgcatgctcgggaagaa38820

ctcactcaagggagtccgcccacaagcataaggaggaatcgtcttcctccatcaagtccc38880

atcggatgggtgacaaaaagaagaagatgaggaaggtggtctactacgagaccgactctt38940

cgtcaccctccacctccggctcggaatcggcctccaccacttcaaagcgccatgagcgca39000

agaagtatagtaagatgccccttcgctatcctcgcatttctagacgcactccatcactct39060

tcgttccattaggcaaaccacctatatttgaaggtgaagattattctatgtggagtgata39120

aaatgaggcatcacctaacctcactccacaaaagcatatgggatattgttgagtatggag39180

tgcaggtaccaaagaagggagataaagattacgactcggaggaggttgaacaaatccaac39240

atttcaaatccaagtcgagaggagtataataaggtgcaagggttgaagagtgcaaaggat39300

atctgggacgtgctaaagaccgcgcacgaaggagacgaggtaaccaagatcaccaagcgg39360

gagacgatcgagggggagctcggtcgcttccggcttcgccaaggggaggagccacaagat39420

atgtacaaccggctcaagaccttggtgaaccaagtgcgcaacctcgggagcaaaaaatgg39480

gatgaccatgaaatggttaaggttattcttagatcacttgtgttccttaaccctacgcaa39540

gttcaattaattcgtggtaatcctagatatacactaatgactcccgaggaagtaatagga39600

aactttgtgagctttgagttgatgatcaaaggctcaaagaaaattatcgagcacgacggt39660

ccctccacgcccgaagcacaaccggtcgcattcaaggcgacagaggagaagaaagaggag39720

tctacatcaagtagacaacccatcgacgcctctaagctcgacaacgagaaaatggcgctc39780

atcatcaagagcttccaccaaatcctcaaacaaaggaaggggaaagattacaagccttgt39840

tccaaaagggtgtgctacaagtgtggtaagcccggtcatttcattgttaaatgtccttta39900

tctagtgatagtgacaggggcgacgacaagaagggcaagaggagagaaaagaggaggtat39960

tacaagaagaagggcggcgatgcccatgtgtgccgcgagtgggactccgacgagagttcc40020

tccgactcctcatccgacgaggacgccgccaacatcgccgtcaccaaagggctcctcttc40080

cccaacgtcggccacaagtgcctcatggcaaaggacggcaaaaagaagaaggtaaaatca40140

aaatcctccactaaatatgcatcctctagtgatgaagataatgctagtgatgaggaggat40200

aatttgcgtaccctttttgtcaacctaaacatgcaactacaggaaaaactaaatgaatta40260

attagtgctattcatgagaaagatgatctcttggactttcaagaggacttcctaattaag40320

gaaaataagaagcatgttaaggttaaaaatgcttatgctctagaagtagaaaaatgtgaa40380

aaattatctagtgagctaagcacttgccatgatactattaccatccttagaaataaaaat40440

actaaactaattgctaaggttgattctaatatttgtgatgtttcaattcccaatcttaga40500

gatgataatgttaatttgcttgctaagattgaagaattgaatgtctctcttgctagcctt40560

agggttgaaaatgaaaaattgattgctaaggctaaagaattagatgtttgcaatgcttcc40620

atttctgatcttagaaataacaatgatattttacgtgctaagattgttgaacttaattct40680

tgcaaaccctctacatctgccattgagcatgtcattatttgcactagatgtagagatatt40740

aacattgatgctattcatgatcatatggctttaattaaacaacaaaataatcatatagca40800

aaattagatgctaaaattgccgagcatgacttaaaaaatgaaaaatttaaatttgctaga40860

agcatgctctatagtgggagacgccctggcatcaaggatggcattggcttccaaaaggga40920

aacaatgtcaaacttaatgcctctcctaaaagattgtcaaactttgttaagggcaaggct40980

cccatgcctcaggataatgagggttacattttgtaccctgccggttatcccgagagcaaa41040

attaggagaattcattctaggaagtctcactctggccataatcatgcttttatgtataag41100

ggtgagacatctagctctaggcaatcaacccgtgcaaaattgcctaagaagaaaactcct41160

gctgcatcaaatgatcataacatttcattcaaaacttttgatgcatcttatgttttaact41220

aacaaatccgacaagatagttgccaagtatgttgggggcaaacacaagggatcaaagact41280

tgtgtttgggtacccaaagttcttgtatctaatgtcaaaggacccaaaaccatttgggta41340

cctaaaatcaagaactaaacttgttttgtaggtttatgcatccgggggcccaagttggat41400

catcgatagcgggtgcacaaaccatatgacaggggagaagaaaatgttctcctcctatga41460

gaaaaaccaagatccccaaagagcgatcacattcggggatggaaaccaaggtttggtcaa41520

aggattgggtaaaattgctatatctcctgaccattccatttccaatgtgtttcttgtaga41580

ttctttagattacaacttgctttcagtttcgcaattatgtaaaatgggctacaactgtct41640

ttttacagatataggtgttactgtctttagaagaagtgatgattcagtagcatttaaggg41700

agtgttagagggtcagctatacttggtagattttgatagagctgaactcgacacttgctt41760

aattgctaagactaacatgggctggctctggcatcaccgactagcacatgttggaatgaa41820

gaatcttcacaagcttctaaagggagaacacattttgggactaacaaatgttcactttga41880

gaaagataggatttgtagcgcatgtcagacagggaagcaagttggtactcatcatccaca41940

caagaacatcatgatgactgacaggccactcgagctcctacatatggacctattcggccc42000

gatagcttacataagcatcggcgggagtaagtactatctagttattgtggatgattatac42060

tcgcttcacttgggtattctttttgcaggaaaaatctcatacccaagagaccttaaaggg42120

attcttgggacgggctcaaaatgagttcggcttaagaatcaaatttgttttaagcgacaa42180

cgggacggagtcaagaatctcaaatcgaaggcacgatctcctagatccggcccaaaannn42240

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn42300

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnncgctgatgaatcagcttgattcg42360

tgacacgggaggtaagtccgtggatcaaaaggtataccggtaagataataggctctttac42420

tctatttatgtgcatctcgaccggatattatgctttccgtatgcatgtgtgcaagattcc42480

aagctgaccctaaggaagctcaccttacggccgtaaaacgaatcttgagatatttggctt42540

atactcctaagtttgggctttggtatcctaggggatccacatttgatttgattggttatt42600

cggatgctgattgggcggggtgtaaaatcaatagaaagagcacatcagggacttgccagt42660

tcttgggaagatccttggtgtcttgggcttcaaagaagcaaaattcggtcgctctttcca42720

ccgccgaagccgagtacattgccgcaggccattgttgcgcgcaattgctttggatgaggc42780

aaaccctgcgggactatggttacaaattaacctaagtccctttgctatgtgataatgaga42840

gtgcaatcaagatggcggataatcccgtcgaacatagccgcactaaacacatagccattc42900

ggtatcattttcttagggatcaccaacaaaagggagatatcgagatttcttacattaaca42960

ctaaagatcaattagccgatatctttaccaagcctcttgatgaacaaacctttaacaaac43020

ttaggcatgagctcaatattcttgattcgcgcaatttcttttgctaaattgcacacatag43080

ctcatttatatacctttgatcatatctctttcatatgctatgactaatatgttcttcaag43140

tctatttcaaaccaagtcataggtgtattgaaagggaattggagtcttcggcaaagacaa43200

aggcttccactccgtaactcatccttcgtcgtcgctctgggccactctccatctttgggg43260

gagagagcaaaagacttcgtctttggtacaatcttaactcatttatttatgaccaaaggg43320

gaagaaagtacttcgagggctctaatgattccgtttttggcgattcatgccaaaggggga43380

gagagtatgagcccaaagcaaacggaccgcaccaccaccaatttcaaaaacttagttttt43440

caaagagtattttcaattggtatcctattgtgttcaaaagggggagaaagtagtattttc43500

aaaaatgatatatcaaaaccctcttgaacactaagaggtggatctcatttagggggagtt43560

ttgtttagtcaaaggaaaagcatttgaaacagggggagaaaatttcaaatcttgaaaatg43620

cttcataaaatcgtattcatttacctttgactttttgcaaaagaactttgaaaaggattt43680

acaaaatagtttgcaaaaacaaaacatgtggtgcaagtgtggtccaaaatgataaaaaca43740

aaggaacgatccatgcatatcttgtaagtatttatattggctcaaatccaagcaaccttt43800

gcacttacattatgcaaactagttcaattatgcattttatacttgctttggtttgtgttg43860

gcatcaatcaccaaaaagggggagattgaaagggaattaggcttacacctagttcctaaa43920

taattttggtggttgaattgcccaacacaaataattggactaactagtttgctctagtgt43980

acaagttatacaggtgccaaggttcacaacaagccaattaaaaagaccaaagttgggttc44040

aaaatagagagccaaaggcatcccgaaaggctccctggtttggcgcaccggactgtccgg44100

tggcgcaccggacagtgtccggtgcaccaggggacctcgcgcagaactcctcagcctcgg44160

gaatttttcggagccgccgcgctataattcaccggactgtccggtgtacaccggacagtg44220

tccggtgctccaagaaaacacggctccagaacttggcagcctcgggaaatcagaacggct44280

gctccgctataattcaccggacatgtccggtgtacaccggactgtccggtgcaactgcgg44340

agcaacggctacttcgcgccaacggtcacctgcaggcgcattcaatgcgcgccagaagcg44400

cgcagaagtcaggcacgcccatgctggcgcaccggacactctacagtacatgtccggtgc44460

gccaccggacatcaaggcgggcccagaaggcagaactccaacggtcaaattccaacggct44520

ttggtgacatggctggcgcaccggactgtccggtgcaccatacgacagacagcctccacc44580

aacggtcatgtttggtggttggggctataaataccccaaccaccccaccattcattgcat44640

ccaagttttccagcttccaaccactatacaagagctagcattcattgcaaagcacaccaa44700

aagagatcaaatcctctcccaactccacacaaagccttagtgattagagagagtgatttg44760

tagtgttcatttgagctcttgcgcttggatcgcttcttttctttggcattctttcttgtg44820

atcaaacactcacttgtaattgaggcaagagacaccaattgtgtggtggtccttgcgggg44880

agtttgattcccaagtgatttgagaagagaagctcactcggtccaagggaccgtttgaga44940

gagggaagggttgaaagagacccggcctttgtggcctcctcaacggggagtaggtttgag45000

agaaccgaacctcggtaaaacaaatccacgcgtctcacttcattattcgcttgcgatttg45060

ttttcacgccctctctcggactcgttcttatttctaacgctaacccggcttgtagttgtg45120

tttatatttgtaaatttcagtttcgccctattcaccccccctctaggcgactatcaaata45180

gccagtgctttttggtctgcgagttcctgcacttggttaatcaactgtgtcgcttgatct45240

tctacttgtttgcacgagaaggtcaaagccactttcgaagctattagttcagaacacaca45300

acatctagctaaatacatcaccagtttgaagtcattgattgtattcttgatatcatcttt45360

attcttgaatgtcatttgtgccagttcatttaactcttgtgctgcaaaccaacctgacat45420

cgtcaattcatttaatctctcaatctcagtttcctttttttgtttcacattgaagctccc45480

taattgttgcttctcgatgtgcagtggcctgattagctacaagaagctcttgttccatgg45540

actcgatctctggaccgcactatcgttgcctcagcccctaggtcgtgctgccctctggcc45600

tcctcatcgtacaattcaccaacatctccaatgtaagtgcagcaggttcagtaatgaact45660

cagaagtggcatcagaatactccaagagttttttgatctttttgcctggatatataccaa45720

gggaaatgcattcaaaactcctatagatgacgaatcccatctctccctcttttctcggac45780

acggatccccaggtccgtctccgtgctttactcatttgttttttacaagttcagatccac45840

ttgcgtactcacacggtggacatctgttatgcacatgtgtaaaccagcataagtccttac45900

actcgaaaatgcatgtgttatttagcttgagaataaataaaattattagcaaggagaaaa45960

caaaaaaataggactaaacaatagagtcacattggtttaaattagtacctagaagtaaaa46020

aaagatgatctaaattagatacatcataccaaataccatattactattccagttaccccg46080

tctactatgcctagatatcaaattcttgaaggttggccttctcattttcagtaatagcct46140

gacgaaagtagagtatgtttgtatgagcaattatgctgctcactactgccttgcgctata46200

ataggccactactgattttacatgctttttctacattagatagctcacaaacatgctacc46260

tcaaaaaaatgatggcaaaggggagccacaaaatgtcaattattttgtcaagtattagca46320

gttttcttgtgtatgtgatcagactaacactgcatgtctttgttttcctgtaaaactatg46380

tatgatgaaaccatggtgtgattgtattggctggccttaccctgttttgttgcaatgcat46440

tcgttgttgtacaggtaatatgttgaaacacaattcattgcatatgacaattctgttttt46500

tctttctagaatattgacatattgtttgatcattattttctaagcaataatcatggctat46560

tcttatagtattgcataatacctttttcttttcgaaccctagcgcattgattctttagtg46620

aagtgattatagtgattccagcgggagagtagggtggggagcagagggttgattctggac46680

tgatttcggtggagattaaatggggagcagtgaggagcatgtttttttagatcccaccag46740

aatatgtgcgccattttgctatttggctgaggagtgatgctcagggagaatccgttctca46800

ggagctgtgccaaatgtcgccttaggttttatgatatgacctgacttctgtgttaatatt46860

tgttagatctttattttatttgaggttacaaaggtggtgttctcaagctagaaacaaagt46920

tgtggctaggtcaaaactagatgatgctcttgaaccgtgtcttttgactctgttacttgt46980

tgcaggtttgatgttcactaatatgttgttcaactttgagcaggtcagcaattgactggt47040

gttgctggtagcctggcatatctggcacctgaggttctactaggaaattactcccaaaag47100

gtagatgtatgggctgccggggtgcttctgcatgttctgttgatgggcactcttccgttc47160

caaggaaaatctatcgaagctatctttgatgttataaagactgctgaacttgactttcac47220

aatagtcagtgggcatctgtgtcacttcttgcttatgatctcattggtcgaatgcttaat47280

cgagaggtctcttcaaggcccgatgccgaagatgttctccgtaagttcaagcacccttgt47340

aacttgtgctttatatatatatataatatatatatatatatgattctcaatttatcattg47400

acttttcctaatggctttcaacacagggcacccatgggtcttattctacactgattgcct47460

gcagaaagctgaattctctaacctatgggatactaacaaaactgcagctcccatgattca47520

tcgggagatagtcaggtttggttactgcgagtcttcatcttcaaaatcctcaagtgacaa47580

ctctgaagagcgagatgaatgcggtatagttgatgcactggcgacaacaataacacaggt47640

gaggatctcagagcccaagaggagtcggctgttcagcctacccaacgggttgttgccgcc47700

aagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtggctttctaacttgac47760

ctacctagctcccatccccatgcatgtataaacgagataaacgagctctgtgattttata47820

gatggaaaattttcaccgtggttgatgttttgcgattgctagctcgctgagcctgcaatc47880

ctctgtaaatatatcattgttgtcatcatttttgtacatcgatgacaccgtaattgattc47940

gatt47944

2

1524

DNA

artificial sequence

cDNA

2

atggggagcagtgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtca60

cttcttgctcatgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaat120

gccaaagaagttctccctcccatgattcatcgggagatagtcaggtttggttactgtgag180

tcttcatcttcaaaatcctcaagtgacaactctgaagagcgagatgaatgcggtatagtt240

gatgcactggtgacaacaataacacagattcggaagatggacttggaggcaaggagccta300

cagcctagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaac360

aacgtcaagatgggtctatccgcagagaggaagaagcagaagctctccgagatccaatcc420

ggcgttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacag480

cctagcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaac540

gtcaagagtgagctcaagaggatatctgcgcccaatgccagtggcctgattagctacaag600

aagctcttgttccatggactcgatctctggaccgcactatcgttgcctcagcccctaggt660

cgtgctgccctctggcctcctcatcgtacaattcaccaacatctccaatgtcagcaattg720

actggtgttgctggtagcctggcatatctggcacctgaggttctactaggaaattactcc780

caaaaggtagatgtatgggctgccggggtgcttctgcatgttctgttgatgggcactctt840

ccgttccaaggaaaatctatcgaagctatctttgatgttataaagactgctgaacttgac900

tttcacaatagtcagtgggcatctgtgtcacttcttgcttatgatctcattggtcgaatg960

cttaatcgagaggtctcttcaaggcccgatgccgaagatgttctccggcacccatgggtc1020

ttattctacactgattgcctgcagaaagctgaattctctaacctatgggatactaacaaa1080

actgcagctcccatgattcatcgggagatagtcaggtttggttactgcgagtcttcatct1140

tcaaaatcctcaagtgacaactctgaagagcgagatgaatgcggtatagttgatgcactg1200

gcgacaacaataacacaggtgaggatctcagagcccaagaggagtcggctgttcagccta1260

cccaacgggttgttgccgccaagcaggaacagtctccgaacatgaagatgatgaatccgt1320

gtgtggctttctaacttgacctacctagctcccatccccatgcatgtataaacgagataa1380

acgagctctgtgattttatagatggaaaattttcaccgtggttgatgttttgcgattgct1440

agctcgctgagcctgcaatcctctgtaaatatatcattgttgtcatcatttttgtacatc1500

gatgacaccgtaattgattcgatt1524

3

1082

DNA

artificial sequence

cDNA

3

cctgcccttccattcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60

cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120

atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180

ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240

ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300

gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360

atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420

gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480

agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540

aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600

caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660

ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720

aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780

agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840

agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900

tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960

gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020

gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080

ct1082

4

2321

DNA

artificial sequence

cDNA

4

tgaggagcatgtttttttagatcccaccagaatatgtgcatccgtgtcacttcttgctca60

tgatctcattggccgaatgcttaatcgagaggtctcttcaaggcccaatgccaaagaagt120

tctccgtaagttcaagcacccttgtaacttgtgctttatatatatgattctcaatttatc180

attgacttttcctaatggctttcaacacagggcaccatgggtcttattctacactgattg240

cccgcagaaagctgaattctctaacatatgggatactaacaaaactgcagctcccatgat300

tcatcgggagatagtcaggtttggttactgtgagtcttcatcttcaaaatcctcaagtga360

caactctgaagagcgagatgaatgcggtatagttgatgcactggtgacaacaataacaca420

ggtgaggatctcagagcccaagaggagtcggctgttcagcctacccaacgggttgttgcc480

gccaagcaggaacagtctccgaacatgaagatgatgaatccgtgtgtggctttctaactt540

gacctacctagctcccatccccatgcatgtataaacgacatttggggaatgggtagaaaa600

gcagagattagggattttcgtttccgtcggtgcagttttggtgttccaatggagttgcga660

gatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaattttatgttttt720

gttttgtgtctgcagattcggaagatggacttggaggcaaggagcctacagcctagcatt780

aaggctggtttgcttgcaaagctgagggagtataaatctgacctcaacaacgtcaagagt840

gagctcaagaggatatttgcgcccaatgccaggcaggctacccgggaggagctcctagag900

tttggaatggctgatactctcgctgtgagctaatgctaggacttgactgtgtctacgaga960

ctgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcgccgaagag1020

aactctacaaggtagtatgatgctttaattgctcatatacaagtgtcattttgtcatgtc1080

attacacatggttaggatacatacttaagtttctaacgtaggcgtccacacaacggattg1140

gtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcaggcacccttccc1200

cgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttccattcttcccc1260

cgctgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgtggccacactgt1320

tcgacgagcgagtcacagagctgctgagcgtgctcgctgatgcggcggtggggcgaccag1380

gcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaaccaggcggtgt1440

acgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacttc1500

cttcatcgagggccgactgcttggctcgctggccaggcagccgagcattagttgcgccgc1560

ttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgggtctatccgca1620

gagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctg1680

attcagaaaatggacctggaggcaaggagcctacagcctagcattaaggctggtttgctt1740

gcaaagccgagggattataaatctgacctcaacaacgtcaagagtgagctcaagaggata1800

tctgcgcccaatgccagattcggaagatggacctggaagcaaggagcctacaacctagca1860

ttaagagtgagctcaagaggatatctgcgcccattgccaggcaggctacccgggaggagc1920

tcctggagtctggaatggctgatactctcgcagtgagctaatgctaggacttgactgtgt1980

ctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcg2040

ccgaagagaactctacaagatatggtagtcctgggaggaagagccatcgccgctatgtat2100

ggcagaaccacccgcgaaagcaacctctctggtctcgcgtagccagagcaggagcagctc2160

gcttgcgcggtcgcggcgctggcggccggccccgcgtacgagcgcctgcaggaagccagg2220

aacccatccgaacaaggttgcaatcatgataagcagatagagcaagcatatgatgatatt2280

ttgaattcgtcgaagcatactttggccagcatgatggagct2321

5

1082

DNA

artificial sequence

cDNA

5

cctgcccttccattcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60

cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120

atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180

ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240

ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300

gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360

atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420

gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480

agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540

aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600

caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660

ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720

aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780

agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840

agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900

tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960

gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020

gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080

ct1082

6

3646

DNA

Zea mays

6

atggcacactttgatgaactagaggataaaacaacagattatgttgatttatcggttcaa60

gaatttgctcttaagcaacctcaatgtggcatggcttataattactatggaaatttaagg120

ctttatgtagtagccaataaagctgaattggcctcttcaatatttgaaatcgataaggta180

aacaaaggcggagttaatgcatctatgccagtgaccacttccactcctaattcgaatcaa240

aattcatgaaccggttatggaacaaatagagaatcaagtttcggtgaggtataatacgat300

tcccctaacccatggaatttacctagtaaaaatcctgtagttaatagtgtactagtaact360

tctgtcaccgacttgaataaagctttgaatgagtataaaaatgagatgtctaaatttatt420

gagaatagcttggtgtatagattaagcctagtagaaacacttataacaagttgtatgctt480

caattttttttgatttttttggaagctactcatagttggagggtaccaaatttacaaaaa540

aaaattggtgattataatagtaaatctaccatagaacatgttagcttgtttcttgctctg600

agaggtgaagctagtagcatgaaaattgaatgtgcgttatttttctttttcacttactgg660

tacaatttttgcatggtttatgttgttgccctgcttgttgtattggttcatgggctggtc720

tgtgaaataatttggcgatagccattttcttttttgagatacattgcttttgctatatat780

atctagatatggtgcatatttaaatgcataataaaaatgtaaaaatctaaaacgtcttat840

aatttaggacagatgaaagtactagatattagacatttttagtgtttttattaaaatgga900

atatgtaccgcctttgatgctacaacttttacttagcttttaaaacacaccattctaaat960

tgtaaaaaaatattaaaaatgtgttttgcaagatgaatatactaacctttgttatgataa1020

tagttttcatatgttaatggaacaagctaaaaagtttggcaaagtatagtcctatagctt1080

ccatttcgactcagagagagtatgttgtatccactaaccgtgtacacaagatagcccaac1140

taattaattattttgtgagctatcacccaaccttctgtttatcatggattcatggaaaaa1200

tgtaattgccatcattacactaaaaactaaaacttatgaaggagaaccattgtcttgcta1260

tatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatcctgtttca1320

aatctcttctactacttaagtttctaacgtaggcgtccacaaaacggattggtgcacggt1380

tctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttccccgccgcccc1440

ggatctcgcgccagccccagccctaccccgcctgcccttccattcttccccagccgcccc1500

ccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggttcgacgag1560

cgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccaggcaggtgg1620

gccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgtacgcgcgc1680

cgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcg1740

agggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacg1800

cctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaa1860

gaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctggtaaatag1920

atgccgcgacacgttctggtttggggatccccttggctaacaggacatacgacatttggg1980

gaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagttttggtgtt2040

ccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaag2100

taattttatgtttttgttttgtgtctgcagattcagaaaatggacctggaggcaaggagc2160

ctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaatctgacctc2220

aacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcaggctacccgg2280

gaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgataggacttg2340

actgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaac2400

gtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtg2460

tcattttgtcatgtcattacacatggttaggatacataggagattctgttttttaacaca2520

tagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatcttatacatt2580

aaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgtgtagatat2640

ggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaa2700

cctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggc2760

ggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgcccgaatct2820

cggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccagcttggcca2880

gcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggctaaaactgt2940

cacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacctttccaagt3000

tttactttagcggagtttttttggaacgagagcaaagcagcacagtttcaagaatgtggg3060

gcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagtatgcttca3120

tcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttattgagaaaat3180

aatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcggtgctaca3240

tatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaactccagtt3300

tgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttgcaatcatg3360

ataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcca3420

gcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttcactcctat3480

gattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatgtttctcat3540

tgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggctacaaagga3600

tgccaatggtattgctgctctctatattgttcttgttctaatgtaa3646

7

3646

DNA

Artificial Sequence

cDNA

7

atggcacactttgatgaactagaggataaaacaacagattatgttgatttatcggttcaa60

gaatttgctcttaagcaacctcaatgtggcatggcttataattactatggaaatttaagg120

ctttatgtagtagccaataaagctgaattggcctcttcaatatttgaaatcgataaggta180

aacaaaggcggagttaatgcatctatgccagtgaccacttccactcctaattcgaatcaa240

aattcatgaaccggttatggaacaaatagagaatcaagtttcggtgaggtataatacgat300

tcccctaacccatggaatttacctagtaaaaatcctgtagttaatagtgtactagtaact360

tctgtcaccgacttgaataaagctttgaatgagtataaaaatgagatgtctaaatttatt420

gagaatagcttggtgtatagattaagcctagtagaaacacttataacaagttgtatgctt480

caattttttttgatttttttggaagctactcatagttggagggtaccaaatttacaaaaa540

aaaattggtgattataatagtaaatctaccatagaacatgttagcttgtttcttgctctg600

agaggtgaagctagtagcatgaaaattgaatgtgcgttatttttctttttcacttactgg660

tacaatttttgcatggtttatgttgttgccctgcttgttgtattggttcatgggctggtc720

tgtgaaataatttggcgatagccattttcttttttgagatacattgcttttgctatatat780

atctagatatggtgcatatttaaatgcataataaaaatgtaaaaatctaaaacgtcttat840

aatttaggacagatgaaagtactagatattagacatttttagtgtttttattaaaatgga900

atatgtaccgcctttgatgctacaacttttacttagcttttaaaacacaccattctaaat960

tgtaaaaaaatattaaaaatgtgttttgcaagatgaatatactaacctttgttatgataa1020

tagttttcatatgttaatggaacaagctaaaaagtttggcaaagtatagtcctatagctt1080

ccatttcgactcagagagagtatgttgtatccactaaccgtgtacacaagatagcccaac1140

taattaattattttgtgagctatcacccaaccttctgtttatcatggattcatggaaaaa1200

tgtaattgccatcattacactaaaaactaaaacttatgaaggagaaccattgtcttgcta1260

tatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatcctgtttca1320

aatctcttctactacttaagtttctaacgtaggcgtccacaaaacggattggtgcacggt1380

tctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttccccgccgcccc1440

ggatctcgcgccagccccagccctaccccgcctgcccttccattcttccccagccgcccc1500

ccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggttcgacgag1560

cgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccaggcaggtgg1620

gccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgtacgcgcgc1680

cgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcg1740

agggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacg1800

cctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaa1860

gaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctggtaaatag1920

atgccgcgacacgttctggtttggggatccccttggctaacaggacatacgacatttggg1980

gaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagttttggtgtt2040

ccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaag2100

taattttatgtttttgttttgtgtctgcagattcagaaaatggacctggaggcaaggagc2160

ctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaatctgacctc2220

aacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcaggctacccgg2280

gaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgataggacttg2340

actgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaac2400

gtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtg2460

tcattttgtcatgtcattacacatggttaggatacataggagattctgttttttaacaca2520

tagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatcttatacatt2580

aaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgtgtagatat2640

ggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaa2700

cctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggc2760

ggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgcccgaatct2820

cggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccagcttggcca2880

gcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggctaaaactgt2940

cacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacctttccaagt3000

tttactttagcggagtttttttggaacgagagcaaagcagcacagtttcaagaatgtggg3060

gcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagtatgcttca3120

tcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttattgagaaaat3180

aatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcggtgctaca3240

tatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaactccagtt3300

tgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttgcaatcatg3360

ataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcca3420

gcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttcactcctat3480

gattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatgtttctcat3540

tgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggctacaaagga3600

tgccaatggtattgctgctctctatattgttcttgttctaatgtaa3646

8

10605

DNA

Zea mays

8

aggaatcttaaacatgtggaacaggtgctcaacacatttagcaactagttgttgatgacc60

cataactttgcagccttcataatgcacacaattgatgcatcaattgcatacctcctgtct120

ttgtcaacattttcaacaccttttttcttctcatcaacaggaggcgatggaatccaaaaa180

gagtgacaacaaaaatattagtataataactaaactctaagtctaccaaacagtgaaaga240

atagtgaaacaggaaataccttaattcttatcattatgttaataaatttaaaattgaact300

aaaaaacatcattaggtgagatggatctatttgtcggttccctgttaaggatcttgtatt360

tcacacaagaaagtgaatggagcaaacacacactacatcatctgacatgtttagttgtgt420

tgcaatttcaaaatatctaagcctgtcggatccattgaacaatagtagtaatgtggcatg480

tcaaaaaatgtcaaacatgttaatatagcaccattttttgtgatgcagaaatgacccgag540

attacctgatatgtcataacaacgagctcaatagggttagcatcaaacttcgcacttacg600

ttgcaggttcccatccatgtcagcttcatgtgcttcgttcttgtttgatggaactccttg660

aaaacatctacacatttcagtgcctcatgagactcgccatagcgagcggggtggaggatg720

gtgacaatttcggcagcctcgccgggggcagcactgcgacgagcgaggcgggaggggtgg780

caatctcgtgggggaggggccaggcacgaccgtccaaccacggcgggcggagacacggta840

tccatgtgagatatagccacctcatctagccttatctccaagattttaaatcactgatca900

tcataagcgatcagatggaggccagttcacatcgaccgacaggactgatactaacaggac960

catccacacttgcacatactataaagattaataagagattacataagaactaagtagtga1020

atcagacacccattgctgaccttgttaatcagcccatcgccaatgaaggtgctgctgatc1080

ttcttgaacaacctgtacaatgcctcgaccttgttcacgctgactgcaagaatgacaagc1140

agaggaaacccaaccaggcaggaaaatgatgaccaacactgaatagaaaaagtaaatgaa1200

cacctccagctatcaaaattttaaagcaatgtgaagtcctcaaagaaccaggacaacact1260

catgattttttataactaagggaattgtttatcatcaattcattctaaaatacaagacaa1320

tcaaaagaactaagcaaagcatgagatacaaaaattcaaagcacatgtatagtgtcttgg1380

taaaaaatttacaagatggtgaatgaattcaactcaggttgtctacttcagcattagttt1440

gcactgtccagaaaaagaacaacagcaagattggaataatgctatggccaccagaataaa1500

aggtcagagctgtcttttaatgctaatattgttcatgccaaacatttctttgttagcttg1560

tgaatttatacttggacactggactgggccttgatcgacgctggcaatatcatgctgaac1620

tctgaaggcaccaaaactgttagctccttccctcgtcaatttgtcaattcaacatgtctg1680

cttcaaaatggttatgcgtaggttgaagaaaagttgggagtttacaaaataatacaatgg1740

gatgcctgttctatcatctaacttaagccatgtatcaaggttgcaagttacataaaatac1800

gcttatattctgatggttggaaccacacattctacacgtttcccaaaacaatgaaaaagg1860

tagttgtcgaaagatttaagcatctaaagtgtccactctctctgagagcatcaaaataaa1920

gtagtacgtcttatgttttaaactatttattgaagtaccaaactatacggctactaaaga1980

tttatttagatgagtaaacgaaataatttatggtatataaattaagaaggggtgattagt2040

catgaaaaataaaatgtcacaattaccagcagcacgtgattttctaaataatttaagcat2100

gtgcggtgctcttccagataaaacttaggggacgaccacctagttcattgaaagagggga2160

ataaaccaagctccaactttcaagcttgtcaaggcttgtcattattaatttaaacaggac2220

agccaattctcagacatgatgttccaaactgctaatgaatatataatgctcaaaataaac2280

aactaggttcttaactgtcaattacacccacaagatgcacataattagaaaaggtaaaag2340

agaaggcaaatggaataccaggaattatatgactactaaatcatttatttagataagtag2400

atgaaataatttatggtacataatataagaacgggtgattagttatgagaaataaaaggt2460

caccattaccagcagcatgtgttgttctaaataatttaagcctgtgtggtattctttgag2520

ataaaacataggagacgaccacctaattcattggaagaggggaacagacgaagctccaac2580

cttcgagcttgtcaaggcttggcattattaatttaaacaggacagacaatgctcaatctg2640

aactgccattgtatctacaatactcaaaataaacaactagattctgaacaaccagattat2700

ttgtactcattccatgtctcataaacaaggaaaaaataacaaccagattatttgtactca2760

ttccatgtctcataaactttgggcaccatccatccaacacatccaatctaaacacaccaa2820

acgatggggaatggaaagagcagtattcgattcaacaatggcaaacaaatatcactgaat2880

tagaccaagaataaacctaattagacaacgacctcccaaccatcattcgtcaggctgtaa2940

agaagataaagctgccatggggcatggatcaagcagaacaccagagatgaatccaaacac3000

acagaaaatcacgcgcgctgtctacaatgacaacaagccccacatttcattgcagtacac3060

tgggctacaaaggcacgtacaacaaagagctagggaaacattgcggagggcacgagagag3120

cagctaacttgacaatatagcagactgagcttgcactgttagcaggcgaggaagggaatc3180

atggggacggagaatggggtccatgcccgcgaaggagaaggcggacgccgccacggtggc3240

accggcgcacgcgcacacagggaacccgcacaggcagccatggatgctgcctcgccattg3300

cgccggtcgtctctgccacgctcctctctctctcccgctgcatcgccgtggatggggcaa3360

gcagagagcagggactgcgacgatctgggcggaggactcgccttggagagcgcggacgca3420

gacgggattctagggagagagcgaagacggggcgcgcgcggcgctcgcgcggcgtggtgg3480

cggcgagattagcgggggtggggggagggcggagccgtggtgagggtgtggacgccctcc3540

ttaccctcttaagtagtagtagagatataatccgttccaaaatatccatccgttcaattt3600

atatttcgtttgatctttttaccctaaatttgattgactcatcttattaaaaaagttcat3660

aactattattaatctttattgagatatcatttagcatataatatactttaagtgtggttt3720

tagattttttttaaaaaaaaaaattcgcaaaaattaaatgaaacgacccaatcaaacttg3780

aaaagtaaaactaattataaatttgaacggaaggagtaagaggatgtttgaatgtactag3840

agctaatagttggttgctttaaaatttgctagtagaattagctagctaataaatatctag3900

ataactattagctaatttgctaaaacagctaatagttgaactattagctagattgtttgg3960

atgtattcggctaattttaatggctaactattagctatagtacaatattcaaacacctcc4020

taattaaaatggacaaatatctcttcttttggtcccttgcgttagatttttcatatctcc4080

ttatttagtataaaagaatcatcaaaaagtggacaacccctagtggaacaccattttagt4140

agtggttgcatgaaacctttcgcgcaccagtttctatgtgtcactctaaaaatgggacag4200

catgtacgtagtgcctatatatatacaagtcatctatcgttgcctcctcagttcatcact4260

aatcacacttattgtgccctcgacgagtatctatagctagctcattaatcgattcggggg4320

tgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgcaatacctgtag4380

cacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctggggcagagggtgac4440

ggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaaccatcctcgcctt4500

cctggaggccaggctgcaggagctggacggaccggaggcgaggctggcggactacttcga4560

ctacatcgccggaaccagcaccggcggtctcatcaccgccatgctcaccgcgcccggcaa4620

ggacaagcggcctctctacgctgccaaggacatcaaccacttttacatgcagaactgccc4680

gcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgcatccatccagc4740

atcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtttgtggcatgca4800

tgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctgaggaagccaaag4860

tacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgagacgagggtaagc4920

gagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgcagcctatcatc4980

ttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacgtcgtcgcatgc5040

gaatggctgcctacgtacgccgtgcgctaacatactcagctctttcctatctgctgcgcc5100

aatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggc5160

acgtccgccgcgccgacctacctcccggcgcactacttccagactgaagacgccaacggc5220

aaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacccggtaactgac5280

tagctaactggaaaacggacgcacagactccatgtccatggcggcccacaaggtcgatgc5340

taattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggttgcgatgacgc5400

agatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagc5460

cgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagg5520

gcctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacg5580

gcatggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacg5640

tcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaact5700

cgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcg5760

tcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacaggga5820

ggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggc5880

agctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagag5940

gctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtcagtcacatgctt6000

gtaaataagtagactttattttaataaaacataaaaatatatatatgttcttgaatataa6060

aattgataaccaaattaaaattcgaaccatcacttatacataattttactttatttttta6120

taaaacgtgaacgggaaggactaccgtgaatgactatagaaccaatcatactagtataaa6180

atatatgatgacactacgggagagacaaactttgtctggcgctaaatattttgccgagtg6240

tgaattcacgggcactaggcaaagatcttctttgccgagtgttacgctgggcaaagtaag6300

acactaggtaaatcagtcatttgccgagtgtccgccactaggcaaagcaaaacactggca6360

aatcaaaagtttacctagtgccagacactaggcaaaaaaaaaacgctcggcaaatcggaa6420

gtttccctagtgccagacactagacaaagaaaaacacttgataaactagcgtcgtcagct6480

aacaccatccaccaaccgttaacgttgccgagtatctgacttcgacactcggcaaagaag6540

gtctctttgcctagtgtcggtctggaacactaggcaaagaggcactttacctagtgtcgt6600

attttgacactcagtaaaataattttttttctttctgcttccaaactttttatgatgtgt6660

tcctatagcacctagaactacatgtcaagttttggtaaaatttttgaagtttttgctata6720

tttacttaatttattttatttaattgaatttcttttgataattcaaatttgaactcggca6780

aggtaagaagcgagggtagcctggaaacacactttgcctagtgttacactcggtacagga6840

gcctcccctgcctagtgctgcactcgacaaaagattcgcctttgcctagcgctgcactcg6900

gcacaggagtcgcctttgcctagtgctgcactaggcaaagcctccgttaccgtgccttcc6960

atcgtcatggaaacttttcttcgccgagtgacgtgtggcactaggcaaagtttttgccga7020

gtgcccgagaaatggcactcggcaaggactctttgccgatcccttcgttgccgacttctt7080

tttgccgagtgcaacactaggcaaaccatttgccgagtgtaaaagaggctttgcctagtg7140

tctgtggcactaggcaaagaagacgagtcctgtagtgaacctagtaggccagtgcgggac7200

cattccaaaaaatacctataaaaataaatttaatattaaattaaacatatggtccacgta7260

ccaagatattaaactcaaaagaacaattattacaatttatcttagctaaaaggccgagaa7320

aaagtatatgttaaaaaggagtgtgatcccatttttatagctcgctcggtcgatcgcccg7380

tccacttttaggtaacgaggtggtaccatgtaggagtgttgcgttgcgtgcgacttccta7440

tcatgttgggcttaggtggcttctcacgacccaatgataggcgagaagtgtggaagatga7500

acaaacctacttgtttcgtgcacgacgcatgtgtttgaacaacgagttagattagaaaaa7560

aaatataatgacttttttttttgcaaaagtgaggataatgaaaaccagaaaaactggtgc7620

ttcataagagtagagatttgatggtaaatatagtagtaatgcaatggctatactacacgc7680

gagagtccaatggcaagccggtgtgttggggcgaaggcgaagacgctacccttcgctcca7740

ggcctttgtcaactcgctgcaccaacagaggcaagatgaccggcgcggcccacccttcgt7800

cctcttcactgcaagacgaaggcctacgacgaagtctctccatcccacgtcctcgcctta7860

cctggaggcccacgtgggattcggcccatcgtaacaggccccgcacggacaggcgtgtta7920

cgggtttgatttgtaatagcttttctgtaatgacagtttgtaaccctcccttatgggaat7980

attctggggataatccaggtgtctgagggcataagcgtccttacatcgggacgttgggcg8040

ctcgggcacctataaatacccccgtacagtgcccttgagaggctggattaacatagcaat8100

tgccatctcgagttaaaccttgcttgcatcctttccactctcccgttggatcaacttgcc8160

caagagagctagttccaacatttggcgcccaccgttcgtgctacgagcaaaccacccgcg8220

atggcacccaaaagagctagttcgaaggcagccccatccgtcgacgaagcggcgaaggca8280

gcactgctagctgagaaaaagggcaaggccctcgcagacaacacccaccaagaagctggc8340

gaagacgaagcactcagtaagagacagcgcaacgatcaacacactctcgaaggcaccctc8400

cgcacctacagctccggaggccaaccacaagtaccacccctaggcttcgctccactagag8460

ggcgaggacacaacagaggacggcgaagtcatcggcgtctcagcagaagaacaactacag8520

ttatgggccctgcgcctcaagaaccgcaacctccaaaagcagaaagaaatcctcgaagcc8580

aagcgccaacgcgtctccgcgcaagccaaagtgcgttagatgatacgagacgaggagcag8640

agggcccgggaactagagcaagagattgcgctcatgcagagcgaaggacagcatgatcta8700

cagcatggcccacccctccagcagcgcgcgccagctagagatttattcattccccagcgc8760

gggcccttcatcccacacgccgcagctttccaaggcatcaactaccttgatgagcgaagc8820

cccctggcgccgcaactccaagtgtcaccttggcccgccaacttcagggcagggagctac8880

cccaagtacaatggcagcaccgacccagcacaatacatcatgagctatcaagtcgctgtc8940

gcatcatccggaggggacgacgccacaatggccaagtccttcatcatcgccctcgaaggt9000

ccggccttgacctggtacaccaggttgcccccactgtccatcgactcctggcgaagtctc9060

cgggacaagtttctgcttaactttcaagggtaccgcccagacatcgatgccttggccaag9120

ctgtcactctacaaacaacaagagaaagaaaccctacgggagtactaccgcaagttcctg9180

gctctcaagtcgcaactgccctcggtcgacgaccaaatcgccatacactacgccatcagt9240

ggccttcgggctggcgtcctatacagtcactgcatcaggtacccacccaaaaacctccaa9300

gagctctatcagttgtttgaaaagtacgccagatccgaagagctccatcagcgcaaggtc9360

gagtctcaaagaaagcccaaggaccctccgcagtctagccaaacatggacaagaccttca9420

cagtcagactccggtcgggacaaccgcagtcagcagcaggtgcataacattgccaaccag9480

caccccgccagcgaagcccctcgccgccaagattatccccccagggccgcggcaatggca9540

cgcgtggtcggggctggggacgggcgcaacagccgtgcagatattactgcctgttttcac9600

ggcgaagactgcacgcacccaaccaaggattgtccggaaacgaaggccaccagggacagg9660

atgtctcgggcacaacccgccgacaacccaagagttgtcgcgcacacataccaacaccac9720

cacccacaaccatacaaccacggccccgcccagcatctacccaaccacgcatatcaacac9780

caccaggagttacaagtcataccacctccacccccgcctccgcatcaaccaaacatccac9840

caccaaaatcaccccaagcaccaaaacaggaagacttcgctgatcagccgtatcgcggag9900

tcattcacatgatcaccggaggggtccagcattgactttgacacgaagcgacaaaagagg9960

aatcactaccgaagcatcaaccacgtcgccatcaccggctcggtcgtgcaaacgaagtgg10020

tcacatgtgccgctaaccttcgacgccagagatgttgatctgcgcagcgcaccccacatt10080

gatgccatggtaatcaactgcagtgtggcaggctgggacctgcacaaagtcctagttgac10140

aacggcagccaggcggacatcatcttcctccatgccttcgaccgcatgggcatcagccac10200

agccctctcaagccttcgaacaatcccctatatggcttcggcggcaagggcaccttccct10260

gtgggcaagatagagctacccctatccttcggcgtagcacccaatgcgcgaagcgaatag10320

gtcacctttgacatcattgacatggtctatccctacaacgccataatgggtcggggctct10380

atcaacaaatttgaagcggcaatccacggattatacctctgcatgaaaattccgggtcca10440

caatgcgtaataacggtgtacgggaaccagcagactgcgcataacattgagagagatttc10500

gttcccgatcaacggaacgtacactgccttacgacgcagcgcgaagtccccgaggctacc10560

tgcctagctgccaacaaaaatgaaaaggcacagctaaaaagcaac10605

9

11001

DNA

Zea mays

9

aaatggccgaagctattttggatgaagccatctctcgactattaaacgaagctgcggaag60

cagttttaaaagaagaatagttgttattgtaaaaacatttggaatgtaatatttgctgaa120

caaagtgtgtaatatttttataatttgaatgtaatatataagctgctcgtaactcaattc180

tttacgatgcatgaaactttacgtacataccgtttttgagccttcggcgaaaaaacacct240

tcccttcttttcatgcttcgtgaagaatatccatacttcgtaaaaacattatgcttcata300

agcaatagatctctttttcatattagagttgatgaagttgtacttgttcaaaacttattg360

tgccttggcactgcttcttcgaaacaatctcgaagatcaacattgtatccccttcttgtg420

ttattgatgcaatatgatgttatgctatgcaaaatgatgtgatgatgttatgctatgcaa480

aatgatatttatgtcgaagatacataaacattcccacagtagagcacacaatctttttgc540

cgtttatttttcggcttcaccgcttatttttcggtgtatcagcgctgacttttcgctgta600

agcctcccttaggtgcttcttcgccttttacttcggcggtatttgcgttgactttttgcg660

cttcgccttatacttcggtggaatcagcgtttatttctcgctgtaagctctgcattccct720

ttggaacgacttttgagcagaaaacttacgctgcgctcccttagaaatgactttttgtaa780

cttcggcaaacttacgctgcgtttcatagaacgacttttttgtagtttcggagatacttt840

ctgtagccacaagttcttaagaacgagttttcatgcttcatcaactttttgaattccgta900

agtctgtggagaagatatattttcactatgacaaaaacaaagctgttacaagaaattgaa960

aacaacaagaaaaacttaggctttcaatgattgttctttattaaaaagaaaaatgataac1020

taatgcaagaactatttcagaagtaggatatctgttagtagatgtgctttgactctggca1080

caatactgttgactgtgcgagcttcggactcctctctgaagtctcgttgctgatgagtgt1140

gctggctcccttctggctgctggcctcgttgtattggtggtggaggtggaagctgttgcc1200

aagatgcctgaggttggcttgccgaagcaacagaagctgcaggatggttacccttaccca1260

catactctggaatgtacggtgagtggtacgaagcagtatgcataacttgcttcggctggc1320

tctgttgggctgcagtttctgctatctctttctgtttctggatggtgacatggcacatcc1380

tggtagtatggcccttgtcctcaccgcagaatagacaataaattttcctgggctgatccc1440

caaaccttcctccgaagcccctggctcctctgccccttggggctggaggccgaggataac1500

tctgttgctgccccgaagcctaggaggaatattgcggcctctgttgctgacttcccctgt1560

cgtcattctgagtggaatgaattgatctgacatgtctagggtgtactctccctccgaagc1620

ccctggtcatctcggagaatctgtaagaaggcccccaacacctctctttcccattcgttt1680

aggtgagaaaacacattggggagcactagggagttctttcctagtgaggcgtctgtgatt1740

tgataatcacaagattaaggatttcattagtgcatgtgtagtagcaagtgtgcatccacc1800

ttcctcattaagcttgtttaggataagccagagtttgtgccggttactcttgatgttcaa1860

caacaccaagatggcttggtggtaattaagagcttggtgatctctcagtggtgctcgtga1920

gagtcccaactcattgtgtaataaaagattataggtgattcaccatgccggagtggtgaa1980

taatcaacccgtagagagcattgagtccttgaatggatcgatggggggctacacccttgt2040

gtgggtcaagtcagagttttagcagttcttgcacccatgatctcatcgtgaagcatagat2100

aaatttaaattcttttgaattatttatatatgacaacactattcgtcgctctaggtgact2160

atcacctaccctaaaatgacttaacaaatctttattaattgttaagtcattcacattttt2220

gttaatccactccaaagtcagggtgtttagtgtttttacatccatgtctccttagactca2280

cggtgtctctcccagattctctctcaccctcacctctctctcactagccactagggaacg2340

caacacccatcgatggctcttcgccccatgaaacgttcacacaatcgcaattgtcgaggc2400

atgcatggctgggagagcagacatggaggcatacgtgctagggttgcacatgggcaagag2460

ggtgggtgtggctattcagatatgcatggtgagcaagatgggtgaggttgtgggcatgat2520

gaggggataaggaagaataagatctcttttgttaggctgtctccagcagctatcgtatcc2580

cattccctatcgcatcccctattttaaactttactatgcaaacaatgtaatatatagtgc2640

agattccctattttacacaatgtgttgtagacaaccttggagctcttgcataaaagctct2700

agttttggctctagctcctctgagaaaacaatccccaccatgtttttaggaagaatccct2760

gaagggcaccccatttggttggaaatacatctcctcctacaggattatgtttgacttttt2820

tttgcaatgtgggacccacaggggagaggaggacgagaaggaaccggagagcctattttt2880

tgggctcctggcttcgcttggtttctaggggcggctccttcctattttcacaaaggagct2940

agtagaggagcctcccatttcatgattttttgaaggatctatttaaggagccttgaaaga3000

gccctaccaaggtaggcctagaaataataaaggaggaaaaagagaaggtatcacaacttt3060

tgtctacaacgtgaaaatgtttggctaaatagataaaacagtttgaattttatcgattca3120

attgtttattgagggcatgtttgggagggctttagttctagcttctttcgcgaaaaatcc3180

agagccctacaaaatgacgtttggtaaaacgacttcttccgaaaaacacccaaaaaccca3240

agatattttatactacgaaggaaaggtcacacatcctagttagcttcactggttctagct3300

ccttccaattttgcaaaaaagtcacaaaggataagccattttttcaaatgatttgtgaaa3360

tgcctacgctaaaaagtctacttttccaaaaaaactagagctagagccgtttttggcaag3420

tcagaaccctaccaaatagtccctcagtttaagcaaagtgaggctatactgaagctaaat3480

tatgccaaattgggcctacatctccatattttcaaccaaatgctttagggtttcttgtaa3540

tcgacatgatttgtttcttcataaatagtatatggaccgctccaaaatactccatccgtt3600

tcaatttatattacgtttgatctttttaccctaaatttgatcgactcgtcttattaaaaa3660

agttcataactattaataatctttactgtgatatcatttagcatataatatactttaagt3720

gtagctttgattttttttttgcaaaaattaaatgaaacgacccaatcaaacttgataaaa3780

aagtaaaactaattataaatttggacataaggagtaggagggtgtttgaatacactagag3840

ttaatagttagttgtcttaaaatttgctagtacaattagctagctaacaaatatttaggt3900

aactattagctaatttgctaaaaacagctaatagttgaactattagttgaactattagct3960

agactgtttggatgtattcaactaattttagcagctaactattagttatagtataatatt4020

caaacacctcctaattaaaatggacaaatatctattcccttggtcccttgcgttagattt4080

tccatatatcctcatttagtataaaaagaatcatcaaaaagtggacaacccctagtggaa4140

caccattttagtagtggttgcatgaaacctttcgcgcatcagttactatgtgtcactcta4200

aaaatggggcagcatgtacgcagtgcctatatttatacaaggcatctatcgttgcctcct4260

cagttcatcactaatcacacttattgtgccctcgacgagtatctagctagctcattaatc4320

gatcaatcggggtgtgcggtcgaaggcggcaatggcgagctactcgtcgcggcgtccatg4380

caatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctggg4440

gcagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaac4500

catcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggc4560

ggactacttcgactacatcgccggaaccagcaccggcggtctcatcaccgccatgctcac4620

cgcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaactacttttacat4680

ggagaactgcccgcgcatcttccctcagaagtgagtccgatgctgccgccattgttctcg4740

catccatccagcatcgtacgtcctctatacatctgcggatgatcatttgcgcatgtttgt4800

ggcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctgaggaagccaaag4860

tacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgagacgagggtaagc4920

gagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgcagcctatcatc4980

ttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacgtcgtcgcatgc5040

gaatggctgcctacgccgtgcgctaacatactcagctctttccgatctgctgcgccaatt5100

tgcaggccaagagcacgcctctgaagaacgcgctgctctcggacgtgtgcattggcacgt5160

ccgccgcgccgacctacctcccggcgcactacttccagactgaagacgccaacggcaagg5220

agcgcgaatacaacctcatcgacggcggtgtggcggccaacaacccggtaactgactagc5280

taactgcaaaacgaacgcacagactccatgtccatggcggcccacaaggtcgatgctaat5340

tgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggttgcgatgacgcagat5400

caccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaacccgtc5460

gaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggcct5520

ctacacggcgcggcagtgctcccggtggggcatctgccggtggctccgcaacaacggcat5580

ggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtcgc5640

cgcgatgttccagtcgctccacagcgacggcgactacctacgcatccaggacaactcgct5700

ccgtggcgccgcggcaaccgtggacgcggcgacgccggagaacatgcggacgctcgtcgg5760

gatcggggagcggatgctggcacagcgggtgtccagggtcaacgtggagacagggagcga5820

ggtacgaaccggtgaccggagaaggaagcaatgccgatgccctcggtgggctcgctaggc5880

agctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaaccccagaa5940

gctctagatgtgcgccctacgatatctaagacaagtggctttactgtcaatcacatgctt6000

gtaaataagtagactttattttaataaaatataaatatatatatattctgataaccaaga6060

ttcgaaccctcacttatacacaattttatcttattttttataaaatgagaatggaaagga6120

ctaccgtgaacgactatagaaccaatcatactagtttaaaatgctcgtaagctatgacga6180

acctagtaggccggtgctggaccattccaaaaaacctataaaaataaatttaatattaaa6240

ttaaacatatggtctatatatcagatattaaactcaaaagaataattattataatttatc6300

ttagctaaaaggttgagaaaggtatgcgttaaaaaagagttttaacccatttttatagct6360

tatttgatcgcccgtccacttttagggagcgaggtggtactatgcagaagtgttgcgctg6420

tgtgcgacttactatcatgttgggtttaggtggattctcacgacccaatgatagacgaga6480

agtgtgggagatgaacaaacctacgcatttcgcgtacgacacatgtgtttgaacaacgag6540

ttagattggaaaaaatataatgaccttttttgcaaaaatgactacaatgaaaaccaggaa6600

aaccggtgcttcataggagtagagatttgacggtaaattgttacgatctactggtatttg6660

ctgcgaggatgtattcgcttggtgaaaacagaattacagagtagcagtagcagggaagac6720

agtagcgagaggagaagaagaaacttgaggaagaagaagataaatgtagttgttacatcc6780

tgccttcgccgtaggtctcagcgagcatatatcttcaggtcctccattctgggccctgga6840

atctcacattggccttacgctggcgtgttcctcttctcggcccaactgtagtcttctctt6900

gaggcccaccagtctccacattcctttgttgctgctatagctcctcggacacggctgctt6960

ccgcctgctgctgcacctggatgtcttctgaagtcgacttgcgtggagggacagtgctgc7020

cattcccctcccgataacacgctgcttgtccccaagcaggcgctcgagggaacctctgac7080

gaagtggaatcaggtcctcccaagttgccagagatggatgcaactcagaccacagaatca7140

accgttgtgatgctccattaggcccccatcggcattgtagtactcgttcaggaatttggt7200

ggttgtcaaggcgatcaggaagctgtgccaccaccactaaagaacccactgccttcttga7260

gttgtgaaacatggaacacgggatggatagtagaagtaactggaagctccagcctgtaag7320

caacagatcccaacttagcagcaactggaaatggcccaaaaagcgaaaatctagcttctg7380

atttgcccgaggtgcaagcgatgactgcacataaggctgcataacagccttctcttgcag7440

ccactctccgaggataggcacaggagtggaatccaaaatgtcaatgccaaaatgcttggg7500

tgcataaccatatagcacctcaaatggagacattttcaatgctgaatgccaactagaatt7560

gtaccaaaattctgccaagtacaaccagtcaatccatttgtgaggacaagcatgcacaaa7620

acatctcaaaaaggtctcaaggcactgattaactctctttgtttgtccatcggattgagg7680

gtgataggaagaactcatattcagtgatacaccagccagggtaaacaatgatttccagag7740

ctgactagtaaagattttatcgcgatcagagaccatagcagatggcataccatgcaggcg7800

ataaatgtgttgcataaaggccttggccaccactgcagctgtgaaggggtgtttaagggg7860

aatgaaatgtccaaacttggagaatttgtcaaccacgacaagaatacaatttttacctcc7920

ggacacaagcaagccttcaacaaaatccatagtgatcgtttgccaagctcctgaaggcac7980

atgaagaggttggagtaggcctgggtatttcactctctcaggtttagcttgctgacaagt8040

ggcacatgctgcaatgaactggatgacagatttcttcatgttcggccaggcaaacaactg8100

cttcagccgatggtaagcgactgctatacccgagtgacccccaacagcagaactatggag8160

agcagacaatatagactgttgaagtgtatgattgttaccaacccaaatacggcctttaaa8220

cttaagcaacccttcttgaagagtaaaatgaggaacaacatcttggtcaacaaccaactt8280

agataacaaggtcttagccgaagggtccaacaaatatccatccattaccaaacgagtcca8340

ctatggtgaacagactgagagagcatgtaatgtgatagcatgttgtcttctcgacaaggc8400

gtcagcaactctattttcatgtccatgcttgtatacaatcttatattgcaaccccaaaag8460

tttagtaaagacttttgctgccatggagtgttaagccattgctcattcaaatgcaccaaa8520

ctcttttggtcagtataaataacaaactccccatgaagtaagtaagctcgccattgttcc8580

accgcgaccaatatggctaagtactccttctcataggttgataagccttgagtcttaaca8640

ccaagaggtttgctgagaacgctaatggatgaccattctgcaaaagtacagctcccaccc8700

cattcttgcaagcatcggtctcaatagcaaaaggttggtgaaagttggataatgctaaca8760

ccggggctgagatcacagcttgcttcaaggtattgaaggagatttcttgatcttgagtcc8820

aaacatagaacacccctttctttcacagtgcatttagaggtttggcaataatagcaaaat8880

gactgacaaatcgcctataataacccgccaaaccaaggaagctccttaactctttaacat8940

tggagggcacaggccagttcaacacagcatcaacctttgcaggatcagtatccactccag9000

cagcactgatcacatgacccaagtaagcaatagatgtttgagcaaatttacacttagact9060

tcttgacaaaccagtggtctttttggagaatggtgagaacttgggccaagtgagatacgt9120

gatcgtcaaatgacctgctgtagactaagatgctatcaaagaagactacaacacacttcc9180

tcaacaaaagggccaaagaagagttcatagcgccctgaaaggtaccaggtgctcccgaca9240

atccaaaaggcatgactcaaaactcaaattgaccatgatgtgtctagaacgctgttttaa9300

actcttccctaggcttcaacctcacccgatgataccctgaagccaagtccaatgtggtga9360

accaacatgcaccatgcaactcatccattaactgttcaaagatggggatggaaaacgggc9420

tttgaccgttaaagcattcaaatagcgataatccacacaaaactgaaaagtgccatcctt9480

ctttctcaccaacagcacaggagaattaaaagatgatgcactgggtctgataatccccga9540

ctgaaccatttctgccacctgacgttcaatttcatctttcggagctggtgggtatcgata9600

gggtctgatattaactgggctggcaccagcaaccaatggtatactatgatcacaacttct9660

ttctagaggtaaggacataggtttagcaaaaaccgactgaaactggttcaacagctgaac9720

aatttcaggaggaagggtagcctcatcagtcagagcaacagacacttgagatagggaaat9780

ctgaaccaacaattcgtcaaccggctctacagtttccccttgcaagagcacttgcacacc9840

atgataagggatcaacatccatcgttgtttccaatgcacttccataggactgaaagtctc9900

taaccaatcaagtcccaatatcacatcaaaggattgtaatggcaacaccttcagatcaaa9960

ggaaaacccatacccctaaattgtccattgagcttgagtaaacacttgggagcaagtcat10020

aacaccttcattagccactttaacctggagacaagttggtaccaaagtaatgtcggttaa10080

cctggaaatcatagtgtgactgataaaagagtgtgaactgcttgaatccacaagaataac10140

gatttcatactcctgtatagaccctctcaacaaaatggatcctttggcccttgatcttga10200

aacagcctcagcagatagagccaagaagagttgttcctcgtggactgaatcttctgatgg10260

agaagactcattcagaaatgcaggcataagatcccagacttcctgcagggcatgaagttg10320

aactgtagtattgcagacatgtcccctatgccatttttcagcacaacggtcacaaagacc10380

acgtgcatgacgataagcacggagtgcagccagtttttcatcagtgcttcgatcagtctg10440

catgagacgctgcttaggtaaggatctcggcgcactggcacttgctgcagcagtagttgg10500

ttgaggaagaggcagggtcgtcttgtgctgggatttcgaccaaataccacgatcaccctt10560

gcagaattcactccgcataggtggagcagcgacctcatcctgcaaaaaagcgagggaaca10620

ggcaatatccagatccatcgacctttgaagcataacaacagcacgcatatcatcgcaaag10680

accatccacaaaccgtgtaacaaaatataacggatcaaccccagactcatacacagagag10740

ttgatctacgagagaagaaaattgttcaatatattgagttaaactacctaactatttgat10800

gcgaaataagtggtgcaacaacagctgatactggtccttaccaaaccgttcgttcatcaa10860

ttgacacaagagaggccaagacaaacggggatgacgaaacataactgacagaaaccagca10920

cgctactgtaggcgacaagtgcatagaggcaattttgacccatgaattcagatcaacttg10980

atatatatcaaagtaattctt11001

10

1284

DNA

Artificial Sequence

cDNA

10

atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60

agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120

ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180

gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccggaaccagc240

accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300

gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360

agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420

cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480

atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540

aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600

ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660

tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720

atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780

tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840

ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900

atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960

gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020

ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080

gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140

tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200

ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260

tctagatgtgcgtcgtacgatatc1284

11

1140

DNA

Artificial Sequence

cDNA

11

atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60

agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120

ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180

gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccggaaccagc240

accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300

gctgccaaggacatcaactacttttacatggagaactgcccgcgcatcttccctcagaag360

agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420

cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480

atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540

aagagcacgcctctgaagaacgcgctgctctcggacgtgtgcattggcacgtccgccgcg600

ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660

tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720

atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaacccg780

tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840

ctctacacggcgcggcagtgctcccggtggggcatctgccggtggctccgcaacaacggc900

atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960

gccgcgatgttccagtcgctccacagcgacggcgactacctacgcatccaggacaactcg1020

ctccgtggcgccgcggcaaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080

gggatcggggagcggatgctggcacagcgggtgtccagggtcaacgtggagacagggagc1140

12

16619

DNA

Zea mays

12

atcttttattggtttgagttgaacctatatgcacctgtagaatataatctagagcaaact60

agttagtccaattatttgtgttgggcattcaaccaccaaaattatttataggaaaaggtt120

aaaccctatttccctttcatccgggcccttgcggcggaccgtccgcgacaccagggtgag180

ccttggacaggaacactgcaaaaacacaagttaacactacggatcgtccgatggagaagc240

gagcaccgtccgagaccaagcacggaccgtccggcctcaggcgcgaatcgcccggtcgtt300

gaaaaaccagaaaaacccgaaggtgacgggttcggtaaaatgcatttttagcgtccttgc360

ggatcgtcctgggtgcacggtcggaccgtccacgactgctttatctgacatttgacgacg420

cattaaaagctctatagccgttactcctgaccgttgtgatttcagtcgttgatgtgcagg480

ggtacggaccgtccgcggtcggtagaaaatgagcaacgactaggaagtggttggaggcta540

taaatacaagagaaaattcctggtatgccatcagattttatactcatcccttgtgtgcca600

ctgagtggcatataagtatatttttttgtgtctatgacatgtggggccagtggcatacaa660

ggaatgagtatatttttcagtggcatacagggaattggccctaaatacaaccccaaccac720

ctccattcaaatgatccaagcactccactcattcacattcaatacaggagctagcaatac780

attccaagacacactcaaagctttcaatctctcaaagtcccacaatttagacaagtgatc840

attagtgcttagtgacttgagagagtgtgatctatgtgttatttgtcgctcttgttgctt900

ggctttcacaattgggctttcttcatctctttctcaaccttctaagtgaattataaagca960

agcaagagacacctaattttgtggtgatccttgtggggtcttagtgacccgtgtgattaa1020

gaagaagcactcgaccggtctaagtgaccgactgagagagggaaagggttggaatagacc1080

cggactttgtggcctccttaacggggactaggttctttggaatcaaacctcggtaaacaa1140

atcgctgtgtttatttgtgttgattttcactcgatttgtttcccctcccttcctctctct1200

aaaattcccttgctcatattgttgtgagttggctctcaaagttatctgcattgattgggc1260

aactacttgcaaggataactatcttccgcactccgaattatttctgacattaaccccggg1320

cataatgtgtgttttaagtgtataattttcatgtttcgcctatttacccccctctaggcg1380

actttcaaatgttctccttcacttgtgatgtctacaaccataatcagctcaacatttgga1440

ctatcacccttgaacacttatgttgaactttaaaagttgtgcactaagcacttgtccaac1500

acttaacacacttgtcagtcctttaattgggttgtcatctaaaccaccaaaaaccacaaa1560

gagatctttcaccggggtccgtggttcatggccgtactgctcggtctcaagatttttatt1620

ataaaatcactagagctctactatttatggttcggtgtgccatcgaaccgtctccaacgg1680

gtacatccaacgagcgccagcacaccaacgactagttgacgtggtctacggtccagaggc1740

tcatcagacttgtcaggaggctcgtcgactggtctggtgccacacccgtcgacttgggat1800

cgaacaagaatgatggcaagaggacgaagcgcatcaagaagatcatctactacgactcct1860

cttacccttcacacaaggacgacgattccacctcctccaagaaaaatacggttaaacaag1920

gttactctaagacatcttttatttattctcgcattccttacaatttcaatgctcatttgc1980

tttctattcatcttggcaagcctgctcgctttgatgggaggactattcttggtggagcca2040

taaaatgcgtagccatatttttttgctccaccctagcatttgggatgtcgtagacaatgt2100

aatgcaattgctaggtagcgatgataaaaattataatactattattgcccaataatctat2160

tcataatagcgcccaagctaaagaggtgcgctaagggatcctttttatagcccaaagagg2220

tccataggcgttgctccttccttctaaacatcgatgaaattgtgttgtctgcgaggacat2280

caaaccgggtctgtgcacaccttctccgggatctgatttggtgctttccttaactgattc2340

acagcggaccgatccgatgcaccaccggaccgtctgccacatcagctagtcgttggcctt2400

caaactccaccgggaagtagccgttggaggcggtccggtgcgccattggaccgtgcaaag2460

tgaagggtcgatgatttttataaataaaatctcgagacctcaaagttcagctctgtaggc2520

ggtccggtgcacatggacctggttcgatgcattaggtcgcctaacactaagttctatgcc2580

ctgcggacttgatccggtgcaccaccagacgagtccgatgaggcctagaacaacccaagt2640

aaggctgttttgagcctaacttcttcaaatccttttggctattcttgggagctttccaac2700

aacttagacaaacataattagcacatattccaattgattaggtgtggagaactcaccttt2760

tactttgtcgttcaccatgatttgcattttggcttaatctaagtattcgaaccacttttc2820

tcacaggatagagttagagttcaaataaagtgctaaacacatagtattagacacatgcaa2880

cttatctaagtaatcaaacctcatgattttacccttttgtccaaagctgcacactttagc2940

cttcattttagttctttaggatctagtactttcaaattgacttcaagtgcttgtgctcgt3000

actcatatcaaattagttagtccatgttgttgtgctaaacacttaatcactaaaacatgt3060

agaaatggttatctaacacatttttctttcataagtaaaacaggagatttatattgtaga3120

tgttattgtttgttgatgaaatttgaaataagagatataagagcaactcgaaaagcctag3180

ctaaatcgatttgtatcggtaaaaatagaaaactgatgattaaaataggatccaacaaac3240

tctctttgctcctctctatgctatcctgctcagcatcacgtcgaggtctctagccatatt3300

tgctgagctcacctgcctcgccatcttcattctctcgtgcatcctcaccgtccctgcgcc3360

ttgccgtcgttgcagctcactgtcccacgccctgctgtcgccatgcctcgccgaccctgc3420

ggctcaccttaccgcgcctcgtcatcgtcgcggctcgccgtctcgtgcctcgccaacccc3480

gctccttgacgtcctcacggcttacagtccccatcgtcctcgtggctccacgcctcgcca3540

tatttgcggttcactgtccacgtgccctgccatcctagagcccgacatcgatatgtcaaa3600

tgagacaggagaaaaagaaatgaacatgtgattataatcagtgatttgaatattgataga3660

taagatttgaagagtctgttgtgtcatatcacctttttatgaaactctttattttttaga3720

gtttttataaaactctaaatttagctaaaattatatctagtcttttagagttactctaac3780

aagagatgagatagcgagctggcgagctgctggagacagccgagagtagagatagtagag3840

gagactagaaactccattaggcctagcccagagtcagctagggatgccgcccgatccact3900

acgtactgaaagcgatcccggcccatgaacgctagtgggttgtaattcggcttacccaag3960

tacccagccgttcctccacacctctgcactacccgaaacccacgcccaacggccgtttcc4020

cgcaccccctatccgggaaggaagaaagccagaactcacccctgcttcgtctggcggcgc4080

cgtttcccgcacgcgatgccgtcgacggcggaggatttcccggcgataaggaagctgggg4140

aagctcttccggcttaccgaagtgtacctctggtaagtctcctgtctcccctacccgctt4200

ctcagcgtagggtttgccgtttgcgaggagtacgtctcctcaaactactctctcttcctg4260

cagggacgattcgtatggcgctggacctcacgatggacagaagaacgggcgctcggcgga4320

ggctgctctcgtggtacggtctctgacccgctttggtgtagctcactcttaagctttctg4380

agttgggggtgcgcttgtgcttcgactagtagatggctaatttcgtcgggctactggtaa4440

tttcttggtatctgcattgtcgagaaagaaggcccgacgcataatttcatgcttgcccaa4500

gagtctacttaacacaaggaattggttttgtggcgtggtttgtgcattgcgcccaaactg4560

tagcctgtacaaaatgttaatcgtcgcgtgcattttaaacaaagttttgtattatacgac4620

aaaataccctcggcacatttgttacagactaccgataagtgcatacctatttctcctagt4680

tctatcaggaaataatcctggacctcgaaatgacagcctcgtctggctagaaccctacta4740

aacattttgagtgatcacttttcattactcattttcttgatgaaagcacattactgacat4800

ggaagtttgctacataagacataacacttccttgtagtgctttatttaattattgaccga4860

tgatctttttggaaaattaagctgtattaaacaattgtagcttcggtgatgattgttgga4920

ttaagcattagtctgctgcagtcctcttcttgattctgatatgacagttatttgttgatt4980

aaaataatgatggtttgctttacacttcgatctcctttgagaggaaaacatgtgaaggtg5040

tggactagatcatgtatagaccaacagcattatcttattaaaacacttctaaataactta5100

gcaatttcataaccatttttacacttctgaggaattcatcttgtcgtgaaagagtcaatt5160

aacttagctgcttagcagactgtgtcaagcttattacttgtatgttgtgccctacaaatt5220

actatgaggtttataatgtacatagcaatttgacgaccttcaacttttcaggactctcat5280

acagataaaacatgcaatgaagcatccaatagcactgacaaaggttagatgtatttttct5340

tgtattctagtatcttccttggtcaattttctttacagaggatgttacaatgtactctac5400

tttttttgtgtggaaacaacccactagagaaaaaaaatcacttcatttgagaaatcttaa5460

gaatctgaactctgaagctcagcatgcttccacaccaccacttttcaggccactgtctct5520

tcattggagtaaatgacgcttcttttagagagaaagagaggggggggggtctgtttataa5580

ttgaatcaagagattttattggtcacctgatttctgttgcatgacgtgggacctggatag5640

acttcagatttgccttagttgataagttcaccggcactagtgaaagaaaagtatatggta5700

caggtactcttatgaacaggcaaccacttagcaattcagcatctaatagagaaggaccaa5760

gtcttcaactaaagtcacaatatacctttagtactattagaggggctgaccctccttgtc5820

tagtgcttgtgagatcataggagatggggcgtggtggttaagattgtggatcatattcgc5880

caagctcccagggtcaatgtgattgagggatgtggtatgctacttgtgaaagggttcaaa5940

aaggccaggatgacattgttcctacattcctggaatggggatgacacccccaggacaagg6000

aaggtggcacagttccacgttcctggtgacatgttgtgtatcaaatgggaggccaaatca6060

ccacgggattactctaggaagggaagatgatgttgataatttgagtcattgttgcaacat6120

ctgttcatggtttcatgcctcattttataagtcatattgcccacacataacattgtaata6180

gtaaaatcaacaccagttattttacgttttcccttgtatgtcaaccgatttcttactgtg6240

tatatgatctgtctatcaataggccatttctttgttgaagatttggaattggtcaatctc6300

atgggttctttggggcttcctgtttcattcagcacaagtaaagtggtcagttgctcacat6360

agtgtgacaccatgttactgttccctaccggttccattgcttaattcctttatgcattgc6420

agaacaagaacacatgcaacaagggaaagaaaaaaggaagacaagcaccgctcaaagcag6480

caaacactcaaatcaatgatgctgtgaggatatgtatcaatactgaagatagagaaaatt6540

ctgttgaatcattggatgctatggagcaaacgcactcatgcaatttatttgtgacaccac6600

tgggtcaaaatgaaccctcccgtgatgacactgacaagaggcttagggaagacagctctt6660

gtgttgaagaacaagaagagtctggctgtagcaccatctactctgctggcaaagcccctg6720

gctgtgatgctaaaaatcatctcactgaacttggggcttttgagctttctgataacttgg6780

ccaactcagcaaaagaagaatactcaattcaagaaaatcaagcttatgaaagtgtgttgc6840

tagattctgaagagatgtcaaggaatgactgtgttgatgatgaatctacacattcctgtg6900

ttggcatttatcaggatgaaagagtgtccacaaggggagatcaaacatctgaagaaactc6960

tatcagtaccccatgattacaatgatgttggcagagaagctagtctaagtttggcagagc7020

catcatctattgatgagcatgcacaaagctctgccaacaacttttactatgactatggtg7080

aatggagggttatctgggatccattctataatcggtattatttttacaacatccagacac7140

aagagtccacatggtgtcctcctgaaggactggaggattttgcatcatattgtagcccag7200

ataccactaaagagctagctgaactgggatctcagtgttcaagcatggcaccacaagaga7260

acagtaaaaaccctagtctcgtcctttgcagttgacattacgaatagttatatgcactac7320

gataaaaactttctacaatatgtaacacttgagcatgtggcaatgggtgtaaacatttaa7380

taataaggtagtgaaatccattacacacagtattgaattttgcactacaaatgctgaagg7440

agaaacctaaattgtcaatgctttttggtgacattaattattgccattgatttcctgctt7500

gtaggtgcttcatttatctgtctccaatttactcatatgctagcttcttgtttgggacta7560

aaggctttgctgttgttttagtatgtcacacatttctctttaatctcaccatcacagatc7620

tggctactcatgtcaatcatttagaagcacaggagcaagatcactgcattcatgatttat7680

ctgacattcctgttgaaaagccaatatatcaaaggtagggaataccaaactgtacaatgt7740

tgaacaagttattgtttttttttgttaattctgttcatctatgcagtatgataactacct7800

ctgacaaagcacagcacactgaaaataagtacagcgattcaacaactactgtgttagaga7860

tgaaccaggaagttgctagcaccaaaacgaaaaagagagtaaggagatctcgatcgtgta7920

aggcgataatatatggcatctgctttctaggagtttgttcctgttacaattttaggttgc7980

gcatttacacaatagtttcttggtttctttgagcaaatgcagctttgcatgactgctaca8040

ttgcctacttatgtctaggtaacttttctttgcaaactgcaaagttatgtctaggtaact8100

atgccttctagaaaacctccttgttagctatgtattagtgagacttgcctaatatttatt8160

ttcttgtggtccgttcttgtgctctttgtacatatttgccaataaccattttaattgttc8220

tacagatcattcatgccaagacatggcagggaacgtctctaatgacatcatcaagtactg8280

ggctcagcggtattcacttttctcactttttgatagtggtataaagatggatgaagtagg8340

gtggttttcagttacgccagagccaattgcaaagcatcatgcatctcgtgtgggtgcggg8400

agtaatgattgattgtttcacaggagttggtggaaatgccatccaatttgccaaaaagta8460

cgtcaatgttatcttgcaattgagttatgtgatggtctaatgtatcatttgcttgaacac8520

ttcctgtttagtagcaactgttatttttcttatgtcacgagaatgcaatggctatatcac8580

cttaagcagtatgctatgtccactgtccagtttaactaaggcatctgcttccagtaatat8640

gcaaggctcttcttacttttgctgttatttaatatatggaagtgtccttacggaggtgtt8700

attgtggacattttgagcatgttcatcatgtcacttgagttagtagagccagccttagtt8760

gtttgcagtgtaggtggatttattttatgttatcaatgtttcttctacagtactaagact8820

attgttccacattaactatgtctccttttccaggtgcaagcatgtaattgcagttgatat8880

tgatccacaaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatca8940

catagatttcattgtaggtgattttatacatatagctcctcatctgaaggtaatgccttt9000

ttcttggaattattacttttaagtttctcaacacgtcacttctattagctatatgttttt9060

gtagctgtttgcgagagtgaatttattgttgacattgttctcatttgcccacccatttta9120

ggataggggcttggtactacaaatatcttgatacttcaagtcctacaaaaagaaatttat9180

gtttcatattttttccatttgaacgtcgagattttatggtcccatggagttctccctatt9240

tttcgatgatgcccatcttttggcagtaccttctttgtgtacacaataaatgggaggata9300

ttttctgcagggagaaactgctttcatgtcgcctccttggggtggccctgactatgccaa9360

agttgatgtttatgatatgaaaagcatgcttattccttgtgatgggttagttccttgttt9420

ctattttaagagagtaatttctttcagtttgcactcactgatgtttacttactttgtgag9480

taaaacgcaccagagatccattaacctttaaggaggtgttatctatgtccatcaacactc9540

aaactgcatttttgggttcctaaactttttaagtgattcaccggagttccgtaccccttc9600

gtttatatttgtattttgcagaaacctcactctgttttatttctccttcgcatgtagggg9660

tggtaatggatcacgattcaaacgtttctccacgattcgtttgagcccttaattaatttt9720

agtacaaaaataaatagaaatagagatagagcctgatcctaatatgatttgatcctcaaa9780

ttttatagcgtagaatttagagcccattaccatttaccacccctattcacatgcctaccc9840

ctctccatcttctggattgaatgttccaacctaatttacactcgtagtttctttgatctg9900

ccaatcaaatccagagcctaattgctataacattagaacgaacacgccatattaccagaa9960

tactcgatgcagatatggatagaagcgaggcgctaagcgcagccagccttggcttcttgc10020

tctgcaggccgatcagggcgccagccaaagccaaccatgcgcgcacgtgactgcaatgct10080

actctctcttcgcctttgccatcgtcgtcgcaggatgttacgttgtgcttatgctggctc10140

ccacgagtgccgccgcccagagcgagctgagcgcacgcagccactgcttgtggttcacga10200

gcgtgagcatgccctcaccacctgctgctgcgcccttgctgcttgcttgcttgccggtgg10260

tgtacattatggacggattaaattgaatggatttacctgttccagaaaaagatctgatcg10320

acgatgggatgctatcttgtatggctccggatcaatgaagattaatggaacaaccaatcg10380

aaggctcagagcaggctagttggtgcccggaagactctggccagaagatggaaatgggta10440

agcgtgtgaaggaaaaaagaaatagagggggatttctacaaaaaacaaacataatgaaga10500

ggtatggatttcaggtgaaccacttaaaaaataaaaagggcatacccagtgccgtaggct10560

tcccgcactgtgcggggtcgtctggggaagggtatctttaagcgtcaagtcttacccgca10620

taatatgcagaggctggggctcgaacccgggacctttcggttatagacggtaggctctac10680

cgccgcaccaagcccgcccttgaaccatttaaaaaatttaggactcaaaaatacagtttg10740

acagttgatggacctagatgacacctccttaaaaaatttaatggacctatggtgcatttt10800

aatcttttgttgatcgactatactcaatgttgaactctttaggtactctctttttaaact10860

cggaaccatgatagcttcaagagtagtgatgtttcttcctcgcaacattgacctaaacca10920

attggcggacatgtccttgtctgtggatcccccgtgggcagttgaggtaagcccattttt10980

gctgattttgtgccaagctgacgtttcctatagatgtcacagtggtctctctctctgcag11040

gtcgagaagaacttcctcaacggaaagctgaaagccataacagcttactttgaagaacag11100

gatcgttgaaccaagcatcggcgctggtgatacaaatcatcttgttagctatgactcacg11160

acaaatttttgtggtgaccctaaacagaacctttgtgttcggagacagaaagaagcggtt11220

tatcatcttcaccgagcatagataatttatttgcagagatgagtcattggtatcatacaa11280

aagcagctcagcttatctcaattcacagcaagtgaaactgtcgaaggaaaactacaaggc11340

tgacagtcgaacgcgtgggagttagcttaattttgccttatgataagcaagcatgcttcc11400

tggtttatttcatacagctactagtagtttcagctgcaacagttgtgcgttggtgtgcgt11460

gtgattctcacatatctggtcctgcggatgtgagtgatgcaaatgtatgtgtcatcatcc11520

catgtaagggtttgtttgtttgctctcaatctatgtagattgagtgggattaagtgagtt11580

taaatctcagacaagtcaaaaaaaaatgttttcaatctcatccaatccacatatgatagt11640

aatacccgagtaaggcttagatgtaatagttggaataagaaaaacaagtcagccattttg11700

aagttttgtccttggagttctattaaaaggcattactgataaatctccaacagatttgca11760

gttgaagcaacatgtgaaacatatttatcatgttaaaacaatttgccttagtattcgatt11820

atgccatgaaatctgacatttccttacacatcccagtttatcattgtcaactgtctttag11880

gaatgtattgtatctgctgtttttacttgtatatgtatgttattttttgtcgttgtatgt11940

atatgttttattataaacatggccactaaggttgttctattcgttaaaataacacagatc12000

tataaacgactaaacaagcttcttgggataaagaatcatatggaggctggattttcgagg12060

agtctggtgcactgttttgctaatgatcagccccccccccccccccctctaaaaataaag12120

aaaatactggatttcctgttcatttattacattcatatgtaaatgcttctgtccttttct12180

atatctgggctggactttttgtgtgctcgtcactcaagttggttagtgtggttaatttta12240

ttatgctccgtgctctttcctaccgaacttggtctttgttagtatcattatcagtcagtt12300

atattttctcctcttgatgcttcatctaatctatttttgcaaagttgtcatgttatgtac12360

tatatgatcttttacaaggtttttgacttttcaaattattgtgtcgtatattatttgtac12420

tcagattgtgcttacaactttagtttatctatactttaaggggtgtttggtttctatggg12480

ctaatttataatcccttcattttattctattttcgtacctaaattgtcaagcacgaaaac12540

gaaaataaagttttaacttttatatttagcagtttatacactaaaatagaataaaataga12600

tgaagtaaaaattagtcctcagaaaccaaacatctcctaaatgtctagtaatagtcgcct12660

gaactgtagagcgcccaacacgcgccaccctgatttggtgtcttaaaatggcatgtgtat12720

ataggtggaatgggtttgacgagactgtaactactttttcttaattaaattatagatgga12780

cttaacttttctatatgcattttaaatatatttttctatatttttggtgggctgagttac12840

agtttatgtcaatataaattacaacagaccgaatctaaaattttattataaaatgtatgc12900

accaaccgttgactaaaaagataaaatttggacacctacattttagcaagtcacctgcta12960

atatatatctatactaggtaagtgtccgtgcgttgcaacgaaaacatataataatacgat13020

aacttatatataaaatatgtgttatactgttatgagaaaaagtttcacctgtcctatttt13080

tatcaatatgacaacagaggatcaatataaggccttggcatggcttcagaagttcagatt13140

aacgaattatggagcaagagcaactatttctggtgtttcagtagaaagaatggggatatg13200

tgtttatctctctcatacatgttacacaatgtgctatagaatgacacctctaggccgctg13260

ctacaactacagagaataaattatggatcatgggtgccctactaagttagctacacgtaa13320

aatctggggcgattgaccccctaatgcttcatcttggagatctcactaaaatacaccttc13380

cgcacaaaccgagccttgataaagctcgcgatcttcgtgtcctgatctttagtgcagaca13440

actgcaagggaaattattgtgccggtcagcaagagtggaaaatgtcagcagaaacacaag13500

aatggaaaattatatgatgtgcaagtaggacggcaccctatttaactaaaggtgtgtttg13560

gttcagttttctgaaccaattcgttgccaaaaaatctaaaatctcacacaaacggtacaa13620

catcagaatagatttttaaaagtttatagatttctcaagttcaattcaaaatcaccatct13680

accccaaatttttcagattactattattcatgttacaactatcactcttgtagtatctac13740

cattgatagttgttttaatcaaatatattttagctttctcacaatcctcagctcgaaaca13800

gattttcatggctcacagttggattcatattttcataaatctataggtgtgaaccaaata13860

gaccctaaaacatcatgttgtaatgttccaaaaaaatcatcacaaaaaacatgtatgcaa13920

cacacctaagtgcaacaacactaacctgcaggagcgttggcgtaggctggactcttgtgt13980

gtctgcaatgtggaaacctgcattgtaaatggttaggtaaatgcttcgcttaaaaatggc14040

agtaaatgcttcactaaaaatgcttccacgtgctcatgatcaagtaggttttatgttcaa14100

tctgcagttctacacaagtgcgattcattttgatactcatttctatttactttcagagct14160

cgtgctaactgtcataaagtgcaatgcatctatgactgccaattgatattgtgctcctgc14220

cataaagtgcactgcatcgtgctaactgtacactaatctgtgaggatctaagaccaatat14280

ttgtttacgttttctcctttagtatcctataaaaacaacacgcctagacaacccaaaaaa14340

tgtgtcccaataggaatatcagattctgacgctggcagcaatctcgaatgataatatttt14400

ttccaaacaagcgaggtccctaaatagaagcggcaacagataaaaactaaagataaaaac14460

taaagagtacagatgattggcatcacatcgggaatgaaatatgcctaacatatcaatttg14520

catattagattatttgctgagaacaataacgaaaacatatttagttgttcatcacaagtt14580

accttagattttgctgttcaaggtcctttgggtcttctttctgctagaacatacaagggt14640

atttcagatttgcaaacaaggaaaagcaagaacttcaatgatacatcattgtaaaaccaa14700

gtttccgatttaaataaagatgatgcttgcggtcaacacattcacaaatgtaaatgtgtg14760

aaatcgttcaaacataaggcttatgtggtcatgctcaggtagtatgtacagacctaaaaa14820

caaggtatatgacaacagtaccagccactaaacacacatggttaatcactaaaacaattc14880

tccgattaaccaggaaaactatagcagccactagaactatacaggtttctaccagtaatt14940

gcttcactaaaaaatgcccccatgtgtaaattttcaggtggtttgtacagacataaaaac15000

aagggtataggactacttcttgtgctaaaataaaagctggcactaaacagtgtatccagt15060

tcatcaggaaaacagttttagtgattaatcactaaaacaatacccatggtgcaaattatc15120

tgattaaccatgtacataccagtaattgcttcataaaaaatgcccccaatttgctcatgt15180

ttaaggaatatgtatagaactaaaaacaagggtatatgataacaataccagccactaaac15240

acacatggttaatcactaaaacaattctctgacaactcctataaaaagatagcaaccacc15300

agaaataaaccgcccacgacatccctaattgtagtcactaaaaactggtagcagtaattg15360

gttcactaaaaaatgaccataatcacgaaaactataacagtcaccagaactatataggtt15420

tcattagtaattgcttcactaaaaaatgtccccatgtgtaaattttcatgtcgtatgtat15480

agacctaaaaacaagggtatatgactacttgctatgctaaaacaatagctagcactaaac15540

agtgtattcagttaatcatccaaacaattttgtgattaatcactaaaacaatacccatgg15600

agcaaattatatgagaataagatctcgtcgttcctattgtgaagaatatactactacctc15660

cagtttcaaattacaatttcaaattacaagttgtttagaacatccacaaggtaattgcga15720

agaatatactaattgctctagtttcaaattagaagttgtttagagaaaaggtgttcctta15780

atagtctagttttagaagctacatccaactggtaaacataaattgcagaaaccttttatg15840

tggaagcctccgtcattgagtctgtcccctttagctgtaagtagttttctaaatattgtt15900

agtcaggcttagttgtttgagactctgtttccattcgtgaccatgggaactgtgaaatgt15960

gtagaagatgctcatgctcatgcatatgcatcgaattgttttgtaaagtcatcttaatgc16020

tcaaacagtttttttatctgccccagctgtacactgctttctgaattatgtcatttaggc16080

ttagctgtccgagataatttcatttgtgatgaaggtaaccggagcatctgtccttttgtt16140

ttaaacataaatattttgatagcttaacttgtgcgtcattttatcatgtactaacatggt16200

atatagatggcacttagcagtaacattcctgacttatgtgattgtcctgttagaatgctt16260

ctgcaatataatgggcttatgctaatctgtttggaatcccatgatgaataacaattatgg16320

atgttgggcattttgtatttttatgatgtaggcttaacatattttcttctctgctcagcc16380

ctgccggtggagacctctatttatagctataatccagccatatctagtgaactaacagat16440

ctattcctatctctagtgcttatccccttaacagatttattttcctatctctatttcttt16500

gatgttacttgctgcaggtgctatcccccactgatgcattatatgacaatcactcgaaga16560

atcagatgctaattaattgttttatacttgaccattgctaattaagtactccatttttt16619

13

17274

DNA

Zea mays

13

caaccctttccctctctcaaacggtcacttagaccgagtgaggcttcttccttaatctca60

tgggtcacttagaccccgcaaggatcaccacacaattggtgtctcttgcctcgcttacaa120

agcacttgagagtaagaagtgagaaagaaaagaaagccaagccaagcaaacaagagcaac180

aaagaaacacaaatgatcctttaacaagttctaatgcgctagagttgaatcgagaacttt240

gagtggatcgatcacttgaattgtgtctttgcagtggagtctattgctcttgtattgaat300

gcaatgtgttgaatgcttggatggttagagtggaggtggttgggggtatttatagccctc360

aaccaccaaacaactgttggggaggggttgctgtcgatgggcgcaccggacagtccggtg420

cgccagcaacgtcacccaaccgttagggttcgagcgcagacgactgttggagctttgtct480

tcttgtgccaccgaacagtcaggtgccgcaccggacaggcactgttcactgtccggtgcg540

cctctgacggctgctctaacttctgcgcgcactggtcgcacactgtagcgttcgcaggtg600

tccgttgcagtcgaccattgtgctggaagccgttgctccgtttggtgcatcggacagtct660

ggtggcacaccggacagtctggtgaattatagcggagtgcggcctgagaaacccgaaggt720

ggagagttcggagttgtacggtcctggtgcatcggacactgttcggtgcgccagaccagg780

gcacctttggtttctttgttcctttgcttttgaaccctaactttgatcttttattggttt840

gagttgaacctatatgcacctgtagaatataatctagagcaaactagttagtccaattat900

ttgtgttgggcattcaaccaccaaaattatttataggaaaaggttaaaccctatttccct960

ttcatccgggcccttgcggcggaccgtccgcgacaccagggtgagccttggacaggaaca1020

ctgcaaaaacacaagttaacactacggatcgtccgatggagaagcgagcaccgtccgaga1080

ccaagcacggaccgtccggcctcaggcgcgaatcgcccggtcgttgaaaaaccagaaaaa1140

cccgaaggtgacgggttcggtaaaatgcatttttagcgtccttgcggatcgtcctgggtg1200

cacggtcggaccgtccacgactgctttatctgacatttgacgacgcattaaaagctctat1260

agccgttactcctgaccgttgtgatttcagtcgttgatgtgcaggggtacggaccgtccg1320

cggtcggtagaaaatgagcaacgactaggaagtggttggaggctataaatacaaccccaa1380

ccacctccattcaaatgatccaagcactccactcattcacattcaatacaggagctagca1440

atacattccaagacacactcaaagctttcaatctctcaaagtcccacaatttagacaagt1500

gatcattagtgcttagtgacttgagagagtgtgatctatgtgttatttgtcgctcttgtt1560

gcttggctttcacaattgggctttcttcatctctttctcaaccttctaagtgaattataa1620

agcaagcaagagacacctaattttgtggtgatccttgtggggtcttagtgacccgtgtga1680

ttaagaagaagcactcgaccggtctaagtgaccgactgagagagggaaagggttggaata1740

gacccggactttgtggcctccttaacggggactaggttctttggaatcaaacctcggtaa1800

acaaatcgctgtgtttatttgtgttgattttcactcgatttgtttcccctcccttcctct1860

ctctaaaattcccttgctcatattgttgtgagttggctctcaaagttatctgcattgatt1920

gggcaactacttgcaaggataactatattccgcactccgaattatttctgacattaaccc1980

cgggcataatgtgtgttttaagtgtataattttcatgtttcgcctatttacccccctcta2040

ggcgactttcaaatgttctccttcacttgtgatgtctacaaccataatcagctcaacatt2100

tggactatcacccttgaacacttatgttgaactttaaaagttgtgcactaagcacttgtc2160

caacacttaacacacttgtcagtcctttaattgggttgtcatctaaaccaccaaaaacca2220

caaagagatctttcaccggggtccgtggttcatggccgtactgctcggtctcaagatttt2280

tattataaaatcactagagctctactatttatggttcggtgtgccatcgaaccgtctcca2340

acgggtacatccaacgagcgccagcacaccaacgactagttgacgtggtctacggtccag2400

aggctcatcagacttgtcaggaggctcgtcgactggtctggtgccacacccgtcgacttg2460

ggatcgaacaagaatgatggcaagaggacgaagcgcatcaagaagatcatctactacgac2520

tcctcttacccttcacacaaggacgacgattccacctcctccaagaaaaatacggttaaa2580

caaggttactctaagacatcttttatttattctcgcattccttacaatttcaatgctcat2640

ttgctttctattcatcttggcaagcctgctcgctttgatgggaggactattcttggtgga2700

gccataaaatgcgtagccatatttttttgctccaccctagcatttgggatgtcgtagaca2760

atgtaatgcaattgctaggtagcgatgataaaaattataatactattattgcccaataat2820

ctattcataatagcgcccaagctaaagaggtgcgctaagggatcctttttatagcccaaa2880

gaggtccataggcgttgctccttccttctaaacatcgatgaaattgtgttgtctgcgagg2940

acatcaaaccgggtctgtgcacaccttctccgggatctgatttggtgctttccttaactg3000

attcacagcggaccgatccgatgcaccaccggaccgtctgccacatcagctagtcgttgg3060

ccttcaaactccaccgggaagtagccgttggaggcggtccggtgcgccattggaccgtgc3120

aaagtgaagggtcgatgatttttataaataaaatctcgagacctcaaagttcagctctgt3180

aggcggtccggtgcacatggacctggttcgatgcattaggtcgcctaacactaagttcta3240

tgccctgcggacttgatccggtgcaccaccagacgagtccgatgaggcctagaacaaccc3300

aagtaaggctgttttgagcctaacttcttcaaatccttttggctattcttgggagctttc3360

caacaacttagacaaacataattagcacatattccaattgattaggtgtggagaactcac3420

cttttactttgtcgttcaccatgatttgcattttggcttaatctaagtattcgaaccact3480

tttctcacaggatagagttagagttcaaataaagtgctaaacacatagtattagacacat3540

gcaacttatctaagtaatcaaacctcatgattttacccttttgtccaaagctgcacactt3600

tagccttcattttagttctttaggatctagtactttcaaattgacttcaagtgcttgtgc3660

tcgtactcatatcaaattagttagtccatgttgttgtgctaaacacttaatcactaaaac3720

atgtagaaatggttatctaacacatttttctttcataagtaaaacaggagatttatattg3780

tagatgttattgtttgttgatgaaatttgaaataagagatataagagcaactcgaaaagc3840

ctagctaaatcgatttgtatcggtaaaaatagaaaactgatgattaaaataggatccaac3900

aaactctctttgctcctctctatgctatcctgctcagcatcacgtcgaggtctctagcca3960

tatttgctgagctcacctgcctcgccatcttcattctctcgtgcatcctcaccgtccctg4020

cgccttgccgtcgttgcagctcactgtcccacgccctgctgtcgccatgcctcgccgacc4080

ctgcggctcaccttaccgcgcctcgtcatcgtcgcggctcgccgtctcgtgcctcgccaa4140

ccccgctccttgacgtcctcacggcttacagtccccatcgtcctcgtggctccacgcctc4200

gccatatttgcggttcactgtccacgtgccctgccatcctagagcccgacatcgatatgt4260

caaatgagacaggagaaaaagaaatgaacatgtgattataatcagtgatttgaatattga4320

tagataagatttgaagagtctgttgtgtcatatcacctttttatgaaactctttattttt4380

tagagtttttataaaactctaaatttagctaaaattatatctagtcttttagagttactc4440

taacaagagatgagatagcgagctggcgagctgctggagacagccgagagtagagatagt4500

agaggagactagaaactccattaggcctagcccagagtcagctagggatgccgcccgatc4560

cactacgtactgaaagcgatcccggcccatgaacgctagtgggttgtaattcggcttacc4620

caagtacccagccgttcctccacacctctgcactacccgaaacccacgcccaacggccgt4680

ttcccgcaccccctatccgggaaggaagaaagccagaactcacccctgcttcgtctggcg4740

gcgccgtttcccgcacgcgatgccgtcgacggcggaggatttcccggcgataaggaagct4800

ggggaagctcttccggcttaccgaagtgtacctctggtaagtctcctgtctcccctaccc4860

gcttctcagcgtagggtttgccgtttgcgaggagtacgtctcctcaaactactctctctt4920

cctgcagggacgattcgtatggcgctggacctcacgatggacagaagaacgggcgctcgg4980

cggaggctgctctcgtggtacggtctctgacccgctttggtgtagctcactcttaagctt5040

tctgagttgggggtgcgcttgtgcttcgactagtagatggctaatttcgtcgggctactg5100

gtaatttcttggtatctgcattgtcgagaaagaaggcccgacgcataatttcatgcttgc5160

ccaagagtctacttaacacaaggaattggttttgtggcgtggtttgtgcattgcgcccaa5220

actgtagcctgtacaaaatgttaatcgtcgcgtgcattttaaacaaagttttgtattata5280

cgacaaaataccctcggcacatttgttacagactaccgataagtgcatacctatttctcc5340

tagttctatcaggaaataatcctggacctcgaaatgacagcctcgtctggctagaaccct5400

actaaacattttgagtgatcacttttcattactcattttcttgatgaaagcacattactg5460

acatggaagtttgctacataagacataacacttccttgtagtgctttatttaattattga5520

ccgatgatctttttggaaaattaagctgtattaaacaattgtagcttcggtgatgattgt5580

tggattaagcattagtctgctgcagtcctcttcttgattctgatatgacagttatttgtt5640

gattaaaataatgatggtttgctttacacttcgatctcctttgagaggaaaacatgtgaa5700

ggtgtggactagatcatgtatagaccaacagcattatcttattaaaacacttctaaataa5760

cttagcaatttcataaccatttttacacttctgaggaattcatcttgtcgtgaaagagtc5820

aattaacttagctgcttagcagactgtgtcaagcttattacttgtatgttgtgccctaca5880

aattactatgaggtttataatgtacatagcaatttgacgaccttcaacttttcaggactc5940

tcatacagataaaacatgcaatgaagcatccaatagcactgacaaaggttagatgtattt6000

ttcttgtattctagtatcttccttggtcaattttctttacagaggatgttacaatgtact6060

ctactttttttgtgtggaaacaacccactagagaaaaaaaatcacttcatttgagaaatc6120

ttaagaatctgaactctgaagctcagcatgcttccacaccaccacttttcaggccactgt6180

ctcttcattggagtaaatgacgcttcttttagagagaaagagagggggggggggtctgtt6240

tataattgaatcaagagattttattggtcacctgatttctgttgcatgacgtgggacctg6300

gatagacttcagatttgccttagttgataagttcaccggcactagtgaaagaaaagtata6360

tggtacaggtactcttatgaacaggcaaccacttagcaattcagcatctaatagagaagg6420

accaagtcttcaactaaagtcacaatatacctttagtactattagaggggctgaccctcc6480

ttgtctagtgcttgtgagatcataggagatggggcgtggtggttaagattgtggatcata6540

ttcgccaagctcccagggtcaatgtgattgagggatgtggtatgctacttgtgaaagggt6600

tcaaaaaggccaggatgacattgttcctacattcctggaatggggatgacacccccagga6660

caaggaaggtggcacagttccacgttcctggtgacatgttgtgtatcaaatgggaggcca6720

aatcaccacgggattactctaggaagggaagatgatgttgataatttgagtcattgttgc6780

aacatctgttcatggtttcatgcctcattttataagtcatattgcccacacataacattg6840

taatagtaaaatcaacaccagttattttacgttttcccttgtatgtcaaccgatttctta6900

ctgtgtatatgatctgtctatcaataggccatttctttgttgaagatttggaattggtca6960

atctcatgggttctttggggcttcctgtttcattcagcacaagtaaagtggtcagttgct7020

cacatagtgtgacaccatgttactgttccctaccggttccattgcttaattcctttatgc7080

attgcagaacaagaacacatgcaacaagggaaagaaaaaaggaagacaagcaccgctcaa7140

agcagcaaacactcaaatcaatgatgctgtgaggatatgtatcaatactgaagatagaga7200

aaattctgttgaatcattggatgctatggagcaaacgcactcatgcaatttatttgtgac7260

accactgggtcaaaatgaaccctcccgtgatgacactgacaagaggcttagggaagacag7320

ctcttgtgttgaagaacaagaagagtctggctgtagcaccatctactctgctggcaaagc7380

ccctggctgtgatgctaaaaatcatctcactgaacttggggcttttgagctttctgataa7440

cttggccaactcagcaaaagaagaatactcaattcaagaaaatcaagcttatgaaagtgt7500

gttgctagattctgaagagatgtcaaggaatgactgtgttgatgatgaatctacacattc7560

ctgtgttggcatttatcaggatgaaagagtgtccacaaggggagatcaaacatctgaaga7620

aactctatcagtaccccatgattacaatgatgttggcagagaagctagtctaagtttggc7680

agagccatcatctattgatgagcatgcacaaagctctgccaacaacttttactatgacta7740

tggtgaatggagggttatctgggatccattctataatcggtattatttttacaacatcca7800

gacacaagagtccacatggtgtcctcctgaaggactggaggattttgcatcatattgtag7860

cccagataccactaaagagctagctgaactgggatctcagtgttcaagcatggcaccaca7920

agagaacagtaaaaaccctagtctcgtcctttgcagttgacattacgaatagttatatgc7980

actacgataaaaactttctacaatatgtaacacttgagcatgtggcaatgggtgtaaaca8040

tttaataataaggtagtgaaatccattacacacagtattgaattttgcactacaaatgct8100

gaaggagaaacctaaattgtcaatgctttttggtgacattaattattgccattgatttcc8160

tgcttgtaggtgcttcatttatctgtctccaatttactcatatgctagcttcttgtttgg8220

gactaaaggctttgctgttgttttagtatgtcacacatttctctttaatctcaccatcac8280

agatctggctactcatgtcaatcatttagaagcacaggagcaagatcactgcattcatga8340

tttatctgacattcctgttgaaaagccaatatatcaaaggtagggaataccaaactgtac8400

aatgttgaacaagttattgttttttttgttaattctgttcatctatgcagtatgataact8460

acctctgacaaagcacagcacactgaaaataagtacagcgattcaacaactactgtgtta8520

gagatgaaccaggaagttgctagcaccaaaacgaaaaagagagtaaggagatctcgatcg8580

tgtaaggcgataatatatggcatctgctttctaggagtttgttcctgttacaattttagg8640

ttgcgcatttacacaatagtttcttggtttctttgagcaaatgcagctttgcatgactgc8700

tacattgcctacttatgtctaggtaacttttctttgcaaactgcaaagttatgtctaggt8760

aactatgccttctagaaaacctccttgttagctatgtattagtgagacttgcctaatatt8820

tattttcttgtggtccgttcttgtgctctttgtacatatttgccaataaccattttaatt8880

gttctacagatcattcatgccaagacatggcagggaacgtctctaatgacatcatcaagt8940

actgggctcagcggtattcacttttctcactttttgatagtggtataaagatggatgaag9000

tagggtggttttcagttacgccagagccaattgcaaagcatcatgcatctcgtgtgggtg9060

cgggagtaatgattgattgtttcacaggagttggtggaaatgccatccaatttgccaaaa9120

agtacgtcaatgttatcttgcaattgagttatgtgatggtctaatgtatcatttgcttga9180

acacttcctgtttagtagcaactgttatttttcttatgtcacgagaatgcaatggctata9240

tcaccttaagcagtatgctatgtccactgtccagtttaactaaggcatctgcttccagta9300

atatgcaaggctcttcttacttttgctgttatttaatatatggaagtgtccttacggagg9360

tgttattgtggacattttgagcatgttcatcatgtcacttgagttagtagagccagcctt9420

agttgtttgcagtgtaggtggatttattttatgttatcaatgtttcttctacagtactaa9480

gactattgttccacattaactatgtctccttttccaggtgcaagcatgtaattgcagttg9540

atattgatccacaaaagattgattgcgcgcatcataatgcatccatttatggagtaaatg9600

atcacatagatttcattgtaggtgattttatacatatagctcctcatctgaaggtaatgc9660

ctttttcttggaattattacttttaagtttctcaacacgtcacttctattagctatatgt9720

ttttgtagctgtttgcgagagtgaatttattgttgacattgttctcatttgcccacccat9780

tttaggataggggcttggtactacaaatatcttgatacttcaagtcctacaaaaagaaat9840

ttatgtttcatattttttccatttgaacgtcgagattttatggtcccatggagttctccc9900

tatttttcgatgatgcccatcttttggcagtaccttctttgtgtacacaataaatgggag9960

gatattttctgcagggagaaactgctttcatgtcgcctccttggggtggccctgactatg10020

ccaaagttgatgtttatgatatgaaaagcatgcttattccttgtgatgggttagttcctt10080

gtttctattttaagagagtaatttctttcagtttgcactcactgatgtttacttactttg10140

tgagtaaaacgcaccagagatccattaacctttaaggaggtgttatctatgtccatcaac10200

actcaaactgcatttttgggttcctaaactttttaagtgattcaccggagttccgtaccc10260

cttcgtttatatttgtattttgcagaaacctcactctgttttatttctccttcgcatgta10320

ggggtggtaatggatcacgattcaaacgtttctccacgattcgtttgagcccttaattaa10380

ttttagtacaaaaataaatagaaatagagatagagcctgatcctaatatgatttgatcct10440

caaattttatagcgtagaatttagagcccattaccatttaccacccctattcacatgcct10500

acccctctccatcttctggattgaatgttccaacctaatttacactcgtagtttctttga10560

tctgccaatcaaatccagagcctaattgctataacattagaacgaacacgccatattacc10620

agaatactcgatgcagatatggatagaagcgaggcgctaagcgcagccagccttggcttc10680

ttgctctgcaggccgatcagggcgccagccaaagccaaccatgcgcgcacgtgactgcaa10740

tgctactctctcttcgcctttgccatcgtcgtcgcaggatgttacgttgtgcttatgctg10800

gctcccacgagtgccgccgcccagagcgagctgagcgcacgcagccactgcttgtggttc10860

acgagcgtgagcatgccctcaccacctgctgctgcgcccttgctgcttgcttgcttgccg10920

gtggtgtacattatggacggattaaattgaatggatttacctgttccagaaaaagatctg10980

atcgacgatgggatgctatcttgtatggctccggatcaatgaagattaatggaacaacca11040

atcgaaggctcagagcaggctagttggtgcccggaagactctggccagaagatggaaatg11100

ggtaagcgtgtgaaggaaaaaagaaatagagggggatttctacaaaaaacaaacataatg11160

aagaggtatggatttcaggtgaaccacttaaaaataaaaagggcatacccagtgccgtag11220

gcttcccgcactgtgcggggtcgtctggggaagggtatctttaagcgtcaagtcttaccc11280

gcataatatgcagaggctggggctcgaacccgggacctttcggttatagacggtaggctc11340

taccgccgcaccaagcccgcccttgaaccatttaaaaaatttaggactcaaaaatacagt11400

ttgacagttgatggacctagatgacacctccttaaaaattttaatggacctatggtgcat11460

tttaatcttttgttgatcgactatactcaatgttgaactctttaggtactctctttttaa11520

actcggaaccatgatagcttcaagagtagtgatgtttcttcctcgcaacattgacctaaa11580

ccaattggcggacatgtccttgtctgtggatcccccgtgggcagttgaggtaagcccatt11640

tttgctgattttgtgccaagctgacgtttcctatagatgtcacagtggtctctctctctg11700

caggtcgagaagaacttcctcaacggaaagctgaaagccataacagcttactttgaagaa11760

caggatcgttgaaccaagcatcggcgctggtgatacaaatcatcttgttagctatgactc11820

acgacaattttttgtggtgaccctaaacagaacctttgtgttcggagacagaaagaagcg11880

gtttatcatcttcaccgagcatagataatttatttgcagagatgagtcattggtatcata11940

caaaagcagctcagcttatctcaattcacagcaagtgaaactgtcgaaggaaaactacaa12000

ggctgacagtcgaacgcgtgggagttagcttaattttgccttatgataagcaagcatgct12060

tcctggtttatttcatacagctactagtagtttcagctgcaacagttgtgcgttggtgtg12120

cgtgtgattctcacatatctggtcctgcggatgtgagtgatgcaaatgtatgtgtcatca12180

tcccatgtttgtttgtttgctctcaatctatgtagattgagtgggattaagtgagtttaa12240

atctcagacaagtcaaaaaaaaatgttttcaatctcatccaatccacatatgatagtaat12300

acccgagtaaggcttagatgtaatagttggaataagaaaaacaagtcagccattttgaag12360

ttttgtccttggagttctattaaaaggcattactgataaatctccaacagatttgcagtt12420

gaagcaacatgtgaaacatatttatcatgttaaaacaatttgccttagtattcgattatg12480

ccatgaaatctgacatttccttacacatcccagtttatcattgtcaactgtctttaggaa12540

tgtattgtatctgctgtttttacttgtatatgtatgttattttttgtcgttgtatgtata12600

tgttttattataaacatggccactaaggttgttctattcgttaaaataacacagatctat12660

aaacgactaaacaagcttcttgggataaagaatcatatggaggctggattttcgaggagt12720

ctggtgcactgttttgctaatgatcagaccccccccccccctctaaaaataaagaaaata12780

ctggatttcctgttcatttattacattcatatgtaaatgcttctgtccttttctatatct12840

gggctggactttttgtgtgctcgtcactcaagttggttagtgtggttaattttattatgc12900

tccgtgctctttcctaccgaacttggtctttgttagtatcattatcagtcagttatattt12960

tctcctcttgatgcttcatctaatctatttttgcaaagttgtcatgttatgtactatatg13020

atcttttacaaggtttttgacttttcaaattattgtgtcgtatattatttgtactcagat13080

tgtgcttacaactttagtttatctatactttaaggggtgtttggtttctatgagctaatt13140

tataatcccttcattttattctattttcgtacctaaattgtcaagcacgaaaacgaaaat13200

aaagttttaacttttatatttagcagtttatacactaaaatagaataaaatagatgaagt13260

aaaaattagtcctcagaaaccaaacatctcctaaatgtctagtaatagtcgcctgaactg13320

tagagcgcccaacacgcgccaccctgatttggtgtcttaaaatggcatgtgtatataggt13380

ggaatgggtttgacgagactgtaactactttttcttaattaaattatagatggacttaac13440

ttttctatatgcattttaaatatatttttctatatttttggtgggctgagttacagttta13500

tgtcaatataaattacaacagaccgaatctaaaattttattataaaatgtatgcaccaac13560

cgttgactaaaaagataaaatttggacacctacattttagcaagtcacctgctaatatat13620

atctatactaggtaagtgtccgtgcgttgcaacgaaaacatataataatacgataactta13680

tatataaaatatgtgttatactgttatgagaaaaagtttcacctgtcctatttttatcaa13740

tatgacaacagaggatcaatataaggccttggcatggcttcagaagttcagattaacgaa13800

ttatggagcaagagcaactatttctggtgtttcagtagaaagaatggggatatgtgttta13860

tctctctcatacatgttacacaatgtgctatagaatgacacctctaggccgctgctacaa13920

ctacagagaataaattatggatcatgggtgccctactaagttagctacacgtaaaatctg13980

gggcgattgaccccctaatgcttcatcttggagatctcactaaaatacaccttccgcaca14040

aaccgagccttgataaagctcgcgatcttcgtgtcctgatctttagtgcagacaactgca14100

agggaaattattgtgccggtcagcaagagtggaaaatgtcagcagaaacacaagaatgga14160

aaattatatgatgtgcaagtaggacggcaccctatttaactaaaggtgtgtttggttcag14220

ttttctgaaccaattcgttgccaaaaaatctaaaatctcacacaaacggtacaacatcag14280

aatagatttttaaaagtttatagatttctcaagttcaattcaaaatcaccatctacccca14340

aatttttcagattactattattcatgttacaactatcactcttgtagtatctaccattga14400

tagttgttttaatcaaatatattttagctttctcacaatcctcagctcgaaacagatttt14460

catggctcacagttggattcatattttcataaatctataggtgtgaaccaaatagaccct14520

aaaacatcatgttgtaatgttccaaaaaaatcatcacaaaaaacatgtatgcaacacacc14580

taagtgcaacaacactaacctgcaggagcgttggcgtaggctggactcttgtgtgtctgc14640

aatgtggaaacctgcattgtaaatggttaggtaaatgcttcgcttaaaaatggcagtaaa14700

tgcttcactaaaaatgcttccacgtgctcatgatcaagtaggttttatgttcaatctgca14760

gttctacacaagtgcgattcattttgatactcatttctatttactttcagagctcgtgct14820

aactgtcataaagtgcaatgcatctatgactgccaattgatattgtgctcctgccataaa14880

gtgcactgcatcgtgctaactgtacactaatctgtgaggatctaagaccaatatttgttt14940

acgttttctcctttagtatcctataaaaacaacacgcctagacaacccaaaaaatgtgtc15000

ccaataggaatatcagattctgacgctggcagcaatctcgaatgataatattttttccaa15060

acaagcgaggtccctaaatagaagcggcaacagataaaaactaaagataaaaactaaaga15120

gtacagatgattggcatcacatcgggaatgaaatatgcctaacatatcaatttgcatatt15180

agattatttgctgagaacaataacgaaaacatatttagttgttcatcacaagttacctta15240

gattttgctgttcaaggtcctttgggtcttctttctgctagaacatacaagggtatttca15300

gatttgcaaacaaggaaaagcaagaacttcaatgatacatcattgtaaaaccaagtttcc15360

gatttaaataaagatgatgcttgcggtcaacacattcacaaatgtaaatgtgtgaaatcg15420

ttcaaacataaggcttatgtggtcatgctcaggtagtatgtacagacctaaaaacaaggt15480

atatgacaacagtaccagccactaaacacacatggttaatcactaaaacaattctccgat15540

taaccaggaaaactatagcagccactagaactatacaggtttctaccagtaattgcttca15600

ctaaaaaatgcccccatgtgtaaattttcaggtggtttgtacagacataaaaacaagggt15660

ataggactacttcttgtgctaaaataaaagctggcactaaacagtgtatccagttcatca15720

ggaaaacagttttagtgattaatcactaaaacaatacccatggtgcaaattatctgatta15780

accatgtacataccagtaattgcttcataaaaaatgcccccaatttgctcatgtttaagg15840

aatatgtatagaactaaaaacaagggtatatgataacaataccagccactaaacacacat15900

ggttaatcactaaaacaattctctgacaactcctataaaaagatagcaaccaccagaaat15960

aaaccgcccacgacatccctaattgtagtcactaaaaactggtagcagtaattggttcac16020

taaaaaatgaccataatcacgaaaactataacagtcaccagaactatataggtttcatta16080

gtaattgcttcactaaaaaatgtccccatgtgtaaattttcatgtcgtatgtatagacct16140

aaaaacaagggtatatgactacttgctatgctaaaacaatagctagcactaaacagtgta16200

ttcagttaatcatccaaacaattttgtgattaatcactaaaacaatacccatggagcaaa16260

ttatatgagaataagatctcgtcgttcctattgtgaagaatatactactacctccagttt16320

caaattacaatttcaaattacaagttgtttagaacatccacaaggtaattgcgaagaata16380

tactaattgctctagtttcaaattacaagttgtttagagaaaaggtgttccttaatagtc16440

tagttttagaagctacatccaactggtaaacataaattgcagaaaccttttatgtggaag16500

cctccgtcattgagtctgtcccctttagctgtaagtagttttctaaatattgttagtcag16560

gcttagttgtttgagactctgtttccattcgtgaccatgggaactgtgaaatgtgtagaa16620

gatgctcatgctcatgcatatgcatcgaattgttttgtaaagtcatcttaatgctcaaac16680

agtttttttatctgccccagctgtacactgctttctgaattatgtcatttaggcttagct16740

gtccgagataatttcatttgtgatgaaggtaaccggagcatctgtccttttgttttaaac16800

ataaatattttgatagcttaacttgtgcgtcattttatcatgtactaacatggtatatag16860

atggcacttagcagtaacattcctgacttatgtgattgtcctgttagaatgcttctgcaa16920

tataatgggcttatgctaatctgtttggaatcccatgatgaataacaattatggatgttg16980

ggcattttgtatttttatgatgtaggcttaacatattttcttctctgctcagccctgccg17040

gtggagacctctatttatagctataatccagccatatctagtgaactaacagatctattc17100

ctatctctagtgcttatccccttaacagatttattttcctatctctatttctttgatgtt17160

acttgctgcaggtgctatcccccactgatgcattatatgacaatcactcgaagaatcaga17220

tgctaattaattgttttatacttgaccattgctaattaagtactccatttttta17274

14

1767

DNA

Artificial Sequence

cDNA

14

atgggttctttggggcttcctgtttcattcagcacaagtaaagtgaacaagaacacatgc60

aacaagggaaagaaaaaaggaagacaagcaccgctcaaagcagcaaacactcaaatcaat120

gatgctgtgaggatatgtatcaatactgaagatagagaaaattctgttgaatcattggat180

gctatggagcaaacgcactcatgcaatttatttgtgacaccactgggtcaaaatgaaccc240

tcccgtgatgacactgacaagaggcttagggaagacagctcttgtgttgaagaacaagaa300

gagtctggctgtagcaccatctactctgctggcaaagcccctggctgtgatgctaaaaat360

catctcactgaacttggggcttttgagctttctgataacttggccaactcagcaaaagaa420

gaatactcaattcaagaaaatcaagcttatgaaagtgtgttgctagattctgaagagatg480

tcaaggaatgactgtgttgatgatgaatctacacattcctgtgttggcatttatcaggat540

gaaagagtgtccacaaggggagatcaaacatctgaagaaactctatcagtaccccatgat600

tacaatgatgttggcagagaagctagtctaagtttggcagagccatcatctattgatgag660

catgcacaaagctctgccaacaacttttactatgactatggtgaatggagggttatctgg720

gatccattctataatcggtattatttttacaacatccagacacaagagtccacatggtgt780

cctcctgaaggactggaggattttgcatcatattgtagcccagataccactaaagagcta840

gctgaactgggatctcagtgttcaagcatggcaccacaagagaacaatctggctactcat900

gtcaatcatttagaagcacaggagcaagatcactgcattcatgatttatctgacattcct960

gttgaaaagccaatatatcaaagtatgataactacctctgacaaagcacagcacactgaa1020

aataagtacagcgattcaacaactactgtgttagagatgaaccaggaagttgctagcacc1080

aaaacgaaaaagagagtaaggagatctcgatcgtatcattcatgccaagacatggcaggg1140

aacgtctctaatgacatcatcaagtactgggctcagcggtattcacttttctcacttttt1200

gatagtggtataaagatggatgaagtagggtggttttcagttacgccagagccaattgca1260

aagcatcatgcatctcgtgtgggtgcgggagtaatgattgattgtttcacaggagttggt1320

ggaaatgccatccaatttgccaaaaagtgcaagcatgtaattgcagttgatattgatcca1380

caaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatcacatagat1440

ttcattgtaggtgattttatacatatagctcctcatctgaagggagaaactgctttcatg1500

tcgcctccttggggtggccctgactatgccaaagttgatgtttatgatatgaaaagcatg1560

cttattccttgtgatgggtactctctttttaaactcggaaccatgatagcttcaagagta1620

gtgatgtttcttcctcgcaacattgacctaaaccaattggcggacatgtccttgtctgtg1680

gatcccccgtgggcagttgaggtcgagaagaacttcctcaacggaaagctgaaagccata1740

acagcttactttgaagaacaggatcgt1767

15

1767

DNA

Artificial Sequence

cDNA

15

atgggttctttggggcttcctgtttcattcagcacaagtaaagtgaacaagaacacatgc60

aacaagggaaagaaaaaaggaagacaagcaccgctcaaagcagcaaacactcaaatcaat120

gatgctgtgaggatatgtatcaatactgaagatagagaaaattctgttgaatcattggat180

gctatggagcaaacgcactcatgcaatttatttgtgacaccactgggtcaaaatgaaccc240

tcccgtgatgacactgacaagaggcttagggaagacagctcttgtgttgaagaacaagaa300

gagtctggctgtagcaccatctactctgctggcaaagcccctggctgtgatgctaaaaat360

catctcactgaacttggggcttttgagctttctgataacttggccaactcagcaaaagaa420

gaatactcaattcaagaaaatcaagcttatgaaagtgtgttgctagattctgaagagatg480

tcaaggaatgactgtgttgatgatgaatctacacattcctgtgttggcatttatcaggat540

gaaagagtgtccacaaggggagatcaaacatctgaagaaactctatcagtaccccatgat600

tacaatgatgttggcagagaagctagtctaagtttggcagagccatcatctattgatgag660

catgcacaaagctctgccaacaacttttactatgactatggtgaatggagggttatctgg720

gatccattctataatcggtattatttttacaacatccagacacaagagtccacatggtgt780

cctcctgaaggactggaggattttgcatcatattgtagcccagataccactaaagagcta840

gctgaactgggatctcagtgttcaagcatggcaccacaagagaacaatctggctactcat900

gtcaatcatttagaagcacaggagcaagatcactgcattcatgatttatctgacattcct960

gttgaaaagccaatatatcaaagtatgataactacctctgacaaagcacagcacactgaa1020

aataagtacagcgattcaacaactactgtgttagagatgaaccaggaagttgctagcacc1080

aaaacgaaaaagagagtaaggagatctcgatcgtatcattcatgccaagacatggcaggg1140

aacgtctctaatgacatcatcaagtactgggctcagcggtattcacttttctcacttttt1200

gatagtggtataaagatggatgaagtagggtggttttcagttacgccagagccaattgca1260

aagcatcatgcatctcgtgtgggtgcgggagtaatgattgattgtttcacaggagttggt1320

ggaaatgccatccaatttgccaaaaagtgcaagcatgtaattgcagttgatattgatcca1380

caaaagattgattgcgcgcatcataatgcatccatttatggagtaaatgatcacatagat1440

ttcattgtaggtgattttatacatatagctcctcatctgaagggagaaactgctttcatg1500

tcgcctccttggggtggccctgactatgccaaagttgatgtttatgatatgaaaagcatg1560

cttattccttgtgatgggtactctctttttaaactcggaaccatgatagcttcaagagta1620

gtgatgtttcttcctcgcaacattgacctaaaccaattggcggacatgtccttgtctgtg1680

gatcccccgtgggcagttgaggtcgagaagaacttcctcaacggaaagctgaaagccata1740

acagcttactttgaagaacaggatcgt1767

16

434

PRT

Zea mays

16

metglyserserglugluhisvalpheleuaspprothrargilecys

151015

alaservalserleuleualahisaspleuileglyargmetleuasn

202530

arggluvalserserargproasnalalysgluvalleupropromet

354045

ilehisarggluilevalargpheglytyrcysgluserserserser

505560

lysserserseraspasnserglugluargaspglucysglyileval

65707580

aspalaleuvalthrthrilethrglnilearglysmetaspleuglu

859095

alaargserleuglnproserilelysalaglyleuleualalysleu

100105110

argglutyrlysseraspleuasnasnvallysmetglyleuserala

115120125

gluarglyslysglnlysleusergluileglnserglyvalgluglu

130135140

alagluserleuileglnlysmetaspleuglualaargserleugln

145150155160

proserilelysalaglyleuleualalysproargasptyrlysser

165170175

aspleuasnasnvallyssergluleulysargileseralaproasn

180185190

alaserglyleuilesertyrlyslysleuleuphehisglyleuasp

195200205

leutrpthralaleuserleuproglnproleuglyargalaalaleu

210215220

trpproprohisargthrilehisglnhisleuglncysglnglnleu

225230235240

thrglyvalalaglyserleualatyrleualaprogluvalleuleu

245250255

glyasntyrserglnlysvalaspvaltrpalaalaglyvalleuleu

260265270

hisvalleuleumetglythrleupropheglnglylysserileglu

275280285

alailepheaspvalilelysthralagluleuaspphehisasnser

290295300

glntrpalaservalserleuleualatyraspleuileglyargmet

305310315320

leuasnarggluvalserserargproaspalagluaspvalleuarg

325330335

hisprotrpvalleuphetyrthraspcysleuglnlysalagluphe

340345350

serasnleutrpaspthrasnlysthralaalaprometilehisarg

355360365

gluilevalargpheglytyrcysgluserserserserlysserser

370375380

seraspasnserglugluargaspglucysglyilevalaspalaleu

385390395400

alathrthrilethrglnvalargilesergluprolysargserarg

405410415

leupheserleuproasnglyleuleuproproserargasnserleu

420425430

argthr

17

143

PRT

Zea mays

17

metleuasnarggluvalserserargproasnalalysgluvalleu

151015

arglysphelyshisprocysasnleucyspheiletyrmetileleu

202530

asnleuserleuthrpheproasnglypheglnhisargalaprotrp

354045

valleuphetyrthraspcysproglnlysalaglupheserasnile

505560

trpaspthrasnlysthralaalaprometilehisarggluileval

65707580

argpheglytyrcysgluserserserserlysserserseraspasn

859095

serglugluargaspglucysglyilevalaspalaleuvalthrthr

100105110

ilethrglnvalargilesergluprolysargserargleupheser

115120125

leuproasnglyleuleuproproserargasnserleuargthr

130135140

18

162

PRT

Zea mays

18

metgluglyglyarghisproserproproproargileserarggln

151015

proproprotyrproalacysproserileleuproproleupropro

202530

valasnvalthrasnproglyleuvalproleuvalvalalathrleu

354045

pheaspgluargvalthrgluleuleuservalleualaaspalaala

505560

valglyargproglyargtrpserileglyglualaprotrpserser

65707580

serglyglythrasnglnalavaltyralaargargalaproglyser

859095

serserproproproalaproalaserproproleuproserserarg

100105110

alaaspcysleualaargtrpproglyserargalaleuvalalapro

115120125

leuglythrproalaphevalaspargleuphetrpserasppheser

130135140

glyserileargargglugluglualaglualaleuargaspproile

145150155160

argarg

19

87

PRT

Zea mays

19

metaspleuglualaargserleuglnproserilelysalaglyleu

151015

leualalysproargasptyrlysseraspleuasnasnvallysser

202530

gluleulysargileseralaproasnalaargpheglyargtrpthr

354045

trplysglnglyalatyrasnleualaleuargvalserserarggly

505560

tyrleuargproleuproglyargleuproglyargsersertrpser

65707580

leuglutrpleuileleuser

85

20

279

PRT

Zea mays

20

metalahispheaspgluleugluasplysthrthrasptyrvalasp

151015

leuservalglngluphealaleulysglnproglncysglymetala

202530

tyrasntyrtyrglyasnleuargleutyrvalvalalaasnlysala

354045

gluleualaserserilephegluileasplysalaserthrlysarg

505560

ileglyalaargphecysargcysleuprohisthrargmetglugly

65707580

glyarghisproserproproproargileserargglnproglnpro

859095

tyrproalacysproserileleuproglnproproprogluarglys

100105110

lysglnlysleusergluileglnserglyvalgluglualagluser

115120125

leuileglnlysmetaspleuglualaargserleuglnproserile

130135140

lysalaserleuleualalysleuargglutyrlysseraspleuasn

145150155160

asnvallyssergluleulysargileseralaproasnalaarggln

165170175

alathrargglugluleuleugluserglymetalaaspthrleuala

180185190

progluglngluglnleualacysalaalaalaalaleualavalgly

195200205

proalatyrgluargleuglnglualaargasnprosergluglngly

210215220

cysasnhisasplysglnilegluglnalatyraspaspileleuasn

225230235240

serserlyshisthrleualasermetmetgluleuglnglualaleu

245250255

leugluserasnglnalathrlysaspalaasnglyilealaalaleu

260265270

tyrilevalleuvalleumet

275

21

428

PRT

Zea mays

21

metalasertyrserserargargprocysasnthrcysserthrlys

151015

alametalaglyservalvalglygluprovalvalleuglyglnarg

202530

valthrvalleuthrvalaspglyglyglyvalargglyleuilepro

354045

glythrileleualapheleuglualaargleuglngluleuaspgly

505560

proglualaargleualaasptyrpheasptyrilealaglythrser

65707580

thrglyglyleuilethralametleuthralaproglylysasplys

859095

argproleutyralaalalysaspileasnhisphetyrmetglnasn

100105110

cysproargilepheproglnlysserargleualaalaalametser

115120125

alaleuarglysprolystyrasnglylyscysmetargserleuile

130135140

argserileleuglygluthrargvalsergluthrleuthrasnval

145150155160

ileileproalapheaspileargleuleuglnproileilepheser

165170175

thrtyraspalalysserthrproleulysasnalaleuleuserasp

180185190

valcysileglythrseralaalaprothrtyrleuproalahistyr

195200205

pheglnthrgluaspalaasnglylysgluargglutyrasnleuile

210215220

aspglyglyvalalaalaasnasnprothrmetvalalametthrgln

225230235240

ilethrlyslysmetleualaserlysasplysalaglugluleutyr

245250255

provallysproserasncysargargpheleuvalleuserilegly

260265270

thrglyserthrsergluglnglyleutyrthralaargglncysser

275280285

argtrpglyilecysargtrpleuargasnasnglymetalaproile

290295300

ileaspilephemetalaalaserseraspleuvalaspilehisval

305310315320

alaalametpheglnserleuhisseraspglyasptyrleuargile

325330335

glnaspasnserleuargglyalaalaalathrvalaspalaalathr

340345350

progluasnmetargthrleuvalglyileglygluargmetleuala

355360365

glnargvalserargvalasnvalgluthrglyargtyrgluproval

370375380

thrglygluglyserasnalaaspalaleuglyglyleualaarggln

385390395400

leuserglugluargargthrargleualaargargvalseralaile

405410415

asnproargglyserargcysalasertyraspile

420425

22

401

PRT

Zea mays

22

metalasertyrserserargargprocysasnthrcysserthrlys

151015

alametalaglyservalvalglygluprovalvalleuglyglnarg

202530

valthrvalleuthrvalaspglyglyglyvalargglyleuilepro

354045

glythrileleualapheleuglualaargleuglngluleuaspgly

505560

proglualaargleualaasptyrpheasptyrilealaglythrser

65707580

thrglyglyleuilethralametleuthralaproglylysasplys

859095

argproleutyralaalalysaspileasnhisphetyrmetglnasn

100105110

cysproargilepheproglnlysserargleualaalaalametser

115120125

alaleuarglysprolystyrasnglylyscysmetargserleuile

130135140

argserileleuglygluthrargalalysserthrproleulysasn

145150155160

alaleuleuseraspvalcysileglythrseralaalaprothrtyr

165170175

leuproalahistyrpheglnthrgluaspalaasnglylysgluarg

180185190

glutyrasnleuileaspglyglyvalalaalaasnasnprothrmet

195200205

valalametthrglnilethrlyslysmetleualaserlysasplys

210215220

alaglugluleutyrprovallysproserasncysargargpheleu

225230235240

valleuserileglythrglyserthrsergluglnglyleutyrthr

245250255

alaargglncysserargtrpglyilecysargtrpleuargasnasn

260265270

glymetalaproileileaspilephemetalaalaserseraspleu

275280285

valaspilehisvalalaalametpheglnserleuhisseraspgly

290295300

asptyrleuargileglnaspasnserleuargglyalaalaalathr

305310315320

valaspalaalathrprogluasnmetargthrleuvalglyilegly

325330335

gluargmetleualaglnargvalserargvalasnvalgluthrgly

340345350

argtyrgluprovalthrglygluglyserasnalaaspalaleugly

355360365

glyleualaargglnleuserglugluargargthrargleualaarg

370375380

argvalseralaileasnproargglyserargcysalasertyrasp

385390395400

ile

23

380

PRT

Zea mays

23

metalasertyrserserargargprocysasnthrcysserthrlys

151015

alametalaglyservalvalglygluprovalvalleuglyglnarg

202530

valthrvalleuthrvalaspglyglyglyvalargglyleuilepro

354045

glythrileleualapheleuglualaargleuglngluleuaspgly

505560

proglualaargleualaasptyrpheasptyrilealaglythrser

65707580

thrglyglyleuilethralametleuthralaproglylysasplys

859095

argproleutyralaalalysaspileasntyrphetyrmetgluasn

100105110

cysproargilepheproglnlysserargleualaalaalametser

115120125

alaleuarglysprolystyrasnglylyscysmetargserleuile

130135140

argserileleuglygluthrargvalsergluthrleuthrasnval

145150155160

ileileproalapheaspileargleuleuglnproileilepheser

165170175

thrtyraspalalysserthrproleulysasnalaleuleuserasp

180185190

valcysileglythrseralaalaprothrtyrleuproalahistyr

195200205

pheglnthrgluaspalaasnglylysgluargglutyrasnleuile

210215220

aspglyglyvalalaalaasnasnprothrmetvalalametthrgln

225230235240

ilethrlyslysmetleualaserlysasplysalaglugluleutyr

245250255

provalasnproserasncysargargpheleuvalleuserilegly

260265270

thrglyserthrsergluglnglyleutyrthralaargglncysser

275280285

argtrpglyilecysargtrpleuargasnasnglymetalaproile

290295300

ileaspilephemetalaalaserseraspleuvalaspilehisval

305310315320

alaalametpheglnserleuhisseraspglyasptyrleuargile

325330335

glnaspasnserleuargglyalaalaalathrvalaspalaalathr

340345350

progluasnmetargthrleuvalglyileglygluargmetleuala

355360365

glnargvalserargvalasnvalgluthrglyser

370375380

24

589

PRT

Zea mays

24

metglyserleuglyleuprovalserpheserthrserlysvalasn

151015

lysasnthrcysasnlysglylyslyslysglyargglnalaproleu

202530

lysalaalaasnthrglnileasnaspalavalargilecysileasn

354045

thrgluasparggluasnservalgluserleuaspalametglugln

505560

thrhissercysasnleuphevalthrproleuglyglnasnglupro

65707580

serargaspaspthrasplysargleuarggluaspsersercysval

859095

glugluglnglugluserglycysserthriletyrseralaglylys

100105110

alaproglycysaspalalysasnhisleuthrgluleuglyalaphe

115120125

gluleuseraspasnleualaasnseralalysgluglutyrserile

130135140

glngluasnglnalatyrgluservalleuleuaspsergluglumet

145150155160

serargasnaspcysvalaspaspgluserthrhissercysvalgly

165170175

iletyrglnaspgluargvalserthrargglyaspglnthrserglu

180185190

gluthrleuservalprohisasptyrasnaspvalglyarggluala

195200205

serleuserleualagluproserserileaspgluhisalaglnser

210215220

seralaasnasnphetyrtyrasptyrglyglutrpargvaliletrp

225230235240

aspprophetyrasnargtyrtyrphetyrasnileglnthrglnglu

245250255

serthrtrpcysproprogluglyleugluaspphealasertyrcys

260265270

serproaspthrthrlysgluleualagluleuglyserglncysser

275280285

sermetalaproglngluasnasnleualathrhisvalasnhisleu

290295300

glualaglngluglnasphiscysilehisaspleuseraspilepro

305310315320

valglulysproiletyrglnsermetilethrthrserasplysala

325330335

glnhisthrgluasnlystyrseraspserthrthrthrvalleuglu

340345350

metasnglngluvalalaserthrlysthrlyslysargvalargarg

355360365

serargsertyrhissercysglnaspmetalaglyasnvalserasn

370375380

aspileilelystyrtrpalaglnargtyrserleupheserleuphe

385390395400

aspserglyilelysmetaspgluvalglytrppheservalthrpro

405410415

gluproilealalyshishisalaserargvalglyalaglyvalmet

420425430

ileaspcysphethrglyvalglyglyasnalaileglnphealalys

435440445

lyscyslyshisvalilealavalaspileaspproglnlysileasp

450455460

cysalahishisasnalaseriletyrglyvalasnasphisileasp

465470475480

pheilevalglyasppheilehisilealaprohisleulysglyglu

485490495

thralaphemetserproprotrpglyglyproasptyralalysval

500505510

aspvaltyraspmetlyssermetleuileprocysaspglytyrser

515520525

leuphelysleuglythrmetilealaserargvalvalmetpheleu

530535540

proargasnileaspleuasnglnleualaaspmetserleuserval

545550555560

aspproprotrpalavalgluvalglulysasnpheleuasnglylys

565570575

leulysalailethralatyrpheglugluglnasparg

580585

25

589

PRT

Zea mays

25

metglyserleuglyleuprovalserpheserthrserlysvalasn

151015

lysasnthrcysasnlysglylyslyslysglyargglnalaproleu

202530

lysalaalaasnthrglnileasnaspalavalargilecysileasn

354045

thrgluasparggluasnservalgluserleuaspalametglugln

505560

thrhissercysasnleuphevalthrproleuglyglnasnglupro

65707580

serargaspaspthrasplysargleuarggluaspsersercysval

859095

glugluglnglugluserglycysserthriletyrseralaglylys

100105110

alaproglycysaspalalysasnhisleuthrgluleuglyalaphe

115120125

gluleuseraspasnleualaasnseralalysgluglutyrserile

130135140

glngluasnglnalatyrgluservalleuleuaspsergluglumet

145150155160

serargasnaspcysvalaspaspgluserthrhissercysvalgly

165170175

iletyrglnaspgluargvalserthrargglyaspglnthrserglu

180185190

gluthrleuservalprohisasptyrasnaspvalglyarggluala

195200205

serleuserleualagluproserserileaspgluhisalaglnser

210215220

seralaasnasnphetyrtyrasptyrglyglutrpargvaliletrp

225230235240

aspprophetyrasnargtyrtyrphetyrasnileglnthrglnglu

245250255

serthrtrpcysproprogluglyleugluaspphealasertyrcys

260265270

serproaspthrthrlysgluleualagluleuglyserglncysser

275280285

sermetalaproglngluasnasnleualathrhisvalasnhisleu

290295300

glualaglngluglnasphiscysilehisaspleuseraspilepro

305310315320

valglulysproiletyrglnsermetilethrthrserasplysala

325330335

glnhisthrgluasnlystyrseraspserthrthrthrvalleuglu

340345350

metasnglngluvalalaserthrlysthrlyslysargvalargarg

355360365

serargsertyrhissercysglnaspmetalaglyasnvalserasn

370375380

aspileilelystyrtrpalaglnargtyrserleupheserleuphe

385390395400

aspserglyilelysmetaspgluvalglytrppheservalthrpro

405410415

gluproilealalyshishisalaserargvalglyalaglyvalmet

420425430

ileaspcysphethrglyvalglyglyasnalaileglnphealalys

435440445

lyscyslyshisvalilealavalaspileaspproglnlysileasp

450455460

cysalahishisasnalaseriletyrglyvalasnasphisileasp

465470475480

pheilevalglyasppheilehisilealaprohisleulysglyglu

485490495

thralaphemetserproprotrpglyglyproasptyralalysval

500505510

aspvaltyraspmetlyssermetleuileprocysaspglytyrser

515520525

leuphelysleuglythrmetilealaserargvalvalmetpheleu

530535540

proargasnileaspleuasnglnleualaaspmetserleuserval

545550555560

aspproprotrpalavalgluvalglulysasnpheleuasnglylys

565570575

leulysalailethralatyrpheglugluglnasparg

580585

26

13229

DNA

Zea mays

misc_feature

(10689)..(10788)

n is a, c, g, or t

26

ccccgcgcggccaaccctctctaagagggccctggtccttccttttatagtcgtaaggag60

tggatccaggtgtacaacgggggtgtagcagagtgctacgtgtctagcgggggagagcta120

gcgccctaagtacatgccgatgtggcagccggagagatcttggcacccagcgagtgtgat180

gtcgtggccatcggaggagcgacggagcctggcggagggacagctgttggagcggttgag240

tccttgctgacgtcctcctgcttccgtaagagagctgagagccgccgtcgtcacagagct300

tgcggggcgccatcattgcctatctggcggagctagccagataggacaccggtcttgttc360

tctgcggcccgagtcggctcggggcagggtgatgatggcgcttcctgttgacgtgactgg420

cctgcgccctaggtcgggcgacgtggaggctcctccgaagccgaggtcgagtctgtcttc480

catggtcgaggccgagcccgagcccctgggtcgggcgaggcggaggtcgttcggcagagg540

ccagggcggagtccgagccctggggtcgggcgaagcggagttcgtcgtcttctggggctg600

agcccgagtccgagccctgggtcgggcggagcggagttcgccgtcttccgggactttagc660

ccgagtccgagccctgggtcgggcggagcggagttcgccgtcttccggggcttagcccga720

gtccgagccctgggtcgggcggagcggagttcgccgtcttccgggacttaccccgagtcc780

gagccctggggtcgggcggagcttcctatggtgcctttggcagggcctgactgcccgtca840

gtctcactctgtcgagtggcactgcagtcggagtggcgcaggcggcgctgtccttctgcc900

aggccggtcagtggagcggcgaagtgacggcggtcacttcggctctgccggagggcgtgt960

gtcaggataaaggtgtcaggccacctttgcgttaaatgctcctgcgattcggtcggtcgg1020

tgcggcgatttagtcagggttgcttcttagcgaaggcaaggcctcgggcgagccggagat1080

gtgtccgccgttggaggggggcctcgggcgagacggaaatcctccggggtcggctgccct1140

tgtccgaggctaggctcgggcgaggcgtgatcgagtcgctcgaatggactgatccttgac1200

ttaatcgcacccatcgggcctttgcagctttatgctgatgggggttaccagctgagaatt1260

aggcgtcttgagggtacccctaattatggtccccgacaaccacaaacgcccacgtcgtgc1320

gcgtggaggtaaggctatctgcattcatcatactttaaactagttgggtgcccgtgcgtt1380

gcaacagatatcatataaattcatgtgttttgctacgcgacacgagggatcggtattatt1440

agtagtcatttttctattggacctagtgaggtactcgtacgttgctacggagatcatata1500

aatccatgtgttttgctacgcgacacgagggatcggtattattagaagtcattttctatt1560

ggacctagtgaggtacctgtatgtgcccgtacgttaccacgagagttaaaacctagtata1620

aaacataaatacagaatgacaaacatcattatgatatacaaattcgtgtaacaaaatatc1680

actgtagcactaatattcaataaaatgtagcaactcactgctcactaatgatgtgtccag1740

ggtaagtgggtaaagcacggtagacgtcttcttttgacttagtacaatgctcgtgtgttg1800

agacggtgcacaattattcgataaaatgtttgagcacaagtgcacaacgattacataaaa1860

ttgaaaaatacttatattaacaaagtctaattgtctcaaattctttttgccaacaaccaa1920

ttcactgtgttgcgatgatacacaataatatttcatgaatgactcaggcaatccacctaa1980

gtgggtaacccaaaagcacaccaatgtgatgcctacacgtgaacatctaacctctattac2040

tacaatagtcttgttggaaatttacaactcatctagaatttcaaaatactcaaaattatt2100

tagactctctctaaaccggagcatacaaaatagatcaggcagttttaacggaaacatgta2160

tgactctcaccatcctgtgcagcgctgctcgaataccagaaagtcccagctcagccaata2220

ctgcatcaagttctgctagttcctttttttcatctcctttttagacaactgtctttctgt2280

ctctttgggtggcgcagctagggcccacaactgtcttgatagcaggttcggctggggcat2340

ctttcggatcatgctttaggttcatcctcagctccatcatcaacctcaaaatcaagatcc2400

ttgtcatcactctcaatgtcctgcaacatatatagaagtacatctaagaacactatcaga2460

gaggctaactataatgacaatcagccaaacatgctccatccccctcgagatgagtaaatt2520

atacttccacttaatagtttagttttgatgcaaaaattatgacaaaaaacaagtagaaca2580

ataaccagagtttgctatgacaaacatgcagcacctgtcatgtataaccatttccaaatc2640

acttgcttacccaagatatagaaaaacttggaacaagtccctatccaaggtacatatttt2700

ttatttttgtttagcaggtggtcagctctgaaaattttctgtcaaggtgaagtttgcctt2760

atgacttgctcatagcagggtctgctttgaactcctccactgggaatgcaatgtcgtgat2820

cccacagaccgatcaaatgggattcaaatttaggctgtgctatatcgcgctctgaatcta2880

gtgataatgcgtctatccttggtttcttgggagaatgttgcatgatctctgggtagtcat2940

ctaagcttggaggtgaatcagaaagtttctgaccctcagaagctttaattgatccaccaa3000

gcttttgttgctcctcaatgatcatctgcaagtatattccttgtgctttaattgttagtt3060

gcagttgtctttgaacctgaaaacataacttatttagacaagcaaaagaaagttggcaag3120

gaaaatttagattgatcttgaagctaaaaaggactcataaactacaagcaataagacaaa3180

acaggagaactgccaacaatagattattgacaactaagcattactacaaccatgagaaga3240

aaccactgttaggtgtaacgtgacagttaataagtgaatattgagaatgggccaaaatat3300

cagcttgtagaagtagccccaactaagtaggacttctggtcatagtaggattgtcggtaa3360

actaaaatgatgtaaaaaacagaaatctctactacttattaagtaagcaatagtagtctg3420

cctccctcgttctgccgttctgccatccatcgatctggaccgttcatatcgcatcgtgcg3480

tctgcgtctcccacactctatctccctccaccgcacgaccacccaatccctaaccttccc3540

ccacacgttcctctctctctcgcgttgcccgccccctgcccctgccgcgatagaatgcct3600

cgcggcctccaccgccgacgcggcctccaccgcttcccttcctccctttgcctccaccgc3660

ccctcccccaacgtcgtcgccaccgccgctctagggcctcgacgcaggcagcttgggaac3720

cgccctcggcctcggtgtcgcacggccccacgaggaccgaccttgccgcatggcggttgt3780

ggaggcgtgcgaccccgcaaggatcgacatctcgtccaagacagcagccccctcagagcg3840

gacgacgcagcttatctctggtggaaggcacgtagccacgcccccactccctcctcgtgc3900

tcccgccgacgcctgcctcgcgctcctcgtcgaccgtctctccctctcctccgtcgttgc3960

atctcgcctctcgcgcacgcacaaaggtgagtctcccctcccccgactcacccctctagg4020

tctctgattcttctcatcggcgagcaaggacggaggagctggcggaggagctgtcgaata4080

taacctaattatgtcggtctcgggtgggatttggctattttgggcgccatggcctaggtg4140

agcgtacagatctggttctgcattttgttttcttatctgtcaatcatgttttttcatatt4200

actgcaggtgggtctgattaaatattacagacatgttttgcttcgaagccctacacctgt4260

ccatctaattttaggaactgacgattgacacagctttgttgtggcgctatttagtataag4320

ttatagaaatcggatctttggctctcatctgcttgtagttgcaacattgaggtagaacct4380

gggtgggtctgattgctcattgctgtgatatatgtctggaaaagcagagttatagtttgg4440

aaactagcgcgtagtacatgtcggtttctttaagtaattgcttatgctctgttgttttga4500

tttccagatgccccaaatgtgctgaacttaagctatcgcgtgagaacacaactttctggt4560

aagtggcaagctcctccgtttcttattttaccgcattggaagttggaacaccatgatagc4620

tctcaggacgaatgagtatcagctaattttattgttttaacaatagagcataaaacctta4680

tgcatattttaaaggttcatgtatcatttagttatggccttatgaggaagcttcatcatc4740

atatctgtagataccttgattcgtggggatggtaattttttgtattcttgttcttttgat4800

ttttgaaggaaacacatcttataatacgatagcacccattggaagtcctcgcatacagtg4860

cagggtagaattatgcttcatgtgttgccccttcaccacgatatgccaaattgaatgtag4920

tttcatcagttatgctcaattatggatgttcagaggcatccaatttcagtgtgtactgca4980

atacttgggtccacctatagttgatagatgttctgtattttgttttcttataaattagat5040

ttggcttgcattatattgttctcttttggaacagagtcactctccgacagctcatcgccg5100

cctgcttcgaccgcctgtacacaaggtttaagaagcgctctgcgaacttccacggccgcg5160

ccgtgctgcttctagctatttacctttttctatgatgcaaatgtttatatgcatactatg5220

ttacttgagaaacattaaagtacttgatgtacctaaacacattttgtttagtgatgtata5280

gtgatatagcattattgtctatattaatatttatattgctggtagcttctactttaattg5340

atctcaatggggcatttgggtggctagcaattcacattgataatttaaaagtgaatttca5400

ggtgtacatttgatggcctccgatatggtgctgccttcaattctctacaatgcgcgagaa5460

tgctgctcaggagggtattaatggctcaacacagatgacctcctcggagtcatgtttcta5520

attatctacactatgattctccttctgttgataaaatattgttttattgtgctgtgagct5580

aatgataacagtgatggtaagtaaatatggtccatgcatattctcatcatagatggctga5640

aaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaagaactc5700

aaggtacatagttttctcaacagaagaatatgtatctgtttgattccagctgaattgctt5760

actaaactcagtgtgtcactttaaatgatatgggatgaagttgggcaagaccaaagtgaa5820

agtgggagaatacccgaagaacttcttgttggacgaacttggagaaaccaatactaaaac5880

tcagtgtcaaccgcttgcaacaggcaatttaggtcgatgtgctcgcacgctgtgtgacca5940

tgtctgagcactccccacccacccaatcgcttcccacgtcatgccgccacgtcgagaatt6000

tgtacacaactcaggttgcccattttactctgttattgaacctcgcttttctgtagaccc6060

aggtatgctagaagtaggtaatggtagcatgaccttagcagagtatcgctcacattttag6120

catttgggttgtcatgaaggtaagttttcactgagattggaccgatgttgtgtcatcata6180

ttttgagtaaagggtttgaccgaagaattttgaaaaataaaagactgtagttcattgaag6240

gtgataactggtatgcaagtattactattaatgattctccacttactgatattacatttg6300

gataagaggaaggagatatcctaattctaatgggttaactgctagcagttccttatgtac6360

tgttatgcttcaggcccctttattaattgcctcatgtggaaagagggtgtgatgatcttt6420

tgttgtttttgtagcgagaagctcaaagaaggggaagacatctacatctatcagtaagtg6480

atccagtttagtagatgaatcaccgctatgggtgtttttctattcttgaataggcatgtg6540

gttctattgatggttgtttgatgtttggtatattgtgttctgactgtaggggtgacaaat6600

ccttttgcagcatggagtgcagagaaaatttcatggtagacgagatggaaggtgagccag6660

agagtatgtataatcagatcaattttagactttttttcactaaccacttatccatagcac6720

acttgtgtttatttgagatatgtgtcgtcattttgtgttcatcggctattgctatgaaga6780

tgaccacagggttaattcttgtttatgagttcatcctctagaatgtattgggttgtcgcc6840

aagaaaaaacacacgaagacataaattcaaataatttctggtgctgtggagtattcataa6900

ttgtgtgcttttgttgaatccttcatgtaccataattcttactgcttgcaagccctttta6960

gaatgtgactttaggatgtagaatttggtgaaaggagacaagaaattattctctaaacta7020

ctatttttataggcaagggaggctggaagcaccttttttgcatgggagctgcatggagag7080

aacttattggtagaaaacaaactttgaccagattcttgaatgtgccaagggaaaaaggta7140

ttaattgggaccttctatttatacaagggaaaaaggtattttttatacaattgctcttcc7200

ttctcatgtatggcaacttcctgttttgtagagattgagcgcccttcctcactgtttgaa7260

ctttactcatcctcatgaaactgtgatgctacctgtaggagcggtttgttcaaggaggga7320

atgacttttagaccctaataacttgtctacacttaagataaaaattgttgtcattgctat7380

ggttacctcctttagtgtcgtaaactcagaataatcatagatccctttgttgcttacaca7440

ttattcatgttctacaagctattgagattttgactagccgctacattttagtgcaggttt7500

tcataatttcttacctttatatctacattttagatttcctcccttttgacactacatttt7560

agccgctcactcggaagctttccatccctgctggagtactcagcccataaaggtaaatgc7620

atgcctttacctgcctgaaatgcattgtctttcttttcaaccgtgaatgtgaattggatc7680

catcattcatgtgcacacacaaggccggtttgaatttggaatgcactgttgttctgcctg7740

tgatccacttggtggtttttatttgcgttcacatattaaaaaatatattataccctgatt7800

ctgagtcaaactttggttagacctggatttgttgattagtttccgttgttgtgatctgtt7860

gaacaaactataggttaggtgagttatcagattcttagatctgtcctgtacggttgtgat7920

ctgttgaacaaactataggttaggtgagttatcagattcttagatctgtcttgtacgtct7980

gatctctaccatgttacaaacatctgaaagttaataataatcactactacattcacacct8040

atcttgtatggatttgctgttgaaattgcagaatgcttcccaacttgtgcatttttatta8100

catccccacaacttcatcgtatagtccagacatcctttgtttgtgtagcaaaatagatgt8160

gcaattgtttcattgtaacaatgtcctgtatattaacctcggcggctctggtgctttctg8220

caggcttctcattcttatttgcatggcctgtaactgctgcacagcgcgctacggctggcg8280

tagcggcagcaactgctggtatgaacaggatccgtccaccctcattagcgacgatgaagt8340

gacacccaattctaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagt8400

gatgagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggccc8460

ggcatgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcat8520

tgcaagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgtt8580

tataactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttaga8640

caaaatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagca8700

ggtgaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaa8760

caaatgttcggtttggacttctctgggtggttctatgatactttataatttcttttggct8820

tatgtgtcatttcagaagatatgaaaatagctaagcattgtcaataatattagcttcctt8880

gtttcttgttaacctaagatgactgccctatttcacttgttttttctagctgtggtagta8940

ggtcattagagtagttcccttcaatcgtagcactcagccatgttgtaaatggtttacttc9000

tattgtgaatcgtgtttttcttttttaatggtggggttagtggaaaatctggacgtatcc9060

gcgaatggtggagaagcacacaaaacaacaattatatgcaaaaactaaactatgttttct9120

ttctttctatggtatatgaatttagtaccttgataatcacaatgattagagtctgtttgt9180

atttatattaaaacataactgcttaggaggtgtatgaggaggataacgagatgtgcagtg9240

cgcgcaacagtaagcagactaccatgacttggttgctcttcaagtgtgcaagtgtgtcta9300

tggattgtatggttctttggtttttgttgttcgctaaggtgcaaagtaggaagaaaatac9360

ggccctgaccctgcaagtggggaatgtatcgtttttgctggaactgaaatatgcggtgtt9420

cttttttatgcactaatattttgttgaatattttgtataccgttgcaacgcacgggcatc9480

tacctagtaaattaatagttttcaataaaagacctcgagtttctcatgtagtcgcttctg9540

aacctccatttacatctttagtgccttaaaaatcgtgccaatgtttccatatatcaaatc9600

acggtgatcaacaattaaacaaaataactcttaaccaaaaaaatggaagtgtcacgcact9660

catgaactacagatgtcaatagctaacatataaacaggctcaagccttccttcaaggata9720

ccaggttgttctaagcagcatctctatatttagagaggaagacttgtctcgattcatctc9780

ttcctgagaccccactcatgtgggtagttctaccttttttatgaatacagatgcaacatc9840

aacatataaacaggctcatgcctcagatactactctcaattggtctactgaaattctatt9900

gcatttatgacgaatgcagttacagatttaaaagtaaaggacgagtacagctgcatgatg9960

aaaagtaaacagaacggtaactgcaaatttacttcagacatgctgctgctttgatgagat10020

cggaacagggttgttgctcacaatctgtggatttaaagcctgcaacacatgtcacaattt10080

aatcatgaggttcactaagacaacattacacaaaagaggtgttggtactgagaacataaa10140

agatcatgtcatccttgtctgaaaggaagtgcattccaactaagaacaaagatcactgct10200

tctcaatgaagcaaggctacgacatgatgcatatttagctcaaattacccaagcaagtgt10260

ttattaaaataaaagcatgcatcttagacatcagcaataggttatctaagacgtcaatgc10320

atatttagcccagttttagacaacaatgcatatttagctaaaatattaaacatgcaatgc10380

ctctttagctcaaatttcagaccacagtattaaccggtggtgctacacttgaagaacaag10440

cttatctggagaagcagcccacatccagtcctgaaggcaaatcagatgtttgaacaaaat10500

cagtgcttagccaaatctagaaaccacaaccactcgaacgctgactcagatggtagagcc10560

atcacaccaatggtgctaccatcagtaaaataatagaatcaatttgtttttccttactga10620

tacagtacccttcgaataattgtttataagcaattgaacttgctgcccaatgtattatag10680

tcttggtcnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn10740

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnttagagacatat10800

agagtagtatagataaggataatggacagtagtacttgaagttaatacataaaacttgca10860

gctaataaacaaacatagtactattttagttaacatacaacaattttccagctcttcttt10920

gcacttatagcatttttagtacctacacacaacctgaactgaacaccatcaagaacagta10980

ctgcatttggaagtttgaataaaaatggtgtccaattaacagttttctaatattgtaact11040

taattattgaatggtgcaatagggtgttcacttgttgaattggttgataatcttgtgtgg11100

gctttaaatataacattgttagtgaatgctaaataattctgcactcttggaatggtcaag11160

ctgagaatgagccagagattaactggtgcgaacctgggaagcctgcagggatggaggtcc11220

ttcttgggagtaaggatgggttcttgacgaagaagattaagagctatagcacatgccgaa11280

gtgatcctacaaagccacgtgcattgcaatgcctttcaccactgatagcaggtcatgtaa11340

gatacatcatttaaccatctggatatgtgttaatgttagacaactaatttataaaatttt11400

cgattcaggtgggcaatggaattgtttgttgaatgagagtaatgtttattattggtaaag11460

tatcacaattcagtctgattcaggtgggcatggtaaaatgaagtgcatctgccaaaaagt11520

aaggtaagtaactaagtatcttttggatgtttggaggatcagatgttgttgcagaatgtt11580

tattgactaattgcaagtcaaatgaattaaaggggaaaacagataagcactaagattttt11640

atatgaaagctgtgtgagtcaacctgctaagagctctaaatgtcctgtcccagaagtagt11700

ggcggagccagcccaaatttgaagcctatcgactcagaaaattcatccatgccaaccaaa11760

caaatagaataattaccttatctgttgcacgggtgtgaaggagctggagatgcagatcag11820

gtgtcacaaattctataaccaaattgcataatgcataagcaatcaaacaatacaccccta11880

attctttaggctaaacgaactgcactaagcctatgtttcagatttcaagattactattgg11940

attgtgttcatcacaccaataggttaagtaccaacatgctaatcaataagaacagtaaag12000

gggcattacctacaacaaatgttacgtctaaatagaactaacttggaggcaatgtaagct12060

ctgaatgtccgaaacattgaaacaaagtctgaactctgaaattgtctccctaatcctcac12120

ctcatcccatctgtgtcgatggatggtgggtgcctcgcctgagctgttgtgtggtgccgc12180

ggacgaggtgagcgcttgtgtagaatgaagatggggtcgcagagcgacgtggccgacggc12240

ggaccgatgatggagagggtactttttactttcattaacagatcatcaagcaaaataaaa12300

tagtaatgagcaattgagcccagatcacatttgtaatcatctagcatgtaagcactattt12360

tttaatttgatacccctcattcagagtttttgcacaaacttagaactggaggctgaaagt12420

aagcaaattgacttcatcttttagtttgatgaagatcaaatgaatttaaaagcttgagaa12480

gatgaaaagatctcatatgtatgaccaagagatgatagcaaatggcttcaagccaataaa12540

gtcacgagagatgaaagcaaataggttcacatgcaggatacaacagaaagttatgtatac12600

aggaacaaatgctaatagaactatgcatttatcatattttagaaactgtagcccttgttc12660

atcctacccccaatgagtattcctatcagtacacatgtaggctagcatttttttagttgt12720

aatacttgcttagctacagatcactggctagctgaaaaaaacctaagtagcaacagtgga12780

acagtttgagtatagctgtatgaaaaaggctatctgaataatagcaatatttggattcat12840

gttcatttcaagaacttcaccattgggaaaactccaacggtactaccatttgattgctgc12900

aatgtttgtaatgataaagagatctcacttactcacttgtggtaccatagaaacctgcaa12960

aaaagaggaaaataatataaattggtgtaaaaattaaggaattcatagtacaaatatgat13020

cgactttataggaccacccttttgttaaatttttcagtgtcaacaggaaggtgtgattca13080

ggcctttgtggatcaagtcagcacagaaccttggcttcttcattcggtagaatgatgaca13140

agactaccgaatgatgacaagacttttaaaaatatcctgttgaaaatgcatttaccatgt13200

cagctgtgttaccaagacgggtaattact13229

27

1515

DNA

Artificial Sequence

cDNA

27

atggcggttgtggaggcgtgcgaccccgcaaggatcgacatctcgtccaagacagcagcc60

ccctcagagcggacgacgcagcttatctctggtggaaggcacgtagccacgcccccactc120

cctcctcgtgctcccgccgacgcctgcctcgcgctcctcgtcgaccgtctctccctctcc180

tccgtcgttgcatctcgcctctcgcgcacgcacaaagatgccccaaatgtgctgaactta240

agctatcgcgtgagaacacaactttctgcttctactttaattgatctcaatggggcattt300

gggtggctagcaattcacattgataatttaaaagtgaatttcaggtgtacatttgatggc360

ctccgatatggtgctgccttcaattctctacaatgcgcgagaatgctgctcaggaggatg420

gctgaaaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaag480

aactcaagcgagaagctcaaagaaggggaagacatctacatctatcagggtgacaaatcc540

ttttgcagcatggagtgcagagaaaatttcatggtagacgagatggaaggcaagggaggc600

tggaagcaccttttttgcatgggagctgcatggagagaacttattggtagaaaacaaact660

ttgaccagattcttgaatgtgccaagggaaaaaggtattaattgggaccttctatttata720

caagggaaaaaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagtgat780

gagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggcccggc840

atgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcattgc900

aagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgtttat960

aactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttagacaa1020

aatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagcaggt1080

gaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaacaa1140

atgttcggtttggacttctctgggtggttctatgatactttataatttcttttggcttat1200

gtgtcatttcagaagatatgaaaatagctaagcattgtcaataatattagcttccttgtt1260

tcttgttaacctaagatgactgccctatttcacttgttttttctagctgtggtagtaggt1320

cattagagtagttcccttcaatcgtagcactcagccatgttgtaaatggtttacttctat1380

tgtgaatcgtgtttttcttttttaatggtggggttagtggaaaatctggacgtatccgcg1440

aatggtggagaagcacacaaaacaacaattatatgcaaaaactaaactatgttttctttc1500

tttctatggtatatg1515

28

2229

DNA

Artificial Sequence

cDNA

28

attgatctcaatggggcatttgggtggctagcaattcacattgataatttaaaagtgaat60

ttcaggtgtacatttgatggcctccgatatggtgctgccttcaattctctacaatgcgcg120

agaatgctgctcaggagggtattaatggctcaacacagatgacctcctcggagtcatgtt180

tctaattatctacactatgattctccttctgttgataaaatattgttttattgtgctgtg240

agctaatgataacagtgatggtaagtaaatatggtccatgcatattctcatcatagatgg300

ctgaaaaactccgagtgctgctacgctaccagagtcttcatgtgcatacttacttcaaga360

actcaagttgggcaagaccaaagtgaaagtgggagaatacccgaagaacttcttgttgga420

cgaacttggagaaaccaatactaaaactcagtgtcaaccgcttgcaacaggcaatttagg480

tcgatgtgctcgcacgctgtgtgaccatgtctgagcactccccacccacccaatcgcttc540

ccacgtcatgccgccacgtcgagaatttgtacacaactcaggttgcccattttactctgt600

tattgaacctcgcttttctgtagacccaggtatgctagaagtaggtaatggtagcatgac660

cttagcagagtatcgctcacattttagcatttgggttgtcatgaaggtaagttttcactg720

agattggaccgatgttgtgtcatcatattttgagtaaagggtttgaccgaagaattttga780

aaaataaaagactgtagttcattgaaggtgataactggtatgcaagtattactattaatg840

attctccacttactgatattacatttggataagaggaaggagatatcctaattctaatgg900

gttaactgctagcagttccttatgtactgttatgcttcaggcccctttattaattgcctc960

atgtggaaagagggtgtgatgatcttttgttgtttttgtagcgagaagctcaaagaaggg1020

gaagacatctacatctatcagggtgacaaatccttttgcagcatggagtgcagagaaaat1080

ttcatggtagacgagatggaaggcaagggaggctggaagcaccttttttgcatgggagct1140

gcatggagagaacttattggtagaaaacaaactttgaccagattcttgaatgtgccaagg1200

gaaaaaggtattaattgggaccttctatttatacaagggaaaaaggtattttttatacaa1260

ttgctcttccttctcatgtatggcaacttcctgttttgtagagattgagcgcccttcctc1320

actgtttgaactttactcatcctcatgaaactgtgatgctacctgtaggagcggtttgtt1380

caaggagggaatgacttttagaccctaataacttgtctacacttaagataaaaattgttg1440

tcattgctatggttacctcctttagtgtcgtaaactcagaataatcatagatccctttgt1500

tgcttacacattattcatgttctacaagctattgagattttgactagccgctacatttta1560

gtgcaggttttcataatttcttacctttatatctacattttagatttcctcccttttgac1620

actacattttagccgctcactcggaagctttccatccctgctggagtactcagcccataa1680

aggcttctcattcttatttgcatggcctgtaactgctgcacagcgcgctacggctggcgt1740

agcggcagcaactgctggtatgaacaggatccgtccaccctcattagcgacgatgaagtg1800

acacccaattctaggcgcggacgtacgatggcgccctcgcgcgggatatcgggctgagtg1860

atgagacgctgctgccattggggttccaggcgcgcctgaccgctcccactcctaggcccg1920

gcatgtctctccgacgctacctaccggtgctggtgaaacgcaaacttaaaatcgtgcatt1980

gcaagatgttgccttctctttatgtctcctctactctacgcctacgagctttctttgttt2040

ataactttttctgtggtttcgctttcaacaagctcaagcggtcaacagatcaacttagac2100

aaaatgcagatgcagtggaaatattaaggaagaccagattccctcatgttcatggagcag2160

gtgaaaagaagtcaccagagacaattctggatcatgagactatgagaggactttggcaac2220

aaatgttcg2229

29

579

DNA

Artificial Sequence

cDNA

29

aagttttcactgagattggaccgatgttgtgtcatcatattttgagtaaagggtttgacc60

gaagaattttgaaaaataaaagactgtagttcattgaaggtgataactggtatgcaagta120

ttactattaatgattctccacttactgatattacatttggataagaggaaggagatatcc180

taattctaatgggttaactgctagcagttccttatgtactgttatgcttcaggccccttt240

attaattgcctcatgtggaaagagggtgtgatgatcttttgttgtttttgtagcgagaag300

ctcaaagaaggggaagacatctacatctatcagggtgacaaatccttttgcagcatggag360

tgcagagaaaatttcatggtagacgagatggaaggcaagggaggctggaagcaccttttt420

tgcatgggagctgcatggagagaacttattggtagaaaacaaactttgaccagattcttg480

aatgtgccaagggaaaaaggtattaattgggaccttctatttatacaagggaaaaaggta540

ttttttatacaattgctcttccttctcatgtatggcaac579

30

2512

DNA

Zea mays

30

cccgctacctgttcaccgcgcgccagcgaaacctccgcacgcccactgcccatctgttcc60

ccgtgcgccagcgaaacatccgcacgcccgcggcccgcctgttccccgcgcatcccgctg120

cacgacttctgctaccgcaacggccacccacgcacgcccgcctgttcaccgcgcatcccg180

ctgacctccccttcacgctcgcacacgctccgttcccccaccccaccgcaatccccgacg240

ctataagagcggtaaccaactccatctccctggtgccacgcattgttgagttcttaaggt300

gcgtttcgttgaggacttgttcatttttgttggtcatgtattccattttactgctctacc360

attttgtggaataaagggaggaatgttttcactagaagagttcatcaatcttatgttggt420

ttcttggatcagttttgctctatggctaaatggtcgaattgagcctatttcattataaag480

ttagcgagcgaataattgttcagcctcttcctagaactcattaccagtagaatcagttac540

taactgcttttctttttcttggattagaatggctggggctatctctcaccatgcgctagc600

attttcacaatcccactggtgcagtgcgaagaactctagattcggaaagaggacgggcaa660

tgctcgcctggtttatctaaaaggaagatgtggttcaggcagcagaaaactgggtttgat720

gtgggcctcgagctcgcagtcttctgtcatggagccgacgcacctaccatctgatggcaa780

cagcagccacaccccaaaaaaatcaagtaattttaacgacctcctatggtggttatttgt840

ttttaatttgagaaaactatccatttgacacatttaactttgggcttctcagaatttggg900

ggcatataataagatctgctaatctgttatctctatgtcgttgtaggtgaaagcgctctt960

atattgatttggcatggtgaatccctgtggaacgagaaaaatctatttcctggctgcatc1020

gatgtacccctgacaccgaagggtgttgaggaggccattgaggcaggtaaaaggatatgc1080

aatatcccaatcgatgtgatatatacttcatcactgatttgtgctcagatgaccgcaatg1140

cttgccatgatgcagcatcgacgcaagaaggtttgtgtctttcctttgaaattccagtaa1200

tttcttctagcatttgtatgaacttgccggagaaatcatgctttgctggtgatatatgta1260

tttatagatcctagttatcacgcataatgagagtgaacaagctcacaggtggagtcagat1320

atacagtgaggagacaatgaaacagtccattcctgtcatcacagcttggcaattgaatga1380

acggatgtaatactttctccatactctttgatttgctaattactccctctgtctcaaaat1440

agtattaattttagctcttgatttttatgtctatattcaaatagatgatgataaatctag1500

attctagacacaaatataaaacatatacatcaagtattatatgaatctattaatttacta1560

agaccaattttaatttgggacagagggagtatacgattataatagttgtttgactgtgct1620

tctctttaaatatcccttgacatttctaggtatggtgagctacaaggccttaacaagcaa1680

gaaactgtagatcgatttggcaaagaacaagttcatgagtggcgccgcagttatgatatt1740

cctccgccaaatggagaaagtctagagaagtgtgctgagagagctgttgcttatttcaaa1800

gatcaggcacatctagcaaggccactttacactaattgaaagatacactttttacttggg1860

ttattggtcttgctgcagtattggtatgcatgctaaaggttattcttgaatcgatgaatt1920

cctctactatgggatgcagaaatgcatgtgcttagttttctttctattgtgctagctcat1980

atcaaatttataacctgaattttttatttatgttcgactctaaaaaacagttttttctag2040

ctcgatttgacctatagtaatttttccgtaatagattattccacaacttgtggctggaaa2100

acatgtgatggttgctgcacatgggaattcacttcgttcaattataatgcatctggacaa2160

attaacttctcagaaggtaattcactgtcgtttttgtctttccatcaaaaaggactcggc2220

taaacagaacatgtagcattatgttaagtttgggagtgagcctttcgtcccttcaggtaa2280

taagccttgagctgtctactggcattcccatgctttacatattcaaagagggaaagttta2340

ttcgacgtgggactccagtaggaccttcggaggccagtgtttatgcttataccagggtaa2400

gattctttcccccacatgttctaccataggacgatactccagtttacaaaccttatctgt2460

acagaccaaacgatttgctgagcacattacatttcagaacaaattggcctag2512

31

1005

DNA

Artificial Sequence

cDNA

31

atggctggggctatctctcaccatgcgctagcattttcacaatcccactggtgcagtgcg60

aagaactctagattcggaaagaggacgggcaatgctcgcctggtttatctaaaaggaaga120

tgtggttcaggcagcagaaaactgggtttgatgtgggcctcgagctcgcagtcttctgtc180

atggagccgacgcacctaccatctgatggcaacagcagccacaccccaaaaaaatcaagt240

gaaagcgctcttatattgatttggcatggtgaatccctgtggaacgagaaaaatctattt300

cctggctgcatcgatgtacccctgacaccgaagggtgttgaggaggccattgaggcaggt360

aaaaggatatgcaatatcccaatcgatgtgatatatacttcatcactgatttgtgctcag420

atgaccgcaatgcttgccatgatgcagcatcgacgcaagaagatcctagttatcacgcat480

aatgagagtgaacaagctcacaggtggagtcagatatacagtgaggagacaatgaaacag540

tccattcctgtcatcacagcttggcaattgaatgaacggatgtatggtgagctacaaggc600

cttaacaagcaagaaactgtagatcgatttggcaaagaacaagttcatgagtggcgccgc660

agttatgatattcctccgccaaatggagaaagtctagagaagtgtgctgagagagctgtt720

gcttatttcaaagatcagattattccacaacttgtggctggaaaacatgtgatggttgct780

gcacatgggaattcacttcgttcaattataatgcatctggacaaattaacttctcagaag840

gtaataagccttgagctgtctactggcattcccatgctttacatattcaaagagggaaag900

tttattcgacgtgggactccagtaggaccttcggaggccagtgtttatgcttataccagg960

accaaacgatttgctgagcacattacatttcagaacaaattggcc1005

32

394

PRT

Zea mays

32

metalavalvalglualacysaspproalaargileaspileserser

151015

lysthralaalaprosergluargthrthrglnleuileserglygly

202530

arghisvalalathrproproleuproproargalaproalaaspala

354045

cysleualaleuleuvalaspargleuserleuserservalvalala

505560

serargleuserargthrhislysaspalaproasnvalleuasnleu

65707580

sertyrargvalargthrglnleuseralaserthrleuileaspleu

859095

asnglyalapheglytrpleualailehisileaspasnleulysval

100105110

asnpheargcysthrpheaspglyleuargtyrglyalaalapheasn

115120125

serleuglncysalaargmetleuleuargargmetalaglulysleu

130135140

argvalleuleuargtyrglnserleuhisvalhisthrtyrphelys

145150155160

asnserserglulysleulysgluglygluaspiletyriletyrgln

165170175

glyasplysserphecyssermetglucysarggluasnphemetval

180185190

aspglumetgluglylysglyglytrplyshisleuphecysmetgly

195200205

alaalatrparggluleuileglyarglysglnthrleuthrargphe

210215220

leuasnvalproargglulysglyileasntrpaspleuleupheile

225230235240

glnglylyslysalaargthrtyraspglyalaleualaargaspile

245250255

glyleuseraspgluthrleuleuproleuglypheglnalaargleu

260265270

thralaprothrproargproglymetserleuargargtyrleupro

275280285

valleuvallysarglysleulysilevalhiscyslysmetleupro

290295300

serleutyrvalserserthrleuargleuargalaphephevaltyr

305310315320

asnphephecysglyphealapheasnlysleulysargserthrasp

325330335

glnleuargglnasnalaaspalavalgluileleuarglysthrarg

340345350

pheprohisvalhisglyalaglyglulyslysserprogluthrile

355360365

leuasphisgluthrmetargglyleutrpglnglnmetpheglyleu

370375380

asppheserglytrpphetyraspthrleu

385390

33

128

PRT

Zea mays

33

mettyrcystyralaserglypropheileasncysleumettrplys

151015

gluglyvalmetilephecyscysphecysserglulysleulysglu

202530

glygluaspiletyriletyrglnglyasplysserphecyssermet

354045

glucysarggluasnphemetvalaspglumetgluglylysglygly

505560

trplyshisleuphecysmetglyalaalatrparggluleuilegly

65707580

arglysglnthrleuthrargpheleuasnvalproargglulysgly

859095

ileasntrpaspleuleupheileglnglylyslysvalphepheile

100105110

glnleuleupheleuleumettyrglyasnpheleuphecysargasp

115120125

34

335

PRT

Zea mays

34

metalaglyalaileserhishisalaleualapheserglnserhis

151015

trpcysseralalysasnserargpheglylysargthrglyasnala

202530

argleuvaltyrleulysglyargcysglyserglyserarglysleu

354045

glyleumettrpalaserserserglnserservalmetgluprothr

505560

hisleuproseraspglyasnserserhisthrprolyslysserser

65707580

gluseralaleuileleuiletrphisglygluserleutrpasnglu

859095

lysasnleupheproglycysileaspvalproleuthrprolysgly

100105110

valgluglualaileglualaglylysargilecysasnileproile

115120125

aspvaliletyrthrserserleuilecysalaglnmetthralamet

130135140

leualametmetglnhisargarglyslysileleuvalilethrhis

145150155160

asnglusergluglnalahisargtrpserglniletyrsergluglu

165170175

thrmetlysglnserileprovalilethralatrpglnleuasnglu

180185190

argmettyrglygluleuglnglyleuasnlysglngluthrvalasp

195200205

argpheglylysgluglnvalhisglutrpargargsertyraspile

210215220

proproproasnglygluserleuglulyscysalagluargalaval

225230235240

alatyrphelysaspglnileileproglnleuvalalaglylyshis

245250255

valmetvalalaalahisglyasnserleuargserileilemethis

260265270

leuasplysleuthrserglnlysvalileserleugluleuserthr

275280285

glyileprometleutyrilephelysgluglylyspheileargarg

290295300

glythrprovalglyproserglualaservaltyralatyrthrarg

305310315320

thrlysargphealagluhisilethrpheglnasnlysleuala

325330335

35

637

DNA

Artificial Sequence

cDNA

35

gctacgtgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttg60

gacgagatgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttca120

aagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgagga180

gcaacaaaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaagg240

atcttgattttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagat300

gccccagccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagaca360

gaaagacagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtat420

tggctgagctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatac480

atgtttccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaat540

aattttgagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagta600

atagaggttagatgttcacgtgtaggcatcacattgg637

36

691

DNA

Artificial Sequence

cDNA

36

tatagggagagcggccgccagatcttccggatggctcgagtttttcagcaagatgctacg60

tgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttggacgag120

atgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttcaaagaca180

actgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgaggagcaaca240

aaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaaggatcttg300

attttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagatgcccca360

gccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagacagaaaga420

cagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtattggctg480

agctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatacatgttt540

ccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaataatttt600

gagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagtaatagag660

gttagatgttcacgtgtaggcatcacattgg691

37

2146

DNA

Artificial Sequence

cDNA

37

atgagaagaatcagagacctagaggggtgagtcgggggaggggagactcacctttgtgcg60

tgcgcgagaggcgagatgcaacgacggaggagagggagagacggtcgacgaggagcgcga120

ggcaggcgtcggcgggagcacgaggagggagtgggggcgtggctacgtgccttccaccag180

agataagctgcgtcgtccgctctgagggggctgctgtcttggacgagatgtcgatccttg240

cggggtcgcacgcctccacaaccgccatgcggcaaggtcggtcctcgtggggccgtgcga300

caccgaggccgagggcggttcccaagctgcctgcgtcgaggccctagagcggcggtggcg360

acgacgttgggggaggggcggtggaggcaaagggaggaagggaagcggtggaggccgcgt420

cggcggtggaggccgcgaggcattctatcgcggcaggggcagggggcgggcaacgcgaga480

gagagaggaacgtgtgggggaaggttagggattgggtggtcgtgcggtggagggagatag540

agtgtgggagacgcagacgcacgatgcgatatgaacggtccagatcgatggatggcagaa600

cggcagaacgagggaggcagactactattgcttacttaataagtagtagagatttctgtt660

ttttacatcattttagtttaccgacaatcctactatgaccagaagtcctacttagttggg720

gctacttctacaagctgatattttggcccattctcaatattcacttattaactgtcacgt780

tacacctaacagtggtttcttctcatggttgtagtaatgcttagttgtcaataatctatt840

gttggcagttctcctgttttgtcttattgcttgtagtttatgagtcctttttagcttcaa900

gatcaatctaaattttccttgccaactttcttttgcttgtctaaataagttatgttttca960

ggttcaaagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcat1020

tgaggagcaacaaaagcttggtggatcaattaaagcttctgagggtcagaaactttctga1080

ttcacctccaagcttagatgactacccagagatcatgcaacattctcccaagaaaccaag1140

gatagacgcattatcactagattcagagcgcgatatagcacagcctaaatttgaatccca1200

tttgatcggtctgtgggatcacgacattgcattcccagtggaggagttcaaagcagaccc1260

tgctatgagcaagtcataaggcaaacttcaccttgacagaaaattttcagagctgaccac1320

ctgctaaacaaaaataaaaaatatgtaccttggatagggacttgttccaagtttttctat1380

atcttgggtaagcaagtgatttggaaatggttatacatgacaggtgctgcatgtttgtca1440

tagcaaactctggttattgttctacttgttttttgtcataatttttgcatcaaaactaaa1500

ctattaagtggaagtataatttactcatctcgagggggatggagcatgtttggctgattg1560

tcattatagttagcctctctgatagtgttcttagatgtacttctatatatgttgcaggac1620

attgagagtgatgacaaggatcttgattttgaggttgatgatggagctgaggatgaacct1680

aaagcatgatccgaaagatgccccagccgaacctgctatcaagacagttgtgggccctag1740

ctgcgccacccaaagagacagaaagacagttgtctaaaaaggagatgaaaaaaaggaact1800

agcagaacttgatgcagtattggctgagctgggactttctggtattcgagcagcgctgca1860

caggatggtgagagtcatacatgtttccgttaaaactgcctgatctattttgtatgctcc1920

ggtttagagagagtctaaataattttgagtattttgaaattctagatgagttgtaaattt1980

ccaacaagactattgtagtaatagaggttagatgttcacgtgtaggcatcacattggtgt2040

gcttttgggttacccacttaggtggattgcctgagtcattcatgaaatattattgtgtat2100

catcgcaacacagtgaattggttgttggcaaaaagaatttgagaca2146

38

637

DNA

Artificial Sequence

cDNA

38

gctacgtgccttccaccagagataagctgcgtcgtccgctctgagggggctgctgtcttg60

gacgagatgtcgatccttgcggggtcgcacgcctccacaaccgccatgcggcaaggttca120

aagacaactgcaactaacaattaaagcacaaggaatatacttgcagatgatcattgagga180

gcaacaaaagcttggtggatcaattaaagcttctgagggacattgagagtgatgacaagg240

atcttgattttgaggttgatgatggagctgaggatgaacctaaagcatgatccgaaagat300

gccccagccgaacctgctatcaagacagttgtggccctagctgcgccacccaaagagaca360

gaaagacagttgtctaaaaaggagatgaaaaaaaggaactagcagaacttgatgcagtat420

tggctgagctgggactttctggtattcgagcagcgctgcacaggatggtgagagtcatac480

atgtttccgttaaaactgcctgatctattttgtatgctccggtttagagagagtctaaat540

agttttgagtattttgaaattctagatgagttgtaaatttccaacaagactattgtagta600

atagaggttagatgttcacgtgtaggcatcacattgg637

39

55

PRT

Zea mays

39

metalaargvalpheglnglnaspalathrcysleuproprogluile

151015

sercysvalvalargsergluglyalaalavalleuaspglumetser

202530

ileleualaglyserhisalaserthrthralametargglnglyser

354045

lysthrthralathrasnasn

5055

40

56

PRT

Zea mays

40

metthrargileleuileleuargleumetmetgluleuargmetasn

151015

leulyshisaspprolysaspalaproalagluproalailelysthr

202530

valvalalaleualaalaproprolysgluthrgluargglnleuser

354045

lyslysglumetlyslysargasn

5055

41

1327

DNA

Zea mays

41

tattgttgcctcctcctcatctcatcactagtcactcaaccgcaattgattgaaaattgt60

gttcatcatctcgttggatcgatcataattctttcatttctggcctcgacaagtatcgag120

ctcattaatccatcaatccaatgtgtgttctgtcgaaggcgacaatggtgagctacttat180

cgcggcgtccatttaatggctgcagcacaaaggcgatggacgtgatcgtggtcgacaaga240

ccatcgtgccggggggggaggggggtagagggtgacggtgctgatgatggatggcgatgg300

tatccggggtctcatcccggaaaccattcttgccttcctcgaggcgagggtgcaggatct360

ggacaggctggaggcgaggctcgcagactacttcgactacatcgccaggaccggtgggct420

cgtcatcacgctgctcacttcgcccggcaaggacaagcggcctctctacgttgccaagaa480

catcaaccacttgttcttgcatccattcacatcgccctaatcacatcaatgtatagagga540

ctatgatggatggatgcaagaacaatgacgccagatggaattcacttttgagggaagaca600

tgtgggctgttctccatgtagaagtggttgatgtccttggcaacgtagagagaccgcttg660

tttttgccggtcccgatgaacatggcggtgatgatctcatcggtgctattcctagtgatg720

tagtcgaagtagtccgcgagccttgccttcgtcccgtccaactcctacggcatggcctcg780

aggaaggtgaggatggttcccgagatgagaccccagatggcgcctctgtccaccgttatc840

actgtcaccctcggttcaacacgacgggctcatcgaccatgctcccggccatcgccttcg900

tgctgcatgtgttgcatggacgccttgatgagtagctcaccgttgccgcctttgaccaag960

cacacacccaattgatcgattaatgaactagatactcattgaggccacgaatgaaagaat1020

tatcatcaaaccaacaaatgacggacacaattttcaatcgattatggatgagtgtgagta1080

gtgatgagctgaggaagatgcaatgatagatcgattgtgtacatatataggcactgcgta1140

cgtgctgcccctttttggagtgacaaataggaactagcgcgcgtatttttgcatacaacc1200

actactaaataagagatatatgtaaaatttaacgcaagggatatagggaagagatatttg1260

tccattgcaatgtattttgaagctgtccacatatactatttatgaagaaacggattatgc1320

caagtat1327

42

714

DNA

Zea mays

42

atcattacataacttatgctatattttcccgagtatgtcctaacatcttccacagtgttt60

ttatgggctccttagaagttccagcccaggggcctgaaactattaaagttccaactgctc120

attatgaatttggtgccaattttttagatccaaagttaatgctcattggaagggtgataa180

cagatggaaggcttaatgctcgcgtgaaatgtgatttgacagacaatctcacgctgaaag240

taaatgcacagcttacccaagaggcacattactcacaaggaatgtttaactttgactaca300

aggttgacgtttctgacaagtcagacgtaacgagggcgtccacaccgcggctccgccgga360

catcgcaacaatctccccgccccagctctcctctccctgcgccgaggccacaatccctgc420

cgccccggctctcctcgtccccaaatcttgcacgcggtcgtaatccccgccgcctcgctc480

tcctcgcccctagatcgccgcctccactatcgctgatataccagaccaagcaggtagagc540

agaccaagatgtcgctcgaggaggccaagctggagatggccacgctgctgcagcagcagg600

cgagcaagtcatgcatggtactaagtcctgcatggtactaatggttgtaatgtagtgatg660

aaatagctagattaaaataacaaaatttatgtatggctaggatcacaaatagat714

43

460

DNA

Zea mays

43

ccagcccaggggcctgaaactattaaagttccaactgctcattatgaatttggtgccaat60

tttttagatccaaagttaatgctcattggaagggtgataacagatggaaggcttaatgct120

cgcgtgaaatgtgatttgacagacaatctcacgctgaaagtaaatgcacagcttacccaa180

gaggcacattactcacaaggaatgtttaactttgactacaaggacgtaacgagggcgtcc240

acaccgcggctccgccggacatcgcaacaatctccccgccccagctctcctctccctgcg300

ccgaggccacaatccctgccgccccggctctcctcgtccccaaatcttgcacgcggtcgt360

aatccccgccgcctcgctctcctcgcccctagatcgccgcctccactatcgctgatatac420

cagaccaagcaggtagagcagaccaagatgtcgctcgagg460

44

192

PRT

Zea mays

44

metglyserleugluvalproalaglnglyprogluthrilelysval

151015

prothralahistyrglupheglyalaasnpheleuaspprolysleu

202530

metleuileglyargvalilethraspglyargleuasnalaargval

354045

lyscysaspleuthraspasnleuthrleulysvalasnalaglnleu

505560

thrglnglualahistyrserglnglymetpheasnpheasptyrlys

65707580

valaspvalserasplysseraspvalthrargalaserthrproarg

859095

leuargargthrserglnglnserproargproserserproleupro

100105110

alaproargproglnserleuproproargleuserserserproasn

115120125

leualaargglyargasnproargargleualaleuleualaproarg

130135140

serproproproleuserleuiletyrglnthrlysglnvalglugln

145150155160

thrlysmetserleugluglualalysleuglumetalathrleuleu

165170175

glnglnglnalaserlyssercysmetvalleuserproalatrptyr

180185190

45

127

PRT

Zea mays

45

metleuileglyargvalilethraspglyargleuasnalaargval

151015

lyscysaspleuthraspasnleuthrleulysvalasnalaglnleu

202530

thrglnglualahistyrserglnglymetpheasnpheasptyrlys

354045

aspvalthrargalaserthrproargleuargargthrserglngln

505560

serproargproserserproleuproalaproargproglnserleu

65707580

proproargleuserserserproasnleualaargglyargasnpro

859095

argargleualaleuleualaproargserproproproleuserleu

100105110

iletyrglnthrlysglnvalgluglnthrlysmetserleuglu

115120125

46

2979

DNA

Helianthus annuus

46

atgaataacatcaatcttgtaatagtttcgcttgtaatcgcgattgtagccatccaaccc60

cttgcgcaagagcaaaccgatgtaggtgaggcaaatttcgtcactgttcttagcatcgat120

ggtgggggtgttcgtggcattgttcccgccaccttgcttgcttttcttgaatccaaaatt180

caggtactcgaacttaaaatgcacatgtgcatcatattacaagctgtaacttattattga240

aatgtgccgtctcttcggataggaaatagatgggccagatgcacgaattgcggattattt300

tgatgtaatagccggaacaagcacaggagggctgatgacaactatgcttgcagctcctaa360

tgagaaaaatcgtcccatgttcgccgcaaaagacattaccaacttctactttcaacattc420

gcctaggatcttccctaaaatagggtaaactctaactagtttccggatctataagatcat480

cattaaatacaagtttcattttctttttcgaatcaaatacagacacacatttgatgaggc540

gcaaccttatccttctcaaaacgaagcctgcgaaatggggtattctcctacaaagacttt600

tgtaattcatgttctagtgggtgtttggatgtgcgttttaaaactgattattatttacat660

gtagtttctgaagaaaataaaacagttattcaaacactttttgttaataattctactaga720

aaaaaaaaatccttgtcaggaaattaattaaaaaaaagttaccatctattaaagttcttt780

cttactaatcaaaagtttttaaattttattatcatgttattataactaaacatacacatc840

caaacactatctcataccacatgattacacaagtctattatttgaatatgctaacttagt900

attttcatataataagtttttaaaacgccacatccaaaacccttgattcttattttacat960

tgtgtagctaaaacagtgtttatacataaaaacaatcagttatataaatcaaagcattat1020

ttaactaaagtaagctcggttcaaactcgataagagaattaatatatacgagtcgagttc1080

ctgttgaccagatttcgctagtgttaagtttcgagttcaaaattgtatatgaacttgaac1140

ctgtgtatcatcatacttgacatttaaaccataatgttgttgaataaataaagtgatttt1200

attttgtagtcggaccaaattcatgaattcggtagtaaccgtacttggtgaggccaccgg1260

accaaagtatgatggtaaatatcttcgagccatggcaaagatgatgttaaaaaacctcac1320

tattaaagatacgttgacgaatgttgtcatacctgctttcgacattaggcggcttcaacc1380

tgttatcttctcctctgctcaagtaattaaactcgttttttatatttatagcagttctct1440

atttaaaattgattgtgtatcataaaatggtttctgtttgatacgtttagggaaaagagg1500

tcgcgtggaaaaatgctttgctagcagacgtatgcattagtaccgcggcggcaccaacat1560

ttttcccgccatactattttgagactagagacgtcgatggaaccaagcacacttttgatc1620

taatcgatggcggggtagctgcaaacaatccggtagttacatttcaacaatattgagttt1680

gcattttatttttaggacaagtagtcacattagggtgaagggtgtgttcaagctcatccc1740

gaaggtgggagcggtgttcccactcgtacctcatggcgcattttcttctttgttgcagct1800

ccaattttaaaaagccaccccgccttttccattccatagcgccacgtcaactgggaaatg1860

gtgttcccactggtattggagatttggaggcgctacgccacctctgtcatcccgaagcca1920

caccctccacccttaggagtgttatcggttcagttttcggtttatacagtttaaaggttt1980

ttttttggttgaaaccaaaaaccgaactgaactgaacggaattcgggtagttcacaactg2040

aaccaaaaactgaatccatattcggttttctgtttgaccgaataatcattatttgttatt2100

tgattgtgcgaggttaaataagattgagcaaaaattgtaattaattggatttggctaaat2160

gacaatttaaataaccgttatttgttatccgattcaaaccgaacacccaactcgatttgg2220

tttaataccgaaccgacaaccggatacgtaattcagttcgctattaaaaggttcgggttt2280

ggtagggtttaacccatttgaaatcgaatcttcagagaaagggttcgctaatgtcaccca2340

cataaccaaagaaatcttgtttaaatgttaatggcagacacatttggctatcacacatat2400

aaccaaagaagcggtgatggggaaatacaggttctctggcccggaggttttcgacggaag2460

acggatgcttgtgctttcactcggcactggtacgcagacgtacaatgacttatatactgc2520

acaaaaggctgcaaaatgggggttgcttagttggatctttaccaatggtactgcgccaat2580

cctccgcatttttggtgatgccatgtcagatatggtcgacatccatgtgtcaactatatt2640

ccaatcgttgcaagtcgaaaaaaactatctgcgtattcaggtataactaagaacatataa2700

atataatgttgtataggttacatgtttagtaacaaggagtttttttatgggcaggaagat2760

aacttgaaaggggaagcaactgcaatggatatttcatcacctgagaacatgagggcgcta2820

gaggacattggcaagaaattgttgaagaaaccgttgtcgagattggatgtggagacaggc2880

aagcttgaaccagttaaaggagaaggtacgaatgctgatgcattagcacgtttcgccact2940

ttgctttgtgccgaacgaaagcgccgcaatccagcttaa2979

47

1302

DNA

Artificial Sequence

cDNA

47

atgaataacatcaatcttgtaatagtttcgcttgtaatcgcgattgtagccatccaaccc60

cttgcgcaagagcaaaccgatgtaggtgaggcaaatttcgtcactgttcttagcatcgat120

ggtgggggtgttcgtggcattgttcccgccaccttgcttgcttttcttgaatccaaaatt180

caggaaatagatgggccagatgcacgaattgcggattattttgatgtaatagccggaaca240

agcacaggagggctgatgacaactatgcttgcagctcctaatgagaaaaatcgtcccatg300

ttcgccgcaaaagacattaccaacttctactttcaacattcgcctaggatcttccctaaa360

ataggacacacatttgatgaggcgcaaccttatccttctcaaaacgaagcctgcgaaatg420

ggtcggaccaaattcatgaattcggtagtaaccgtacttggtgaggccaccggaccaaag480

tatgatggtaaatatcttcgagccatggcaaagatgatgttaaaaaacctcactattaaa540

gatacgttgacgaatattgtcatacctgctttcgacatcaggcggcttcaacctgttatc600

ttctcctctgctcaaggaaaagaggtcgcgtggaaaaatgctttgctagcagacgtatgc660

attagtaccgcggcggcaccaacgtttttcccgccatactattttgagactagagatgtc720

gatggaaccaagcacacttttgatctaatcgatggcggggtagctgcaaacaatccgaca780

catttggctatcacacatataaccaaagaagcggtgatggggaaatacaggttctctggc840

ccggaggttttcgacggcagacggatgcttgtgctttcactcggcactggtacgcagacg900

tacaatgacttatacactgcacaaaaggctgcaaaatgggggttgcttagttggatcttt960

accaatggtactgcgccaatcctccgcatttttggtgatgccatgtcagatatggtcgac1020

atccatgtgtcaactatattccaatcgttgcaagtcgaaaaaaactatctgcgtattcag1080

gaagataacttgaaaggggaagcaactgcaatggatatttcatcacccgagaacatgagg1140

gcgctagaggacattggcaagaaattgttgaagaaaccgttgtcgagattggatgtggag1200

acaggcaagcttgaaccagttaaaggagaaggtacgaatgctgatgcattagcacgtttc1260

gccactttgctttgtgccgaacgaaagcgccgcaatccagct1302

48

433

PRT

Helianthus annuus

48

metasnasnileasnleuvalilevalserleuvalilealaileval

151015

alaileglnproleualaglngluglnthraspvalglyglualaasn

202530

phevalthrvalleuserileaspglyglyglyvalargglyileval

354045

proalathrleuleualapheleugluserlysileglngluileasp

505560

glyproaspalaargilealaasptyrpheaspvalilealaglythr

65707580

serthrglyglyleumetthrthrmetleualaalaproasnglulys

859095

asnargprometphealaalalysaspilethrasnphetyrphegln

100105110

hisserproargilepheprolysileglyhisthrpheaspgluala

115120125

glnprotyrproserglnasnglualacysglumetglyargthrlys

130135140

phemetasnservalvalthrvalleuglyglualathrglyprolys

145150155160

tyraspglylystyrleuargalametalalysmetmetleulysasn

165170175

leuthrilelysaspthrleuthrasnilevalileproalapheasp

180185190

ileargargleuglnprovalilepheserseralaglnglylysglu

195200205

valalatrplysasnalaleuleualaaspvalcysileserthrala

210215220

alaalaprothrphepheproprotyrtyrphegluthrargaspval

225230235240

aspglythrlyshisthrpheaspleuileaspglyglyvalalaala

245250255

asnasnprothrhisleualailethrhisilethrlysglualaval

260265270

metglylystyrargpheserglyprogluvalpheaspglyargarg

275280285

metleuvalleuserleuglythrglythrglnthrtyrasnaspleu

290295300

tyrthralaglnlysalaalalystrpglyleuleusertrpilephe

305310315320

thrasnglythralaproileleuargilepheglyaspalametser

325330335

aspmetvalaspilehisvalserthrilepheglnserleuglnval

340345350

glulysasntyrleuargileglngluaspasnleulysglygluala

355360365

thralametaspileserserprogluasnmetargalaleugluasp

370375380

ileglylyslysleuleulyslysproleuserargleuaspvalglu

385390395400

thrglylysleugluprovallysglygluglythrasnalaaspala

405410415

leualaargphealathrleuleucysalagluarglysargargasn

420425430

pro

49

1795

DNA

Artificial Sequence

TILLING mutant D74N

49

agttcatcactaatcacacttattgtgccctcgacgagtatctatagctagctcattaat60

cgattcgggggtgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgc120

aatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctgggg180

cagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaacc240

atcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggcg300

gactacttcaactacatcgccggaaccagcaccggcggtctcatcaccgccatgctcacc360

gcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaaccacttttacatg420

cagaactgcccgcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgc480

atccatccagcatcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtt540

tgtggcatgcatgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctga600

ggaagccaaagtacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgaga660

cgagggtaagcgagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgc720

agcctatcatcttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacg780

tcgtcgcatgcgaatggctgcctacgtacgccgtgcgctaacatactcagctctttccta840

tctgctgcgccaatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacg900

tgtgcattggcacgtccgccgcgccgacctacctcccggcgcactacttccagactgaag960

acgccaacggcaaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacc1020

cggtaactgactagctaactggaaaacggacgcacagactccatgtccatggcggcccac1080

aaggtcgatgctaattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggt1140

tgcgatgacgcagatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgta1200

cccagtgaagccgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgac1260

gtccgagcagggcctctacacggcgcggcagtgctcccggtggggtatctgccggtggct1320

ccgcaacaacggcatggcccccatcatcgacatcttcatggcggccagctcggacctggt1380

ggacatccacgtcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcat1440

ccaggacaactcgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacat1500

gcggacgctcgtcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgt1560

ggagacagggaggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgg1620

gctcgctaggcagctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccat1680

caacccaagaggctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtca1740

gtcacatgcttgtaaataagtagactttattttaataaaacataaaaatatatat1795

50

1284

DNA

Artificial Sequence

TILLING mutant D74N

50

atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60

agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120

ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180

gagctggacggaccggaggcgaggctggcggactacttcaactacatcgccggaaccagc240

accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300

gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360

agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420

cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480

atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540

aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600

ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660

tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720

atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780

tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840

ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900

atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960

gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020

ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080

gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140

tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200

ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260

tctagatgtgcgtcgtacgatatc1284

51

428

PRT

Artificial Sequence

TILLING mutant D74N

51

metalasertyrserserargargprocysasnthrcysserthrlys

151015

alametalaglyservalvalglygluprovalvalleuglyglnarg

202530

valthrvalleuthrvalaspglyglyglyvalargglyleuilepro

354045

glythrileleualapheleuglualaargleuglngluleuaspgly

505560

proglualaargleualaasptyrpheasntyrilealaglythrser

65707580

thrglyglyleuilethralametleuthralaproglylysasplys

859095

argproleutyralaalalysaspileasnhisphetyrmetglnasn

100105110

cysproargilepheproglnlysserargleualaalaalametser

115120125

alaleuarglysprolystyrasnglylyscysmetargserleuile

130135140

argserileleuglygluthrargvalsergluthrleuthrasnval

145150155160

ileileproalapheaspileargleuleuglnproileilepheser

165170175

thrtyraspalalysserthrproleulysasnalaleuleuserasp

180185190

valcysileglythrseralaalaprothrtyrleuproalahistyr

195200205

pheglnthrgluaspalaasnglylysgluargglutyrasnleuile

210215220

aspglyglyvalalaalaasnasnprothrmetvalalametthrgln

225230235240

ilethrlyslysmetleualaserlysasplysalaglugluleutyr

245250255

provallysproserasncysargargpheleuvalleuserilegly

260265270

thrglyserthrsergluglnglyleutyrthralaargglncysser

275280285

argtrpglyilecysargtrpleuargasnasnglymetalaproile

290295300

ileaspilephemetalaalaserseraspleuvalaspilehisval

305310315320

alaalametpheglnserleuhisseraspglyasptyrleuargile

325330335

glnaspasnserleuargglyalaalaalathrvalaspalaalathr

340345350

progluasnmetargthrleuvalglyileglygluargmetleuala

355360365

glnargvalserargvalasnvalgluthrglyargtyrgluproval

370375380

thrglygluglyserasnalaaspalaleuglyglyleualaarggln

385390395400

leuserglugluargargthrargleualaargargvalseralaile

405410415

asnproargglyserargcysalasertyraspile

420425

52

1795

DNA

Artificial Sequence

TILLING mutant G78R

52

agttcatcactaatcacacttattgtgccctcgacgagtatctatagctagctcattaat60

cgattcgggggtgtgttgtcgaaggcggcaatggcgagctactcgtcgcggcgtccatgc120

aatacctgtagcacgaaggcgatggccgggagcgtggtcggcgagcccgtcgtgctgggg180

cagagggtgacggtgctgacggtggacggcggcggcgtccggggtctcatcccgggaacc240

atcctcgccttcctggaggccaggctgcaggagctggacggaccggaggcgaggctggcg300

gactacttcgactacatcgccagaaccagcaccggcggtctcatcaccgccatgctcacc360

gcgcccggcaaggacaagcggcctctctacgctgccaaggacatcaaccacttttacatg420

cagaactgcccgcgcatctttcctcagaagtgagtccgatgctgccgccattgttcttgc480

atccatccagcatcgtacgtacgtcctctatacatctgcggatcatcatgtgcgcatgtt540

tgtggcatgcatgcatgcatgtgagcaggagcaggcttgcggccgccatgtccgcgctga600

ggaagccaaagtacaacggcaagtgcatgcgcagcctgattaggagcatcctcggcgaga660

cgagggtaagcgagacgctgaccaacgtcatcatccctgccttcgacatcaggctgctgc720

agcctatcatcttctctacctacgacgtacgtacgtcgtcacgaatgattcatctgtacg780

tcgtcgcatgcgaatggctgcctacgtacgccgtgcgctaacatactcagctctttccta840

tctgctgcgccaatttgcaggccaagagcacgcctctgaagaacgctctgctctcggacg900

tgtgcattggcacgtccgccgcgccgacctacctcccggcgcactacttccagactgaag960

acgccaacggcaaggagcgcgaatacaacctcatcgacggcggtgtggcggccaacaacc1020

cggtaactgactagctaactggaaaacggacgcacagactccatgtccatggcggcccac1080

aaggtcgatgctaattgttgcttatgtatgtcgcccgattgcacatgcgtagacgatggt1140

tgcgatgacgcagatcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgta1200

cccagtgaagccgtcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgac1260

gtccgagcagggcctctacacggcgcggcagtgctcccggtggggtatctgccggtggct1320

ccgcaacaacggcatggcccccatcatcgacatcttcatggcggccagctcggacctggt1380

ggacatccacgtcgccgcgatgttccagtcgctccacagcgacggcgactacctgcgcat1440

ccaggacaactcgctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacat1500

gcggacgctcgtcgggatcggggagcggatgctggcacagagggtgtccagggtcaacgt1560

ggagacagggaggtacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgg1620

gctcgctaggcagctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccat1680

caacccaagaggctctagatgtgcgtcgtacgatatctaagacaagtggctttactgtca1740

gtcacatgcttgtaaataagtagactttattttaataaaacataaaaatatatat1795

53

1284

DNA

Artificial Sequence

TILLING mutant G78R

53

atggcgagctactcgtcgcggcgtccatgcaatacctgtagcacgaaggcgatggccggg60

agcgtggtcggcgagcccgtcgtgctggggcagagggtgacggtgctgacggtggacggc120

ggcggcgtccggggtctcatcccgggaaccatcctcgccttcctggaggccaggctgcag180

gagctggacggaccggaggcgaggctggcggactacttcgactacatcgccagaaccagc240

accggcggtctcatcaccgccatgctcaccgcgcccggcaaggacaagcggcctctctac300

gctgccaaggacatcaaccacttttacatgcagaactgcccgcgcatctttcctcagaag360

agcaggcttgcggccgccatgtccgcgctgaggaagccaaagtacaacggcaagtgcatg420

cgcagcctgattaggagcatcctcggcgagacgagggtaagcgagacgctgaccaacgtc480

atcatccctgccttcgacatcaggctgctgcagcctatcatcttctctacctacgacgcc540

aagagcacgcctctgaagaacgctctgctctcggacgtgtgcattggcacgtccgccgcg600

ccgacctacctcccggcgcactacttccagactgaagacgccaacggcaaggagcgcgaa660

tacaacctcatcgacggcggtgtggcggccaacaacccgacgatggttgcgatgacgcag720

atcaccaaaaagatgcttgccagcaaggacaaggccgaggagctgtacccagtgaagccg780

tcgaactgccgcaggttcctggtgctgtccatcgggacggggtcgacgtccgagcagggc840

ctctacacggcgcggcagtgctcccggtggggtatctgccggtggctccgcaacaacggc900

atggcccccatcatcgacatcttcatggcggccagctcggacctggtggacatccacgtc960

gccgcgatgttccagtcgctccacagcgacggcgactacctgcgcatccaggacaactcg1020

ctccgtggcgccgcggccaccgtggacgcggcgacgccggagaacatgcggacgctcgtc1080

gggatcggggagcggatgctggcacagagggtgtccagggtcaacgtggagacagggagg1140

tacgaaccggtgactggcgaaggaagcaatgccgatgccctcggtgggctcgctaggcag1200

ctctccgaggagaggagaacaaggctcgcgcgccgcgtctctgccatcaacccaagaggc1260

tctagatgtgcgtcgtacgatatc1284

54

428

PRT

Artificial Sequence

TILLING mutant G78R

54

metalasertyrserserargargprocysasnthrcysserthrlys

151015

alametalaglyservalvalglygluprovalvalleuglyglnarg

202530

valthrvalleuthrvalaspglyglyglyvalargglyleuilepro

354045

glythrileleualapheleuglualaargleuglngluleuaspgly

505560

proglualaargleualaasptyrpheasptyrilealaargthrser

65707580

thrglyglyleuilethralametleuthralaproglylysasplys

859095

argproleutyralaalalysaspileasnhisphetyrmetglnasn

100105110

cysproargilepheproglnlysserargleualaalaalametser

115120125

alaleuarglysprolystyrasnglylyscysmetargserleuile

130135140

argserileleuglygluthrargvalsergluthrleuthrasnval

145150155160

ileileproalapheaspileargleuleuglnproileilepheser

165170175

thrtyraspalalysserthrproleulysasnalaleuleuserasp

180185190

valcysileglythrseralaalaprothrtyrleuproalahistyr

195200205

pheglnthrgluaspalaasnglylysgluargglutyrasnleuile

210215220

aspglyglyvalalaalaasnasnprothrmetvalalametthrgln

225230235240

ilethrlyslysmetleualaserlysasplysalaglugluleutyr

245250255

provallysproserasncysargargpheleuvalleuserilegly

260265270

thrglyserthrsergluglnglyleutyrthralaargglncysser

275280285

argtrpglyilecysargtrpleuargasnasnglymetalaproile

290295300

ileaspilephemetalaalaserseraspleuvalaspilehisval

305310315320

alaalametpheglnserleuhisseraspglyasptyrleuargile

325330335

glnaspasnserleuargglyalaalaalathrvalaspalaalathr

340345350

progluasnmetargthrleuvalglyileglygluargmetleuala

355360365

glnargvalserargvalasnvalgluthrglyargtyrgluproval

370375380

thrglygluglyserasnalaaspalaleuglyglyleualaarggln

385390395400

leuserglugluargargthrargleualaargargvalseralaile

405410415

asnproargglyserargcysalasertyraspile

420425

55

13516

DNA

Zea mays

55

tcttgctatatatgagatgacaaaattttccaaagaagagagaagccggcagaacccatc60

ctgtttcaaatctcttctactacttaagtttctaacgtaggcgtcgacaaaacggattgg120

tgcacggttctgccgatgtctcccacacacgcgcatggaaggaggcaggcacccttcccc180

gccgccccggatctcgcgccagccccagccctaccccgcctgcccttccattcttcccca240

gccgccccccggtcaacgtcacgaacccgggcctcgtgccgttcgccgtggccacgcggt300

tcgacgagcgggtcacggagctgctgagcgcgctcgctgacgcggcggcggggcgaccag360

gcaggtgggccatcggcgaagcgccatggtcgtcgtcggggggcaggaaccaggcggtgt420

acgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctc480

cttcatcgagggccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgc540

ttggaacgcctgcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgca600

gagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctg660

gtaaatagatgccgcgacacgttctggtttggggatccccttggctaacaggacatacga720

catttggggaatgggtagaaaagcagagattagggatttttcgtttccgtcggtgcagtt780

ttggtgttccaacggagttgcgagatgtttatgtgccttagtcttcaatttgggggttgg840

gggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaatggacctggagg900

caaggagcctacagcctagcattaaggctagtttgcttgcaaagctgagggagtataaat960

ctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaatgccaggcagg1020

ctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagctaatgat1080

aggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaaagaaat1140

cattcaacgtattcgccgaagagaactctacaaggtagtatgatgctttaattgctcata1200

tacaagtgtcattttgtcatgtcattacacatggttaggatacataggagattctgtttt1260

ttaacacatagttgtcccatgtccatgaattcatttgaattaatttactcttcgcaatct1320

tatacattaaaatcgtgttacctattacatcacaacttcatgagagcatgcttgttctgt1380

gtagatatggtagtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgc1440

gaaagcaacctctctggtctcgcatagccagagcaggagcagctcgcttgcgcggccgca1500

gcgctggcggtcggccccgcgtacgagcgcctgcaggtaggccagcttctgctgcaatgc1560

ccgaatctcggcgtccacgcgcagcagcgtcgtcgcctcctcctccgtcagctcacccag1620

cttggccagcacccccgtcacccccgcgtccgccatggctgtcgccgggaccgaaaggct1680

aaaactgtcacaatgacgtaaagtttggttggtgttggcggctcacgcaaaaccagacct1740

ttccaagttttactttagcagagtttttttggaacgagagcaaagcagcacagtttcaag1800

aatgtggggcaatttgaatgttcgttcctgctgcactgctactgcttttagaattgtagt1860

atgcttcatcatttatttatttctaaaaaaacttgcatgaattctatcgtgacttttatt1920

gagaaaataatgtattcacgtatcttcatgtttctgataaaggtatttgtatatgcatcg1980

gtgctacatatgcgaatacaagttttgtttcaactctgaagtctcaagttgaattctaaa2040

ctccagtttgttttctactgtgctgctgcaggaagccaggaacccatccgaacaaggttg2100

caatcatgataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatac2160

tttggccagcatgatggagctgcaggaggttcagttatttgcacacattgtttttctttc2220

actcctatgattttcctcaatatgatcaaaatgtttcttttgcaaataatgattgaaatg2280

tttctcattgtactcaacctcttaaactacctataggctttgcttgagagtaatcaggct2340

acaaaggatgccaatggtattgctgctctctatattgttcttgttctaatgtaaaaacta2400

caacacaactctttacttgatcccagaaattccttctgcctcaaatggagacaatgacga2460

gtggtcagaagtacagagattgcagacaaggtaaattttgcaatagaaataactaaccaa2520

ccattagtgcttgaaaaaaactggactggtgactggggcacgtggtttcatcaacatttg2580

gacctcaacggtctaatcagtataacttagaagttggctagctcttgaaaaacactgcat2640

gacactaagcatttgtttattttcagctgcttgcacccctatgatttcaagtaactactt2700

gtctacttgtgataatcacctgaatatgagtatttgaaatgcttatcacgtctcggcaat2760

tgcatttcttttatgcgtaactgaagtctgctctagcttcctaatagagttcatttttta2820

atacagaaaccactttgagatagccacaatatagtaaaagtggcagctaaggtactaaaa2880

acacccatgcaaataagaaaaaaaatgaatcttgtattttaattttgttaaatacctcta2940

tagtttggcgatatattatgttaccatcctgcttgtagcctgtaggtcattttatatgag3000

ccatcaaattgcgatgacagttgccacaaatccagtttcatatgaaggtattagctgtgt3060

aacaagctaactgctgctctctgcccaataagttattcaattggattagtaggttgcatc3120

caaggttattcaattggatcagtaggttgcatccaaggtatactgctgctctctgcccaa3180

taagttattcaattcgatcagtaggttgcctgttcccttcattttattaaaaaatacata3240

ataatataataagtacctgtttgttctaaaaataatacttctgtaaatgaggttattaat3300

tttccttttggtaataatgcaggttgatgatactgaagtcatcagttttttgttgcaaac3360

tgaaataatttctctgtgcttgcgaaccatggagatgggtagtgagctatccaaaactgt3420

atgtagctagccatatattctcattcaaatatcataatttatctcttctgcttaatactg3480

gcaaaggtgtaatagtttttttagtattgatttgtcacctgaagtttatcttgtgcacta3540

ctactttgccatcatcagttatctctagaatactcttatcctgtaccatcttctctctga3600

taagcctaaatttgtacaattcataagcctaaaaggtgacttatataatatatacaagga3660

ccctcaagagttgtttggcaattcagtgactgtcctgggtcctgttttggggagcttctg3720

gtagcttttgcttctccaaaagaaaagctagaagctccccccaaacagagcagcttcttc3780

aagccggtaaaagcttcaaaagctataattatactaaaaacagtgaagctccctcagagc3840

agcttcccagctctccaggagatgcttttggagaagctacagtttccccaaacagggccc3900

tgctctgttgaaccccccttcctgatacatatttgaatatgagtttatagtgtgtgtggg3960

ggggtgtaagtaggggggtaatgggttctaaattttatactataaaaattaaggatcaga4020

ttagaattgagctctatttctattcatttttgaactaaaattaattaagggctcaaatga4080

attatgaagaagcattaggatcatgatccattaccacccctacgtgtaagatgttttttg4140

gtggttgtggttgattttgaattttaaggccgcatatgtctcatggactacacaagctca4200

tattcatctacatttgtagccgtcactaacttagccaaatatgcatatgtggcggctgag4260

agcacctagagggggggggggtgaataggtgatcctataaaaacttgaaacataatgcca4320

caaaacttgattaggagttagcacaataaagccaagtgactagagaggagttcttgcaag4380

acacgataaccacacgaagatcaacacagatagacacaatggtttatcccgttgttcggc4440

caagtccaacacttgcctactccacgttgtggcgtcccaacggacgagggttgcaatcaa4500

cccctctcaagcggtccaaggacccacttgaataccacggtgttttgcttagtttcacta4560

tatcccgcttgcgaggaatctacacaacttggagcctctcgcccttacaatttgatgttc4620

acaaagaagcacgaaagtaaggctgggatgagcaacgcacacaagacacaaaatcagagc4680

acaacacgcacacaagtcacaacacgagctcacaacacaacccaaagagttctctactca4740

aatggagctctagttgctatcacaaagaatcgaatacgcggaattggagtcttggtgctt4800

agaaacgcttagagaatgcttggtgtgttcctccatgcgcctaggggtcccttttatagc4860

cccaaggcagctaggagccgttgagagcattccaggaaggcaattcttgccttctgtcgc4920

ctggcgcaccggacagtccggtgcaccaccggacactgtccggtgcggatttctttcctt4980

ctttagcgaagccgaccgttggagattcagagccgttggcgcaccggacagtgtccggtg5040

cacaccggacagtccggtgcccccttctgaccgttggctctgccacgcgtcgcacgcgga5100

ttacgcggccgaccgttggcccggccgactgttggctcaccggatagtccggtgcaccac5160

cggacagtccggtgatttatagccgtacgccgccgacgaaacccgagagcagccagttcg5220

ccagagccagcctggcgcaccagacactgtccggtgcacccagactacgcagtcttggct5280

gcacagccaagtcttttccaaattggtctttttctgtttctagcacttagacacattaca5340

ttagtatccaaaacaatgtactaagtcttagaaacatacctttactcttgatttgcactt5400

tgtccatcatttggcatagattaacacatgaccacttgtgttggcactcaatctccaaaa5460

tacttagaaatggcccaatggcacatttccctttcaatctccccctttttggtgatttat5520

gccaacacaacaaaaagcaactaaaagaagtgcaacatcaatgcaaatgagaacaaaaaa5580

ttgttttgattcaaatttggcatatttggatcattctttgccaccacttggttttgtttt5640

tgcaaatcaacctcaatttcctatctctaagtcaaacacacttgttgaaacataaagaga5700

gttgttccatgagaaattgatcaaagatttcaaaaactcccccttttttccataatcaaa5760

cattctccccacaagagaccaacttttgacagaagagacaataagagaattttgacaaac5820

caaaaagctctattctactattttcaaaattctcaagtggtagctgatccatttattgct5880

ttggccttattttctccccctttggcatcaagcaccagaacgggataaatcttggccctt5940

aaaaccccattgcctcaccaaaatcttcaattaagagtaaaaaggcaataagagcatgaa6000

gatgaacttggagttagttactctttcatcggagtgcagtggaagtctttcatggtccaa6060

gtccaacatttcctttcaatccacctttgagactaaatcaagcaaactcaagcacacagt6120

tagtctcaaggggtcaagttgtagcacaactccccctaaatatgtgcattacttgcaaat6180

ggacttgtgaggtccggggagtgtttgtacaacttgagcaccatacataaacaacaaaat6240

gcataaaggaacatgatcaaggcataaaacacatgtatgctataaatcaatccaagttcc6300

gcgaatctaagacatttagctcactacgcagcctacaaaaggtcttctcatctagaggct6360

tggtaaagatatcggctagctggttctcggtgctaacatgaaacacttcgatatctccct6420

tttgctggtgatctctcaaaaagtgatgccggatgtctatgtgcttagtgcggctgtgtt6480

caacaggattatccgccatgcggatggcactctcattatcacataggagtgggactttgc6540

tcagattgtagccaaagtccttgagggtttgcctcatccaaagtagttgcgcgcaacact6600

gtcctgcggtaacgtactcggcctcagcggtggatagggcaacggaggtttgtttcttag6660

aattccacgacaccagggaccttcctaagaattgacacgtccccgatgtactctttctat6720

cgaccttacatccaacatagtcggagtctgaatatccaatcaagtcaaaggtagacccct6780

ttggataccagatcccgaagcaaggcgtagcgactaaatatctaagaattcgcttcacaa6840

ccactaagtgacactcctttggatcggattgaaatctagcacacatgcatacacttagca6900

taatatctggtctactagcacataaataaagtaaggaccctatcatagaccggtatgcct6960

tttgatcaacggacttacctcctttgttgaggtcggtgtgtccgtcagttcccattgtag7020

tctttgcgggcttggcgttcttcatcccaaactgctttagcagatcttgcgtgtacttcg7080

tttgggagatgaaagtgccgtccttgagttgcttcacttggaacccaaggaagtagttca7140

actcgcccatcatcgacatctcgaatttctgagtcatcaccctgctaaactcttcacaag7200

acttttggttagtagaaccaaatattatgtcatcgacataaatttggcacacaaaaagat7260

caccatcacaagtcttagtgaataaagttggatcggctttcccaaccttgaaagcattag7320

caagtaaaaagtctctaaggcattcataccatgctcttggggcttgcttaagtccataga7380

gggccttagagagcttacacacgtggtcggggtaccgttcatcctcgaagccagggggtt7440

gctccacgtgcacctcctccttgattagcccgttgaggaaagcactcttcacatccattt7500

ggaacaacctgaaggaatggtgagcggcataggctaacaaaatacgaatagactctagcc7560

tagccacaggagcaaaagtctcctcaaagtccaaacctgcgacttgggcatagccttttg7620

ccacaagtctcgccttgttccttgtcaccaccctgtgctcgtcctgtttgttgcggaaca7680

cccacttggttcccacaacgttttgcttgggacgaggcaccagtgtccaaacttcatttc7740

gcttgaagttgatgagctcttcctgcatggccaacacccagtccggatctagcaaggcct7800

cttctaccctgaaaggctcaatagaagagacaaaagagtaatgctcacaaaatttaacta7860

atctagagcgagtagttactcccttgctaatgtcacccaaaatctggtcgacgggatgat7920

tcctttgaatcgtcgctcgaacttgagttgaaggggcttgaggtgcttcttcctccataa7980

catgatcatcttgtgctcccccttgatcacatgcctcctcttgatgaacctgttcatcgt8040

cttgagttgggggatgtaccaatgttgaggaagaaggttgatcttgctccttttgttcct8100

gtggccgcacatctccaatcgtcatggtgcgtattgcggccgttggaatgtcttcttcat8160

ctacatcattaagatcaacaacttgctctcttggagagccattagtctcatcaaatacaa8220

cgtcgctagagacttcaaccaaactcgatgatttgttgaaaaccctatacgcctttgtat8280

ttgagtcataacctaacaaaaacccttctacagctttgggagcaaacttagaatttctac8340

ctttcttcactagaatgtagcatttactcccaaatacacgaaagtatgaaacattgggtt8400

tgttaccggttaggagctcatacgaagtcttcttgaggaggtgatgaaggtagacccggt8460

ttatggcgtggcaagccgtgttcacggcttccgaccaaaatcgctcgggggtcttgaact8520

ctccaagcatcgtcctcgccatgtcaatgagcgtcctatttttcctctctaccacaccat8580

tttgctgtggtgtgtagggagcggagaactcgtgcttgattccttcctcctcaaggtact8640

cttctacttgaagattcttgaactccgacccgttgtcgctccttatcttcttcaccttga8700

gctcaaactcattttgagctctccttaggaagcgcttgagggtcccttgggtttcagatt8760

tatcctgcaaaaagaatacccaagtgaagcgggaaaaatcatcaactataacaagaccat8820

acttacttcctcctaagcttaggtaggcgacgggcctgaagaggtccatatgtagcaact8880

ccaaaggtcttgatgtggtcatcacatttttggtatgatgagagcttcccacttgtttac8940

ctgcttgacaagctgcataaggtctatctttttcgaaagtaacatttgttagtcctatca9000

cgtgttctccctttagaagcttgtgaaggttcttcatccccacatgtgctaagcggcgat9060

gccacagccagcccatgctagtcttagcaattaagcatgcatctagaccggcctcctctt9120

ttgcaaaatcaactaagtagagtttgccgtctaatacacccttaaaagctaatgaaccat9180

cactccttctaaagacggacacatctacatttgtgaataagcaattatatcccatattac9240

aaagctaactcacagacaataagttatatccaagagactcaactaaaaacacattagaaa9300

tagagtgctcggatgaaatggcaatcttccctagtcctttgaccttgccttgattcccgt9360

caccgaatatgattgagtcttgggaatccttgttcttgacgtaggaggtgaacatcttct9420

tctcccccgtcatgtggtttgtgcatccgctgtcgataatccagcttgatcccccggatg9480

cataaacctgcaaggcaaattaggcttgggtcttaggtacccaactcttgttgggtccta9540

caaggttagtacaaatagccttagggacccaaatgcaagttttatctcccttgcattttg9600

cccctaattttctagcaaccaccttcttatcctttctacaaatatcaaaggaagcattta9660

aagcatgataaattgtagaaggttcattacttgttttcctaggtacatgagcatttctcc9720

taggcacatgatgaatgatatttttcctagccaaatttctatcatgcataatagaagaac9780

ttgaagcaaacattgcatttgaatcataagcatgtgaaatgacatcattgcaacttctat9840

catgatgaacattcctggaatatctcctatcatggtataagaaagcatggttcttttgaa9900

tactatttgccataggggccttccctttctccttgatggagataggagccttatgacttg9960

tcaagttcttggcttccctcttgaagccaagcccatccttaattgaggggtgtctaccaa10020

ccgtgtaggcatcccttgcaaattttagtttatcaaaatcatttttgctagtcttaagtt10080

gagcattaagactagccacttcatcttttagtttagaaatgcaaactaggtgttcactac10140

aagcatcaacattgaaatctttacacctattgcaaatcgtaacatgttcttcacgagagg10200

ttaatttactagctatttctaacttagcactcaaatcatcattaacactttttaggctag10260

agatagattcatggcatgtagacaattcacatgaaagcatttcatttcttttaatttcta10320

aagcaagagaattttgtgcttctacaaacttatcatgttcttcatacaaaagatcctctt10380

gcttttctaataatctattcttatcattcaaggcatcaatcaactcattgatcttatcaa10440

tcttagttctatctaagcccttgaacaaactagcatagtctatttcatcatcgctagatt10500

catcatcactagaagcataagtagactttcgagtgtttaccttcttctcctttgccatta10560

agcatgtgtgatgctcgttgggggaagaggaacgacttgttgaaggccgaggcgacgagt10620

ccttcgttgtcggagtcggacgacgaacaatccgagtcccactccttgccaaggtgtgcc10680

tcacccttagccttcttataagccttcttcttttccctcttcttctcttgttcctggtca10740

ctatcattatcgggacaattagcgataaatgaccaatcttaccacatttgaagcatgagc10800

gcttccccttcgtcttgttgggatgctccttgcgaccctttagtgctgtcttgaatcgct10860

tgatgatgagggccatttcttcttcattaagcccgaccgcctcaacttgcgccaccttgc10920

taggtagcgcctccttgctcctcgttgctttgagagccacagtttgaggctcgtggattg10980

ggccattcaacgcatcatcgacgtatcttgcctccttgatcatcatccgcccgcttacga11040

actttccaagtatttcttcgggcgacatcttggtgtacctaggattttcacgaatattgt11100

ttacaagatgtggatcaaggatagtgaaggaccttagcattaggcggacgacgtcgtgat11160

ccgtccatcgcgtgcttccatagcttcttattttgttgacgagggtcttgagccggttgt11220

acgtttgggttggctcttctcccctgatcattgcgaatctcccaagttcgccctccacca11280

actccatcttggtgagcatggtgacatcgttcccctcatgtgagatcttgagggtgtccc11340

agatctgcttggcgttatccaagccgctcaccttatggtattcatccctgcacaatgatg11400

ctagaagaacagtagtagcttgtgcatttttgtgaatttgctcattgataaatatgggac11460

tatcagtactatcaaattgcattccactctctactatctcccatatgcttggatggagag11520

agaacaagtgactacgcattttgtgactccaaaatccgtagtcctctccatcaaagtgag11580

gaggtttaccaagtggaatggagagtaaatgagcatttgtactttgcggaatacgagaat11640

aatcaaaagaaaagtttgaattgactgttttctttttctcgtagttgtcgtcgtcctttt11700

gggaagaagaggactcgtcgctgtcgtcgtagtagacgatctccttgatgcaccttgttt11760

tcttcttcttcctgtcttttcttttgtggctcgagcccgagtcagtaggcttgtcatctt11820

ttggatcattgacgaaggactccttctccttatcattgaccaccatccccttgcccttag11880

gatccatctcttcgggcgattagtcccttacgtgaagagaacgactcagataccaattga11940

gagcacctagaggggggtgaataggtgatcctataaaaacttgaaacttaatgccacaaa12000

acttgattaggagttagcacaataaagccaagtgactagagagttcttgcaagacacgat12060

aaccacacaaagatcaacacagatagacacagtggtttatctcgtggttcggccaagtcc12120

aacacttgcctactccacgttgtggcgtcccaacggacgagggttgcaatcaacccctct12180

caagcggtccaaggacccacttgaataccacagtgttttgcttagtttcactatatcccg12240

cttgcgaggaatctccacaacttgtagcctctcgcccttacaatttgatgttcacaaaga12300

agcacgaaagtaaggctgggatgagcaacgcacacaagacacaaaatcagagcacaacac12360

gcacacaagtcacaactcgagctcacaacacaacccaaagagttctctactcaaatggag12420

ctctagttgctatcacaaagaatcgaatgcgcggaattggagtcttggtgcttaggaacg12480

cttagagaatgctttgtgtgttcctccatacgcctaggggtcccttttatagccccaagg12540

cagctaggagccgttgagagcattccaggaaggcaattcttgccttctgtcgcctggcgc12600

accggacagtccggtgcaccatcggacactgtccggtgcagatttctttccttttttagc12660

gaagccgaccgtcggagattcagagccgttggcgcactggacactgtccgatggacaccg12720

gacagtccggtgcccccttctgaccgttggctctgccacgcgtcgcgcgcggattacgcg12780

gccgaccgttggctcgaccgactgttggctcactgaacagtccggtgcaccaccggacag12840

tccggtgatttatagccgtacgccgccgacgaaacccgagagcagctagttcgctagagc12900

cagtctggcgcaccagacattgtccggtgcaccaccggacagttcggtgcacccagactg12960

cgcagagtcttggctgcacagccaagtcttttccaaattggtcttttcctgtttctagca13020

cttagacacattacattagtctccaaaacaatgtactaagtattagaaacatacctttac13080

tcttgatttgcactttgtccatcatttggcatagattaacacatgaccacttgtgttggc13140

actcaatctccaaaatacttagaaatggcccaagggcacatttccctttcagcggctagc13200

aacaggtccttggtttcttgggttatttattctctttttatcgtgtttgaatgttttcgt13260

gttcatttgcataacatcttaggtctacatcagtatatgaattgagatcaaatgtgaatt13320

ggaccacacaagctcatattcatctacatttgtagtcgtcactaacttagccaaatatgc13380

atatgtccgcttctgatttcattgtgtcttttcttcaggagtttggggatcaaggagagg13440

actccattatcttgtcaccgcgactgaaggagattagtactcctgaccgccccgctgccc13500

tccgtttcctaggtac13516

56

1026

DNA

Artificial Sequence

cDNA SNARE T1

56

gcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcgagg60

gccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacgcct120

gcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaagaa180

gcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctgattcagaaaat240

ggacctggaggcaaggagcctacagcctagcattaaggctagtttgcttgcaaagctgag300

ggagtataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa360

tgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagt420

gagctaatgataggacttgactgtgtctacgagactgctcctaacaataaactgaagaaa480

gcaaaagaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgg540

gaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtc600

tcgcatagccagagcaggagcagctcgcttgcgcggccgcagcgctggcggtcggccccg660

cgtacgagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagc720

agatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatga780

tggagctgcaggaggctttgcttgagagtaatcaggctacaaaggatgccaatgaaattc840

cttctgcctcaaatggagacaatgacgagtggtcagaagtacagagattgcagacaaggt900

aaattttgcaatagaaataactaaccaaccattagtgcttgaaaaaaactggactggtga960

ctggggcacgtggtttcatcaacatttggacctcaacggtctaatcagtataacttagaa1020

gttggc1026

57

874

DNA

Artificial Sequence

cDNA SNARE T1

57

gcgcccggctcttcatcgcctccacccgctccagcgtctccaccacctccttcatcgagg60

gccgactgcgaggctcgccggccaggcagccgagcgtcagttgcgccgcttggaacgcct120

gcttttgttgatcgtttgttttggtctgatttcggtgggtctatccgcagagaggaagaa180

gcagaagctctccgagatccaatccggcgttgaggaagctgaatcgctgattcagaaaat240

ggacctggaggcaaggagcctacagcctagcattaaggctagtttgcttgcaaagctgag300

ggagtataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaa360

tgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagt420

gagctaatgataggacttgactgtgtctacgagactgctcctaacaataaactgaagaaa480

gcaaaagaaatcattcaacgtattcgccgaagagaactctacaaggaagccaggaaccca540

tccgaacaaggttgcaatcatgataagcagatagagcaagcatatgatgatattttgaat600

tcgtcgaagcatactttggccagcatgatggagctgcaggaggctttgcttgagagtaat660

caggctacaaaggatgccaatgaaattccttctgcctcaaatggagacaatgacgagtgg720

tcagaagtacagagattgcagacaaggtaaattttgcaatagaaataactaaccaaccat780

tagtgcttgaaaaaaactggactggtgactggggcacgtggtttcatcaacatttggacc840

tcaacggtctaatcagtataacttagaagttggc874

58

553

DNA

Zea mays

58

cgatgtgcagtggcctgattagctacaagaagctcttgttccatggactcgatctctgga60

ccgcactatcgttgcctcagcccctaggtcatgctgccctctggcctcctcatcgtacaa120

ttcaccaacatctccaatgtaagtgcagctggttcagtaatgaactcagaagtggcatca180

gaatactccaagagttttttgttctttttgcctggatatataccaagggaaatgcattca240

aaactcctatagatgacgaatcccatctctccctcttttctcggacacggatccccaggt300

ccgtctccgtgctttactcatttgttttttacaagttcagatccacttgcgtactcacac360

agtggacatctgttatgcacatgtgtaaaccagcataagaattaggaattatgctcattt420

tatctaagaagtccttacactcgaaaatgcatgtgttatttagcttgagaataaataaaa480

ttattagcaaggagaaaaaaaataggactaaagaatagagtcacattggtttaaattagt540

acctagaagcaaa553

59

527

DNA

Artificial Sequence

cDNA SNARE T2

59

gcttctcgatgtgcagtggcctgattagctacaagaagctcttgttccatggactcgatc60

tctggaccgcactatcgttgcctcagcccctaggtcatgctgccctctggcctcctcatc120

gtacaattcaccaacatctccaatgtaagtgcagctggttcagtaatgaactcagaagtg180

gcatcagaatactccaagagttttttgttctttttgcctggatatataccaagggaaatg240

cattcaaaactcctatagatgacgaatcccatctctccctcttttctcggacacggatcc300

ccaggtccgtctccgtgctttactcatttgttttttacaagttcagatccacttgcgtac360

tcacacagtggacatctgttatgcacatgtgtaaaccagcataagaattaggaattatgc420

tcattttatctaagaagtccttacactcgaaaatgcatgtgttatttagcttgagaataa480

ataaaattattagcaaggagaaaaaaaataggactaaagaatagagt527

60

9062

DNA

Zea mays

60

gttgcgagatgtttatgtgccttagtcttcaatttgggggttgggggaaaagtaatttta60

tgtttttgttttgtgtctgcagattcggaagatggacttggaggcaaggagcctacagcc120

tagcattaaggctggtttgcttgcaaagctgagggagtataaatctgacctcaacaacgt180

caagagtgagctcaagaggatatttgcgcccaatgccaggcaggctacccgggaggagct240

cctagagtttggaatggctgatactctcgctgtgagctaatgctaggacttgactgtgtc300

tacgagactgctcctaacaataaactgaagaaagcaaaagaaatcattcaacgtattcgc360

cgaagagaactctacaaggtagtatgatgctttaattgctcatatacaagtgtcattttg420

tcatgtcattacacatggttaggatacatacttaagtttctaacgtaggcgtccacacaa480

cggattggtgcacggttctgccgatgtatcccacgcacgtgcatggaaggaggcaggcac540

ccttccccgccgccccggatctcgcgccagcccccgccctaccccgcctgcccttccact600

cttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgccgctcgtcgtggcc660

acactgttcgacgagcgagtcacagagctgctgagcgtgctcgctgatgcggcggtgggg720

cgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcggggggcacgaaccag780

gcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccgctccagcgtctcca840

ccacttccttcatcgagggccgactgcttggctcgctggccaggcagccgagcattagtt900

gcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctgatttcagtgggtct960

atccgcagagaggaagaagcagaagctctccgagatccaatccggcgttgaggaagctga1020

atcgctggtaaatagatgtcgcgacgcgttctgttttggggatccccttggctaacggga1080

catacgacatttggggaatgggtagaaaagcagagattagggatttttcgtttccgtcgg1140

tgcagttttggtgttccaacagagttgcgagatgtttatgtgccttagtcttcaatttgg1200

gggttgggggaaaagtaattttatgtttttgttttgtgtctgcagattcagaaaatggac1260

ctggaggcaaggagcctacagcctagcattaaggctggtttgcttgcaaagccgagggat1320

tataaatctgacctcaacaacgtcaagagtgagctcaagaggatatctgcgcccaatgcc1380

aggtaggctacccgggaggagctcgtggagtctagaatggctgatactctcgcagtgagc1440

taatgctaggacttgactgtgtctacgagactgctcctaataataaactgaagaaagcaa1500

aagaaatcattcaacgtattcgccgaagagaactctacaaggtagtatgatgctttaatt1560

gctcatatacaagtgtcattttgtcatgtcattacacatggttaggatacatacttaagt1620

ttctaacgtaggcatccacacaatggattggtgcacggttctgccgatgtatcccacgca1680

cgcgcatggaaggaggcaggcacccttccctgccgccccggatctcgcgccagccatcgc1740

cctaccccgcctgcccttccactcttccccctgaaagtcgcctagagggggggtgaatag1800

ggcgaatctgaaatttacaaacttaagcacaactacaagccgggttaacgttagaaatat1860

aaacgagtccgagagagagggcgcaaaacaaatcatgagcaaataaagagtgagacacga1920

tgatttgttttaccgaggttcggttcttgcaaacctactccccgttgaggtggtcacaaa1980

gaccgggtctctttcaaccctttccctctctcaaacggtcacttagaccgagtgagcttc2040

tcttctcaatcaaacgaaacacaaagttcccgcaaggaccaccacacaattggtgtctct2100

tgccttggttacaattgagtttgatcacaagaagaatgagaaagaaaagaagcgatccaa2160

gcgcaagagctcaaatgaacacaaatgtcgctctctctagtcactatttgatttggagtg2220

attccggacttgggagaggatttgatcttttggagtgtctagaattgaatgctatagctc2280

ttgtaatatgttgaaggtgggaaacttggatgccattgaatgtggggtggttggggtatt2340

tatagccccaaaacaccaaaaaaggccgttggaaggctgctctcgcatggcgcaccggac2400

agtccggtgcgccagccacgtcagcagaccgttggggttcgaccgttggagctctgactt2460

gtggggcctctgggctgtccggtggtgcaccggacaggtcctgtaggatgtctggtgcgc2520

caactgcacgtgctctgtcctctgcgcgcgcaggcgcgcattaaatgcgttgtagtcaac2580

cgttgcgcgcgaagtagccattgctctgctggcacaccggacagtccggtgaattatagc2640

ggagcgccctctgattttcccgaaggtagcgagttcagcttcgagtgccctggtgcaccg2700

gacactgtccggtgcgccaaaccagggtgccttccgggatgtcttttgctctctttgttt2760

gaaccctttcttggtctttttattggcttattgtgaacctttgacacctgtaaaacttat2820

agactagagcaaactagttagtccaattatttgtgttggacaattcaaccaccaaaatca2880

attaggaaataggtgtgagcctaattccctttcaatctccccctttttggtgattgatgc2940

caacacaaaccaaagcaagtatagaagtgcataattgaactagtttgcataatgtaagtg3000

caaaggttacttagaattgaaccaataaatattttcataagttatgcatggattgtttct3060

ttattttcatcattttggaccacgcttgcaccacatgttttgtttttgcaaatccttttg3120

taaatagtcaaaggtaaatgaataagattttgagaagcattttcaaaatttgaaattttc3180

tccccctgtttcaaatgcttttcctttgacttaaacaaaactcccccctcaaaaatccta3240

ctcatagtgttcaagagggttttaagatatcaattttgaaaatgctactttctccccctt3300

ttgaatataataagatatcaattgaaaaattcatcattttaaaaccttttgaaaatgggt3360

ggtggtgcggtccttttgctttgggctaatactttctccccctttggcatgaatcgccaa3420

aaacgaatacttgagtgaaatataagcccctttaactactttctcctgctttggcgaaca3480

taatatgagtgaagattataccaaagttggagagttgcttgaagcgacggtgaaggatga3540

gttatggagtggaggttaagcctttgtcttcgccgaagattccaattccctttcaataca3600

cctatgacttggtttgaaatatacttgaaaacacattagtcatagcacatgaaagagata3660

tgatcaaaggtatattaatgagctatgtatgcaagacatcaaaagaaattcctagaatca3720

agaatatttagctcgtgtctaagtttgttcatctagtggcttggtaaagatatcggctaa3780

ttgttccttagtgttaatataggcaatctcgatatctccctttttttggtgatcccttag3840

gaaatgataccgaatggctatgtgtttagtgcggctatgctcaacgggattatccgccat3900

gcggattgcactctcattatcacatagaagaggaactttggttaatttttaaccatagtc3960

cctaagggtttgcctcatccaaagtaattgtgcgcaacaatggcctgcggcaatatactc4020

ggcttcggcggtagaaagagctacggaattttgcttctttgaagcccaagacaccaggga4080

ccttcccaagaactggcaagtccccgatgtactctttctattaattttacaccccgccca4140

atcggcatccgaataaccaatcaaatcaaatgtggatcccgtaggataccaaagcccaaa4200

cttaggagtatgaactaaatatctcaagattcgttttacggccgtaaggtgagcttcctt4260

agggtcggcttggaatcttgcacacatgcatacggaaagcataatatccggtcgagatgc4320

acataaatagagtaaagagcctatcatcgaccggtataccttttgatcgacggatttacc4380

tcccgtgtcgaggtcgagatgcccattggttcccatgggtgtcttgatgggtttggcatc4440

cttcatcccatacttgtttagaatgtcttgaatgtacttcgtttggctaatgaaggtgcc4500

ctcttagcgttgcttcacttgaaatcacaagaagtacttcaactcccccatcatagacat4560

ctcgaatttctgtgtcatgatcctactaaattcctcacatgtagattcattagtagaccc4620

aaatataatatcatcaacataaatttggcatacaaacaaatcattgtcaagagttttagt4680

aaataaagtaggatcggcctttccgactttgaaaccattagtgataaggaaatctctaag4740

gcattcataccatgctcttggggcttgcttgagcccataaagcgccttagagagtttata4800

tacgtgattagggtactcactatcttcaaagccgggaggttgctcaacatagacctcttc4860

cttgattggtccgttgaggaaggcacttttcacgtccatttgataaagcttgaagccatg4920

gtaagtagcataggcaagtaatatgcgaattgactcaagcctagctacgggtgcataggt4980

ttcaccaaaatccaaaccttcgacttgtgaataacccttggccacaagtcgggctttgtt5040

ccttgtcaccacaccatgctcatcttgtttgttgcggaagacccacttggttcctacaac5100

attttgattaggacgtggaactaagtgacatacctcattcctagtgaagttgttgagctc5160

ctcttgcatcgccaccacccaatctgaatcctgtagtgcttcctctaccctttgtggctc5220

aatagaggaaacaaaagagtaatgttcacaaaaatgagcaacacgagatcgagtagttac5280

ccccttttgaatatcgccgaggatggtgttcacggggtgatctcgttggattgcttggtg5340

gactcttgggtgtggcggtcttggttcttcatcctccttgtcttgatcatttgcatctcc5400

cccttgattattgccgtcatcttgaggtggctcatcttcttgatcttctcctttatcatc5460

ttgagcctcatcctcattttgagttggtggagatgcttgcgtggaggaggatggttgatc5520

ttgtgcatttggaggctctttggattccttaggacacacatccccaatggacatgttcct5580

tagcgcgacgcacggagcctcttcatcacctatctcatcaagatcaacttgctctacttg5640

agagccgttagtttcatcaaacacaatgtcaccagaaacttcaactagtcccgaggactt5700

gttaaagactctatatgcccttgtgtttgaatcataccctagtaaaaagccttctacagc5760

cttaggagcaaatttagattttctacctcttttaacaagaatgaagcatttgctaccaaa5820

gactctaaaatatgaaacattgggcttttaccggttaggagttcatatgatgtcttcttg5880

aggattcggtgtagatataaccggttgatggtgtagcaagcggtgttgaccgcctcgatc5940

caaaaccgatccgaagtcttgtactcatcaagcatggttcttgccatgtccaatagagtt6000

cgattcttcctctccactacaccattttgttgtggggtgtagggagaagagaactcatgc6060

ttgatgccctcctcctcaaggaagccttcgatttgagaattcttgaactccgtcccgttg6120

tcgcttctaatctttttgattcttaagccgaactcattttgagcccatctcaagaatccc6180

tttaaggtctcttgggtatgagatttttcctgcaaaaagaatacccaagtgaagcgagta6240

taatcatccacaataactagatagtacttactcccgccgatgcttatgtaagctatcggg6300

ccgaataggtccatatgtaggagctcgagtggcctgtcagtcatcatgatgttcttgtgt6360

ggatgatgagtaccaacttgcttccctgtctgacatgcgctacaaatcctatctttctca6420

aagtgaacatttgttagtcccaaaatgtgttctccctttagaagcttgtgaagattcttc6480

attccaacatgtgctagtcggtgatgccagagccagcccatgttagtcttagcaattaag6540

caagtgtcgagttcagctctatcaaaatctaccaagtatagctgaccctctaacactccc6600

ttaaatgctactgaatcatcacttcttctaaagacagtaacacctatatctgtaaaaaga6660

cagttgtagcccattttacataattgcgaaactgaaagcaagttgtaatctaaagaatct6720

acaagaaacacattggaaatggaatggtcaggagatatagcaattttacccaatcctttg6780

accaaaccttggtttccatccccgaatgtgatcgctctttggggatcttggtttttctca6840

taggaggagaacattttcttctcccctgtcatatggtttgtgcacccgctatcgatgatc6900

caacttgggcccccggatgcataaacctacaaaacaagtttagttcttgattttaggtac6960

ccaaatggttttgggtcctttgacattagatacaagaactttgggtacccaaacacaagt7020

ctttgatcccttgtgtttgcccccaacatacttggcaactatcttgtcggatttgttagt7080

taaaacataagatgcatcaaaagttttgaatgaaatgttatgatcatttgatgcagcagg7140

agttttcttcttaggcaattttgcacgggttgattgcctagagctagatgtctcaccctt7200

atacataaaagcatgattatggccagagtgagacttcctagaatgaattctcctaatttt7260

gctctcgggataaccggcagggtacaaaatgtaaccctcattatcctgaggcatgggagc7320

cttgcccttaacaaagtttgacaatcttttaggagaggcattaagtttgacattgtttcc7380

cttttggaagccaatgccatccttgatgccagggcgtctcccactatagagcatgcttct7440

agcaaatttaaatttttcattttttaagtcatgctcggcaattttagcatctaattttgc7500

tatatgattattttgttgtttaattaaagccatatgatcatgaatagcatcaatgttaat7560

atctctacatctagtgcaaataatgacatgctcaatggcagatgtagagggtttgcaaga7620

attaagttcaacaatcttagcacgtaaaatatcattgttatttctaagatcagaaatgga7680

agcattgcaaacatctaattctttagccttagcaatcaatttttcattttcaaccctaag7740

gctagcaagagagacattcaattcttcaatcttagcaagcaaattaacattatcatctct7800

aagattgggaattgaaacatcacaaatattagaatcaaccttagcaattagtttagtatt7860

tttatttctaaggatggtaatagtatcatggcaagtgcttagctcactagataatttttc7920

acatttttctacttctagagcataagcatttttaaccttaacatgcttcttattttcctt7980

aattaggaagtcctcttgaaagtccaagagatcatctttctcatgaatagcactaattaa8040

ttcatttagtttttcctgtagttgcatgtttaggttggcaaaaagggtacgcaaattatc8100

ctcctcatcactagcattatcttcatcactagaggatgcatatttagtggaggattttga8160

ttttaccttcttctttttgccgtcctttgccatgaggcacttgtggccgacgttggggaa8220

gaggagccctttggtgacggcgatgttggcggcgtcctcgtcggatgaggagtcggagga8280

actctcgtcggagtcccactcgcggcacacatgggcatcgccgcccttcttcttgtaata8340

cctcctcttttctctcctcttgcccttcttgtcgtcgcccctgtcactatcactagataa8400

aggacatttaacaatgaaatgaccgggcttaccacacttgtagcacacccttttggaaca8460

aggcttgtaatctttccccttcctttgtttgaggatttggtggaagctcttgatgatgag8520

cgccattttctagttgtcgagcttagaggcgtcgatgggttgtctacttgatgtagactc8580

ctctttcttctcctctgtcgccttgaatgcgaccggttgtgcttcgggcgtggagggacc8640

gtcgtgctcgataattttctttgagcctttgatcatcaactcaaagctcacaaagtttcc8700

tattacttcctcgggagtcattagtgtatatctaggattaccacgaattaattgaacttg8760

cgtagggttaaggaacacaagtgatctaagaataaccttaaccatttcatggtcatccca8820

ttttttgctcccgaggttgcgcacttggttcaccaaggtcttgagccggttgtacatatc8880

ttgtggctcctccccttggcgaagccggaagcgaccgagctccccctcgatcgtctcccg8940

cttggtgatcttggttacctcgtctccttcgtgcgcggtctttagcacgtcccagatatc9000

ctttgcactcttcaacccttgcaccttattatactcctctcgacttggatttgaaatgtt9060

gg9062

61

1082

DNA

Artificial Sequence

cdnasnaret3

61

cctgcccttccactcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60

cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120

atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180

ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240

ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300

gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360

atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420

gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480

agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540

aagagtgagctcaagaggatatctgcgcccaatgccagattcggaagatggacctggaag600

caaggagcctacaacctagcattaagagtgagctcaagaggatatctgcgcccattgcca660

ggcaggctacccgggaggagctcctggagtctggaatggctgatactctcgcagtgagct720

aatgctaggacttgactgtgtctacgagactgctcctaacaataaactgaagaaagcaaa780

agaaatcattcaacgtattcgccgaagagaactctacaagatatggtagtcctgggagga840

agagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctctctggtctcgcg900

tagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggccggccccgcgtac960

gagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcatgataagcagata1020

gagcaagcatatgatgatattttgaattcgtcgaagcatactttggccagcatgatggag1080

ct1082

62

1154

DNA

Artificial Sequence

cdnasnaret3

62

cctgcccttccactcttcccccgctgcccccggtcaacgtcacgaacccgggcctcgtgc60

cgctcgtcgtggccacactgttcgacgagcgagtcacagagctgctgagcgtgctcgctg120

atgcggcggtggggcgaccaggcaggtggtccatcggcgaagcgccatggtcgtcgtcgg180

ggggcacgaaccaggcggtgtacgcgcgccgcgcgcccggctcttcatcgcctccacccg240

ctccagcgtctccaccacttccttcatcgagggccgactgcttggctcgctggccaggca300

gccgagcattagttgcgccgcttggaacgcctgcttttgttgatcgtttgttttggtctg360

atttcagtgggtctatccgcagagaggaagaagcagaagctctccgagatccaatccggc420

gttgaggaagctgaatcgctgattcagaaaatggacctggaggcaaggagcctacagcct480

agcattaaggctggtttgcttgcaaagccgagggattataaatctgacctcaacaacgtc540

aagagtgagctcaagaggatatctgcgcccaatgccagactgctcctaataataaactga600

agaaagcaaaagaaatcattcaacgtattcgccgaagagaactctacaagattcggaaga660

tggacctggaagcaaggagcctacaacctagcattaagagtgagctcaagaggatatctg720

cgcccattgccaggcaggctacccgggaggagctcctggagtctggaatggctgatactc780

tcgcagtgagctaatgctaggacttgactgtgtctacgagactgctcctaacaataaact840

gaagaaagcaaaagaaatcattcaacgtattcgccgaagagaactctacaagatatggta900

gtcctgggaggaagagccatcgccgctatgtatggcagaaccacccgcgaaagcaacctc960

tctggtctcgcgtagccagagcaggagcagctcgcttgcgcggtcgcggcgctggcggcc1020

ggccccgcgtacgagcgcctgcaggaagccaggaacccatccgaacaaggttgcaatcat1080

gataagcagatagagcaagcatatgatgatattttgaattcgtcgaagcatactttggcc1140

agcatgatggagct1154

63

107

PRT

Zea mays

63

Met Ile Gly Leu Asp Cys Val Tyr Glu Thr Ala Pro Asn Asn Lys Leu

1               5                   10                  15

Lys Lys Ala Lys Glu Ile Ile Gln Arg Ile Arg Arg Arg Glu Leu Tyr

            20                  25                  30

Lys Glu Ala Arg Asn Pro Ser Glu Gln Gly Cys Asn His Asp Lys Gln

        35                  40                  45

Ile Glu Gln Ala Tyr Asp Asp Ile Leu Asn Ser Ser Lys His Thr Leu

    50                  55                  60

Ala Ser Met Met Glu Leu Gln Glu Ala Leu Leu Glu Ser Asn Gln Ala

65                  70                  75                  80

Thr Lys Asp Ala Asn Glu Ile Pro Ser Ala Ser Asn Gly Asp Asn Asp

                85                  90                  95

Glu Trp Ser Glu Val Gln Arg Leu Gln Thr Arg

            100                 105

64

131

PRT

Zea mays

64

Met Cys Ser Gly Leu Ile Ser Tyr Lys Lys Leu Leu Phe His Gly Leu

1               5                   10                  15

Asp Leu Trp Thr Ala Leu Ser Leu Pro Gln Pro Leu Gly His Ala Ala

            20                  25                  30

Leu Trp Pro Pro His Arg Thr Ile His Gln His Leu Gln Cys Lys Cys

        35                  40                  45

Ser Trp Phe Ser Asn Glu Leu Arg Ser Gly Ile Arg Ile Leu Gln Glu

    50                  55                  60

Phe Phe Val Leu Phe Ala Trp Ile Tyr Thr Lys Gly Asn Ala Phe Lys

65                  70                  75                  80

Thr Pro Ile Asp Asp Glu Ser His Leu Ser Leu Phe Ser Arg Thr Arg

                85                  90                  95

Ile Pro Arg Ser Val Ser Val Leu Tyr Ser Phe Val Phe Tyr Lys Phe

            100                 105                 110

Arg Ser Thr Cys Val Leu Thr Gln Trp Thr Ser Val Met His Met Cys

        115                 120                 125

Lys Pro Ala

    130

65

162

PRT

Zea mays

65

Met Glu Gly Gly Arg His Pro Ser Pro Pro Pro Arg Ile Ser Arg Gln

1               5                   10                  15

Pro Pro Pro Tyr Pro Ala Cys Pro Ser Ile Leu Pro Pro Leu Pro Pro

            20                  25                  30

Val Asn Val Thr Asn Pro Gly Leu Val Pro Leu Val Val Ala Thr Leu

        35                  40                  45

Phe Asp Glu Arg Val Thr Glu Leu Leu Ser Val Leu Ala Asp Ala Ala

    50                  55                  60

Val Gly Arg Pro Gly Arg Trp Ser Ile Gly Glu Ala Pro Trp Ser Ser

65                  70                  75                  80

Ser Gly Gly Thr Asn Gln Ala Val Tyr Ala Arg Arg Ala Pro Gly Ser

                85                  90                  95

Ser Ser Pro Pro Pro Ala Pro Ala Ser Pro Pro Leu Pro Ser Ser Arg

            100                 105                 110

Ala Asp Cys Leu Ala Arg Trp Pro Gly Ser Arg Ala Leu Val Ala Pro

        115                 120                 125

Leu Gly Thr Pro Ala Phe Val Asp Arg Leu Phe Trp Ser Asp Phe Ser

    130                 135                 140

Gly Ser Ile Arg Arg Glu Glu Glu Ala Glu Ala Leu Arg Asp Pro Ile

145                 150                 155                 160

Arg Arg
