本申請(qǐng)要求于2014年3月17日提交的美國(guó)臨時(shí)申請(qǐng)?zhí)?1/954,337的權(quán)益,其全部?jī)?nèi)容以引用方式并入本文中。
背景技術(shù):
糖基化(glycosylation)對(duì)糖蛋白的結(jié)構(gòu)和功能至關(guān)重要。例如,認(rèn)為糖基化可影響蛋白折疊(及由此的穩(wěn)定性)和/或糖蛋白的生物活性。治療性重組糖蛋白的需求,特別是單克隆抗體,在最近的二十年強(qiáng)勁增長(zhǎng)。以前的研究顯示,重組糖蛋白聚糖結(jié)構(gòu)的微小差異可能影響糖蛋白的生物活性和藥代動(dòng)力學(xué)。例如,阿法達(dá)貝泊汀是具有兩個(gè)額外的N-連接糖基化位點(diǎn)的重組人紅細(xì)胞生成素(EPO)的高度糖基化的類似物。與內(nèi)源或重組EPO相比,額外糖基化增加了糖類分子質(zhì)量百分比,并且顯著延長(zhǎng)了阿法達(dá)貝泊汀的血清半衰期。此外,對(duì)于治療性抗體,其功效主要依賴于抗體依賴性細(xì)胞毒性(ADCC),去除Fc部分的核心巖藻糖殘基的化學(xué)-酶促和遺傳方法已被開發(fā),以增加由所述抗體誘導(dǎo)的ADCC效應(yīng)的效力。
但是,目前可用的用于糖基化重塑的方法通常需要多種酶和/或多個(gè)步驟,導(dǎo)致制造糖工程化的重組蛋白成本高。
技術(shù)實(shí)現(xiàn)要素:
本發(fā)明是基于基因工程宿主動(dòng)物細(xì)胞的開發(fā),其能夠生產(chǎn)糖蛋白,如具有包括去巖藻糖基化和單糖基化的改性糖基化的抗體。這樣的宿主動(dòng)物細(xì)胞被工程化以過量表達(dá)一種或多種巖藻糖苷酶、糖苷內(nèi)切酶或二者。出乎意料的是,宿主動(dòng)物細(xì)胞的細(xì)胞糖基化機(jī)制的改變沒有導(dǎo)致與糖蛋白的合成和宿主細(xì)胞的生長(zhǎng)相關(guān)的不利影響。
因此,本發(fā)明提供一種基因工程宿主動(dòng)物細(xì)胞(例如,哺乳動(dòng)物細(xì)胞),其過量表達(dá)巖藻糖苷酶,糖苷內(nèi)切酶或二者,其中與野生型對(duì)應(yīng)物相比,所述宿主動(dòng)物細(xì)胞生產(chǎn)具有改性糖基化的糖蛋白。在一些實(shí)例中,巖藻糖苷酶可以是哺乳動(dòng)物巖藻糖苷酶或細(xì)菌巖藻糖苷酶,例如,人FUCA1,人FUCA2,灰倉(cāng)鼠(Cricetulus griseus)巖藻糖苷酶,α-L-1腦膜膿毒性金黃桿菌(Chryseobacterium meningosepticum),α-1,6-巖藻糖苷酶或細(xì)菌巖藻糖苷酶BF3242??商娲鼗蛄硗獾?,糖苷內(nèi)切酶可以是Endo S酶,例如,包含SEQ ID NO:11所示氨基酸序列的酶。在一些實(shí)例中,基因工程化宿主動(dòng)物細(xì)胞表達(dá)(i)人FUCA1,人FUCA2,灰倉(cāng)鼠巖藻糖苷酶α-L-1,腦膜膿毒性金黃桿菌α-1,6-巖藻糖苷酶或細(xì)菌巖藻糖苷酶BF3242,和(ii)Endo S(如SEQ ID NO:11)。
本文中所描述的基因工程宿主動(dòng)物細(xì)胞可以進(jìn)一步表達(dá)糖蛋白,所述糖蛋白可以是外源性的(在相同類型的天然動(dòng)物細(xì)胞中不表達(dá))。實(shí)例包括:但不限于抗體、Fc融合蛋白、細(xì)胞因子、激素、生長(zhǎng)因子或酶。
在一些實(shí)例中,基因工程宿主動(dòng)物細(xì)胞是哺乳動(dòng)物細(xì)胞,例如中國(guó)倉(cāng)鼠卵巢(CHO)細(xì)胞,大鼠骨髓瘤細(xì)胞,幼倉(cāng)鼠腎(BHK)細(xì)胞,雜交瘤細(xì)胞,拿馬瓦(Namalwa)細(xì)胞,胚胎干細(xì)胞或受精卵。
本文還描述了用于生產(chǎn)具有改性糖基化模式(例如,去巖藻糖基化或單糖基化)的糖蛋白的方法,所述方法使用任何本文所述的基因工程宿主動(dòng)物細(xì)胞。所述方法可以包括(i)提供表達(dá)(a)糖蛋白和(b)巖藻糖苷酶、糖苷內(nèi)切酶或二者的宿主動(dòng)物細(xì)胞;在允許用于生產(chǎn)糖蛋白和巖藻糖苷酶、糖苷內(nèi)切酶或二者的條件下,培養(yǎng)所述宿主動(dòng)物細(xì)胞;(ⅱ)收集所述宿主動(dòng)物細(xì)胞或培養(yǎng)上清液以便分離糖蛋白,和任選地(iii)分離所述糖蛋白。所述方法可以進(jìn)一步包括(ⅳ)分析所述糖蛋白的糖基化模式。
此外,本發(fā)明特征在于一種用于制備本文描述的任一基因工程動(dòng)物細(xì)胞的方法。所述方法可以包括:(i)將一種或多種表達(dá)載體引入動(dòng)物細(xì)胞,所述表達(dá)載體共同地編碼巖藻糖苷酶、糖苷內(nèi)切酶或二者,以及任選地(ii)將編碼糖蛋白的表達(dá)載體引入所述動(dòng)物細(xì)胞。所述方法可以進(jìn)一步包括分選表達(dá)巖藻糖苷酶、糖苷內(nèi)切酶、和糖蛋白的轉(zhuǎn)化細(xì)胞。
在下面的描述中提出本發(fā)明中一或多個(gè)具體實(shí)施方式的細(xì)節(jié)。從以下幾個(gè)具體實(shí)施方式的詳細(xì)說明和圖示以及所附的權(quán)利要求書,本發(fā)明的其它特征或優(yōu)點(diǎn)將是顯而易見的。
附圖說明
圖1為各種不同的N-聚糖的結(jié)構(gòu)示意圖,A:糖蛋白的典型N連接聚糖,B:去巖藻糖基化的N-聚糖,C:具有單N-乙酰葡糖胺(Nacetylglucosamine)還原糖(單糖基化)的N-聚糖。
圖2為用于生產(chǎn)巖藻糖苷酶的示例性表達(dá)載體的示意圖,A:攜帶巖藻糖苷酶基因的示例性質(zhì)粒的圖譜,B:用于表達(dá)巖藻糖苷酶或糖苷內(nèi)切酶S的示例性表達(dá)盒。
圖3包括表示通過在工程化以表達(dá)巖藻糖苷酶和/或糖苷內(nèi)切酶的宿主細(xì)胞內(nèi)抗體的瞬時(shí)表達(dá)生產(chǎn)均勻去巖藻糖基化(afucosylated)單糖(GlcNAc)抗體h4B12的示意圖,A:為生產(chǎn)均勻去巖藻糖基化單糖(GlcNAc)抗體的示意圖,B:顯示在工程化宿主細(xì)胞中產(chǎn)生抗體4B12糖基化的示意圖,所述宿主細(xì)胞經(jīng)工程化以表達(dá)人FUCA1,人FUCA2,灰倉(cāng)鼠巖藻糖苷酶α-L-1,或腦膜膿毒性金黃桿菌(C.meningosepticum)α-1,6-巖藻糖苷酶,通過LC/MS/MS測(cè)定,C:為通過Western印跡檢測(cè)到的CHO細(xì)胞內(nèi)巖藻糖苷酶或糖苷內(nèi)切酶的瞬時(shí)表達(dá)的示意圖,D:表示在如上文所示的表達(dá)巖藻糖苷酶或糖苷內(nèi)切酶的CHO細(xì)胞中生產(chǎn)抗體h4B12的圖,通過LC/MS/MS測(cè)定,所述抗體具有均勻的無巖藻糖的單糖(GlcNAc)糖型。均勻N-聚糖是指糖蛋白例如抗體中N-聚糖(例如,去巖藻糖基化N-聚糖)相對(duì)于總N-聚糖的比率。
圖4包括示出生產(chǎn)均勻去巖藻糖基化單糖(GlcNAc)抗體利妥昔單抗的示意圖,通過在工程化以表達(dá)巖藻糖苷酶和/或糖苷內(nèi)切酶的宿主細(xì)胞中瞬時(shí)表達(dá)所述抗體,A:表示在如上文所述的工程化以表達(dá)各種不同的巖藻糖苷酶或糖苷內(nèi)切酶的宿主細(xì)胞中生產(chǎn)的抗體利妥昔單抗的糖基化,B:表示通過Western印跡測(cè)定,在如上文所述的CHO細(xì)胞中瞬時(shí)表達(dá)巖藻糖苷酶(左)或糖苷內(nèi)切酶(右)。
具體實(shí)施方式
本文公開了基因工程宿主動(dòng)物細(xì)胞例如哺乳動(dòng)物細(xì)胞,其能夠生產(chǎn)具有改性糖基化模式(例如改性N-糖基化模式如去巖藻糖基化N-聚糖或單糖聚糖)的糖蛋白(例如外源糖蛋白如抗體)。這種宿主動(dòng)物細(xì)胞可以被工程化以過量表達(dá)巖藻糖苷酶,糖苷內(nèi)切酶或二者。任選地,宿主動(dòng)物細(xì)胞也可被改造以表達(dá)外源糖蛋白例如抗體。
圖1中A表示野生型哺乳動(dòng)物細(xì)胞中產(chǎn)生的糖蛋白的典型復(fù)雜N-聚糖的結(jié)構(gòu)。這種復(fù)雜N-聚糖包含連接至糖基化位點(diǎn)的N-乙?;咸烟前?GlcNAc)殘基,糖蛋白的天冬酰胺(Asn)殘基,以及巖藻糖殘基,其以α1,6鍵連接至所述Asn殘基。所述基因工程宿主動(dòng)物細(xì)胞能夠產(chǎn)生糖蛋白(如內(nèi)源或外源),所述糖蛋白具有改性N-聚糖,如去巖藻糖化N-聚糖,其中一個(gè)實(shí)施例顯示于圖1中B,以及單糖N-聚糖,其中僅GlcNAc殘基連接至Asn糖基化位點(diǎn)(圖1中C)。具有改性糖基化的糖蛋白是指糖蛋白攜帶至少一種聚糖例如N-聚糖,其結(jié)構(gòu)上與所述基因工程宿主動(dòng)物細(xì)胞的野生型對(duì)應(yīng)物產(chǎn)生的糖蛋白的聚糖不同。去巖藻糖化聚糖是指不含α1,6巖藻糖殘基或任何巖藻糖殘基的任何聚糖。
A.巖藻糖苷酶
巖藻糖苷酶是一種能夠分解巖藻糖的酶。這種酶從包含巖藻糖殘基的聚糖切割巖藻糖殘基。用于制備基因工程宿主動(dòng)物細(xì)胞的巖藻糖苷酶可以是哺乳動(dòng)物巖藻糖苷酶或細(xì)菌巖藻糖苷酶。在一些實(shí)施方式中,所述巖藻糖苷酶是野生型酶,如野生型細(xì)菌酶或野生型哺乳動(dòng)物酶例如人類酶。多種示例性巖藻糖苷酶的氨基酸序列及其編碼核苷酸序列提供如下(包括C端的His標(biāo)簽)。
人類巖藻糖苷酶FUCA1:
氨基酸序列(SEQ ID NO:1)
MRVPAQLLGLLLLWLPGARCQPPRRYTPDWPSLDSRPLPAWFDEAKFGVFIHWGVFSVPAWGSEWFWWHWQGEGRPQYQRFMRDNYPPGFSYADFGPQFTARFFHPEEWADLFQAAGAKYVVLTTKHHEGFTNWPSPVSWNWNSKDVGPHRDLVGELGTALRKRNIRYGLYHSLLEWFHPLYLLDKKNGFKTQHFVSAKTMPELYDLVNSYKPDLIWSDGEWECPDTYWNSTNFLSWLYNDSPVKDEVVVNDRWGQNCSCHHGGYYNCEDKFKPQSLPDHKWEMCTSIDKFSWGYRRDMALSDVTEESEIISELVQTVSLGGNYLLNIGPTKDGLIVPIFQERLLAVGKWLSINGEAIYASKPWRVQWEKNTTSVWYTSKGSAVYAIFLHWPENGVLNLESPITTSTTKITMLGIQGDLKWSTDPDKGLFISLPQLPPSAVPAEFAWTIKLTGVKHHHHHH
核苷酸序列(SEQ ID NO:2;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggactgctgctgctgtggctgcctggtgctagatgccagccccctcggagatacacccctgactggccttccctggactccagacctctgcccgcttggtttgacgaggccaagttcggcgtgttcatccactggggcgtgttctccgtgcctgcctggggctctgagtggttctggtggcattggcagggcgagggcagacctcagtaccagcggttcatgcgggacaactacccccctggcttctcctacgccgacttcggccctcagttcaccgcccggttcttccaccctgaggaatgggccgatctgttccaggccgctggcgccaaatacgtggtgctgaccaccaagcaccacgagggcttcaccaactggccctcccccgtgtcctggaactggaactctaaggacgtgggcccccaccgggatctcgtgggagaactgggaaccgccctgcggaagcggaacatcagatacggcctgtaccactccctgctggaatggttccaccccctgtacctgctggacaagaagaacggcttcaagacccagcacttcgtgtccgccaagaccatgcccgagctgtacgacctcgtgaactcctacaagcccgacctgatttggagcgacggcgagtgggagtgccccgacacctattggaactccaccaactttctgtcctggctgtacaacgactcccctgtgaaggacgaggtggtcgtgaacgacagatggggccagaactgctcctgtcaccacggcggctactacaactgcgaggacaagttcaagccccagtccctgcccgaccacaagtgggagatgtgcacctctatcgacaagttctcctggggctaccggcgggacatggccctgtctgatgtgaccgaggaatccgagatcatctccgagctggtgcagaccgtgtccctgggcggcaactacctgctgaacatcggccctaccaaggacggcctgatcgtgcccatcttccaggaacggctgctggccgtgggcaagtggctgtctatcaacggcgaggccatctacgcctccaagccttggcgagtgcagtgggagaagaacaccacctccgtgtggtacacctccaagggctctgccgtgtacgccatcttcctgcactggcccgagaacggcgtgctgaacctggaatcccccatcaccacctctaccaccaagatcaccatgctgggcatccagggcgacctgaagtggtccaccgaccctgacaagggcctgttcatctccctgccccagctgcctccttccgctgtgcctgctgagttcgcctggaccatcaagctgaccggcgtgaagcaccaccaccatcaccattga
人類巖藻糖苷酶FUCA2:
氨基酸序列(SEQ ID NO:3)
MRVPAQLLGLLLLWLPGARCHSATRFDPTWESLDARQLPAWFDQAKFGIFIHWGVFSVPSFGSEWFWWYWQKEKIPKYVEFMKDNYPPSFKYEDFGPLFTAKFFNANQWADIFQASGAKYIVLTSKHHEGFTLWGSEYSWNWNAIDEGPKRDIVKELEVAIRNRTDLRFGLYYSLFEWFHPLFLEDESSSFHKRQFPVSKTLPELYELVNNYQPEVLWSDGDGGAPDQYWNSTGFLAWLYNESPVRGTVVTNDRWGAGSICKHGGFYTCSDRYNPGHLLPHKWENCMTIDKLSWGYRREAGISDYLTIEELVKQLVETVSCGGNLLMNIGPTLDGTISVVFEERLRQMGSWLKVNGEAIYETHTWRSQNDTVTPDVWYTSKPKEKLVYAIFLKWPTSGQLFLGHPKAILGATEVKLLGHGQPLNWISLEQNGIMVELPQLTIHQMPCKWGWALALTNVIHHHHHH
核苷酸序列(SEQ ID NO:4;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggactgctgctgctgtggctgcctggcgctagatgccactccgccaccagattcgaccccacctgggagtctctggacgccagacagctgcccgcttggtttgaccaggccaagttcggcatcttcatccactggggcgtgttctccgtgcccagcttcggctctgagtggttctggtggtactggcagaaagagaagatccccaaatacgtggagttcatgaaggacaactacccccccagctttaagtacgaggacttcggccccctgttcaccgccaagttcttcaacgccaaccagtgggccgacatcttccaggcctctggcgccaagtacatcgtgctgacctccaagcaccacgagggcttcaccctgtggggctccgagtactcctggaactggaacgccatcgacgagggccccaagcgggacatcgtgaaagaactggaagtggccatccggaaccggaccgacctgagattcggcctgtactactccctgttcgagtggttccaccccctgtttctggaagatgagtcctccagcttccacaagcggcagttccccgtgtccaagaccctgcccgagctgtacgagctcgtgaacaactaccagcccgaggtgctgtggagtgacggggatggtggtgcccccgatcagtactggaactctaccggcttcctggcctggctgtacaacgagtctcctgtgcggggcaccgtcgtgaccaacgatagatggggcgctggctccatctgcaagcacggcggcttctacacctgttccgaccggtacaaccccggccatctgctgcctcacaagtgggagaactgcatgaccatcgacaagctgtcctggggctacagaagagaggccggcatctccgactacctgacaatcgaggaactcgtgaagcagctggtggaaaccgtgtcctgcggcggcaacctgctgatgaacatcggccctaccctggacggcaccatctccgtggtgttcgaggaacggctgcggcagatgggctcctggctgaaagtgaacggcgaggccatctacgagacacacacctggcggtcccagaacgacaccgtgacccctgacgtgtggtacaccagcaagcccaaagaaaagctggtgtatgccatcttcctgaagtggcctacctccggccagctgttcctgggccaccctaaggctatcctgggcgccaccgaagtgaaactgctgggccatggacagcccctgaactggatctccctggaacagaacggcatcatggtggaactgccccagctgaccatccatcagatgccctgcaaatggggctgggccctggccctgaccaacgtgatccaccatcaccaccaccactga
灰倉(cāng)鼠(中國(guó)倉(cāng)鼠)巖藻糖苷酶FUCA2
氨基酸序列(SEQ ID NO:5)
MRVPAQLLGLLLLWLPGARCKSSRRYDPTWESLDRRPLPSWFDQAKFGIFIHWGVFSVPSFGSEWFWWYWQKEKRPKFVDFMNNNYPPGFKYEDFGVLFTAKFFNASQWADILQASGAKYLVLTSKHHEGFTLWGSEYSWNWNAVDEGPKRDIVKELKVAITKNTDLRFGLYYSLFEWFHPLFLEDKLSSFQKRQFPISKMLPELYELVNKYQPDILWTDGDGGAPDRYWNSTGFLAWLYNESPVRNTVVTNDRWGAGSICKHGGYYTCSDRYNPGHLLPHKWENCMTIDQFSWGYRREAVISDYLTIEELVKQLVETVACGGNLLMNIGPTLDGIIPVIFEERLRQMGMWLKVNGEAIYETQPWRSQNDTATPDVWYTYKPEEKIVYAIFLKWPVSRELFLEQPIGSLGETEVALLGEGKPLTWTSLKPNGIIVELPQLTLHQMPCKWGWTLALTNVTHHHHHH
核苷酸序列(SEQ ID NO:6;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggactgctgctgctgtggctgcctggcgctagatgcaagtcctctcggagatacgaccccacctgggagtccctggacagaaggcctctgcccagttggttcgaccaggccaagttcggcatcttcatccactggggcgtgttctccgtgcccagcttcggctctgagtggttctggtggtactggcagaaagagaagcggcccaagttcgtggacttcatgaacaacaactacccccctggctttaagtacgaggacttcggcgtgctgttcaccgccaagttcttcaacgcctcccagtgggccgacatcctgcaggcttccggcgctaagtacctggtgctgacctccaagcaccacgagggctttaccctgtggggctccgagtactcctggaactggaacgccgtggacgagggccctaagcgggacatcgtgaaagaactgaaggtggccatcaccaagaacaccgacctgagattcggcctgtactactccctgttcgagtggttccaccccctgtttctggaagataagctgtccagcttccagaagcggcagttccccatctccaagatgctgcccgagctgtacgagctcgtgaacaagtaccagcctgacatcctgtggaccgacggggatggtggcgcccctgacagatactggaactctaccggcttcctggcctggctgtacaacgagtcccctgtgcggaacaccgtcgtgaccaacgacagatggggcgctggctccatctgcaagcacggcggctactacacctgttccgaccggtacaaccccggccatctgctgcctcacaagtgggagaactgcatgacaatcgaccagttctcctggggctaccggcgcgaggccgtgatctctgactacctgaccatcgaggaactcgtgaagcagctggtggaaaccgtggcctgtggcggcaacctgctgatgaacatcggccctaccctggacggcatcatccccgtgatcttcgaggaacggctgcggcagatgggcatgtggctgaaagtgaacggcgaggccatctacgagacacagccttggcggtcccagaacgacaccgccacacctgacgtgtggtacacctacaagcccgaagagaagatcgtgtacgccatcttcctgaagtggcccgtgtccagagagctgtttctggaacagcccatcggctccctgggcgagacagaagtggctctgctgggcgagggcaagcctctgacctggacctccctgaagcccaatggcatcatcgtggaactgccccagctgaccctgcaccagatgccctgtaaatggggctggaccctggccctgaccaacgtgacccaccaccaccatcaccactga
腦膜膿毒素金黃桿菌α1,6-巖藻糖苷酶
氨基酸序列(SEQ ID NO:7)
MRVPAQLLGLLLLWLPGARCHNVSEGYEKPADPLVVQNLEQWQDLKFGLFMHWGTYSQWGIVESWSLCPEDESWTQRKPEHGKSYNEYVKNYENLQTTFNPVQFNPQKWADATKKAGMKYVVFTTKHHDGFAMFDTKQSDYKITSSKTPFSKNPKADVAKEIFNTFRDNGFRIGAYFSKPDWHSDDYWWSYFPPKDRNVNYDPQKYPARWENFKKFTFNQLNEITSNYGKIDILWLDGGWVRPFHTIDPNIEWQRTIKVEQDIDMDKIGTMARKNQPGIIIVDRTVPGKWENYVTPEQAVPEHALSIPWESCITMGDSFSYVPNDNYKSSQKIIETLIRIISRGGNYLMNIAPGPNGDYDAVVYERLKEISGWMDKNQSAVFTTRALAPYHESDFYYTQSKDGKIVNVFHISEKSNYQAPSELSFSIPENINPKTVKVLGISSQIKWKKKGNKIHVQLPEERTKLNYSTVIQITQHHHHHH
核苷酸序列(SEQ ID NO:8;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggactgctgctgctgtggctgcctggcgctagatgccacaatgtgtccgagggctacgagaagcccgccgaccctctggtggtgcagaacctggaacagtggcaggacctgaagttcggcctgttcatgcactggggcacctactcccagtggggcatcgtggaatcctggtccctgtgccctgaggacgagtcttggacccagcggaagcctgagcacggcaagtcctacaacgagtacgtgaagaactacgagaacctgcagaccaccttcaaccccgtgcagttcaacccccagaagtgggccgacgccaccaagaaagccggcatgaaatacgtggtgttcaccaccaagcaccacgacggcttcgccatgttcgacaccaagcagtccgactacaagatcacctcctccaagacccccttcagcaagaaccccaaggccgacgtggccaaagagattttcaacaccttccgggacaacggcttccggatcggcgcctacttctccaagcctgactggcactccgacgactactggtggtcctacttcccacccaaggaccggaacgtgaactacgaccctcagaaataccccgccagatgggagaacttcaagaagttcaccttcaatcagctgaacgagatcaccagcaactacggcaagatcgacatcctgtggctggacggcggatgggtgcgacccttccacaccatcgaccccaacatcgagtggcagcggaccatcaaggtggaacaggacatcgacatggacaagatcggcaccatggcccggaagaaccagcccggcatcatcatcgtggaccggaccgtgcctggcaagtgggagaattacgtgacccccgagcaggccgtgcctgagcatgccctgtctatcccttgggagtcctgtatcacaatgggcgacagcttctcctacgtgcccaacgacaactacaagtcctcccagaagatcatcgagacactgatcaggatcatctccagaggcggcaactacctgatgaatatcgcccctggccccaacggcgactacgacgctgtggtgtacgagcggctgaaagaaatctccggctggatggataagaaccagtccgccgtgtttaccacccgggctctggccccttaccacgagtccgacttctactacacccagtccaaggacggaaagatcgtgaacgtgttccacatctccgagaagtccaactaccaggccccctccgagctgtccttcagcatccccgagaacatcaaccccaagaccgtgaaggtgctgggcatctccagccagatcaagtggaagaagaagggcaacaagatccacgtgcagctgcccgaggaacggaccaagctgaactactccaccgtgatccagatcacccagcaccaccaccatcaccactga
細(xì)菌巖藻糖苷酶BF3242
氨基酸序列(SEQ ID NO:9)
MRVPAQLLGLLLLWLPGARCQQKYQPTEANLKARSEFQDNKFGIFLHWGLYAMLATGEWTMTNNNLNYKEYAKLAGGFYPSKFDADKWVAAIKASGAKYICFTTRHHEGFSMFDTKYSDYNIVKATPFKRDVVKELADACAKHGIKLHFYYSHIDWYREDAPQGRTGRRTGRPNPKGDWKSYYQFMNNQLTELLTNYGPIGAIWFDGWWDQDINPDFDWELPEQYALIHRLQPACLVGNNHHQTPFAGEDIQIFERDLPGENTAGLSGQSVSHLPLETCETMNGMWGYKITDQNYKSTKTLIHYLVKAAGKDANLLMNIGPQPDGELPEVAVQRLKEVGEWMSKYGETIYGTRGGLVAPHDWGVTTQKGNKLYVHILNLQDKALFLPIVDKKVKKAVVFADKTPVRFTKNKEGIVLELAKVPTDVDYVVELTIDHHHHHH
核苷酸序列(SEQ ID NO:10;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggactgctgctgctgtggctgcctggtgctagatgccagcagaagtaccagcccaccgaggccaacctgaaggccagatccgagttccaggacaacaagttcggcatcttcctgcactggggcctgtacgccatgctggctactggcgagtggaccatgaccaacaacaacctgaactacaaagagtacgctaagctggctggcggcttctacccctccaagttcgacgccgacaaatgggtggccgccatcaaggcctctggcgccaagtacatctgcttcaccacccggcaccacgagggcttctccatgttcgacaccaagtactccgactacaacatcgtgaaggccacccccttcaagcgggacgtcgtgaaagagctggccgacgcctgcgctaagcacggcatcaagctgcacttctactactcccacatcgactggtacagagaggacgccccccagggcagaaccggcagaagaacaggcagacccaaccccaagggcgactggaagtcctactaccagtttatgaacaaccagctgaccgagctgctgaccaactacggccccatcggcgccatttggttcgacgggtggtgggaccaggacatcaaccccgacttcgactgggagctgcccgagcagtacgccctgatccacagactgcagcccgcctgtctcgtgggcaacaaccaccaccagaccccctttgccggcgaggacatccagattttcgagcgggatctgcccggcgagaacaccgctggactgtctggccagtccgtgtcccatctgcccctggaaacctgcgagacaatgaacggcatgtggggctacaagatcaccgaccagaactacaagtccaccaagacactgatccactacctcgtgaaagccgctggcaaggacgccaacctgctgatgaacatcggcccccagcctgacggcgagctgcctgaagtggctgtgcagcggctgaaagaagtgggagagtggatgtctaagtacggcgagactatctacggcaccagaggcggcctggtggcccctcatgattggggcgtgaccacccagaagggcaacaagctgtacgtgcacatcctgaacctgcaggacaaggccctgttcctgcccatcgtggacaagaaagtgaagaaagccgtggtgttcgccgacaagacccccgtgcggttcaccaagaacaaagagggcatcgtgctggaactggccaaggtgcccaccgacgtggactacgtggtggaactgaccatcgaccaccatcatcaccaccactga
在一些實(shí)施方式中,巖藻糖苷酶可為相較于上述提供示例性的巖藻糖苷酶(例如SEQ ID NO:1,SEQ ID NO:3,SEQ ID NO:5,SEQ ID NO:7或SEQ ID NO:9,以及本文所述的其它巖藻糖苷酶)任一者,具有至少85%(例如90%、93%、95%、96%、97%、98%或99%)序列相同性的酶(如野生型酶)。
兩種氨基酸序列“百分比相同性”使用Karlin和Altschul Proc.Natl.Acad.Sci.USA 87:2264-68,1990的算法確定,其如Karlin和Altschul Proc.Natl.Acad.Sci.USA 90:5873-77,1993描述修正。這種算法被并入Altschul等人.J.Mol.Biol.215:403-10,1990中的NBLAST和XBLAST程序(版本2.0)。BLAST蛋白質(zhì)搜索可以采用XBLAST程序進(jìn)行,其中評(píng)分(score)為50、字長(zhǎng)(wavelength)為3,以獲取與本發(fā)明蛋白質(zhì)分子同源的氨基酸序列。若二序列間存在間隙(gap),可使用Altschul等人,核酸研究(Nucleic Acids Res).25(17):3389-3402,1997所描述的Gapped BLAST。采用BLAST和Gapped BLAST程序時(shí),可以使用各程序(例如BLAST和Gapped BLAST)默認(rèn)的參數(shù)。
可用于構(gòu)建基因工程宿主動(dòng)物細(xì)胞的哺乳動(dòng)物巖藻糖苷酶包括但不限于:以GenBank登錄號(hào)NP_114409.2,XP_003811598.1,AAH03060.1,EHH53333.1,XP_001127152.1,XP_010360962.1,XP_006084558.1,XP_004263802.1,XP_007171384.1,XP_006075254.1,XP_010982011.1,NP_001004218.1和XP_010964137.1所披露的那些。
可用于構(gòu)建基因工程宿主動(dòng)物細(xì)胞的細(xì)菌糖苷酶包括但不限于:以GenBank登錄號(hào)WP_008769537.1,WP_032568292.1,EYA08300.1,WP_005780841.1,EXY26528.1,WP_044654435.1,WP_029425671.1,WP_022470316.1,CDA84816.1,WP_004307183.1和WP_008025871.1所披露的那些。
B.糖苷內(nèi)切酶
糖苷內(nèi)切酶是指能夠破壞聚糖中2個(gè)單糖間糖苷鍵的酶,從而從糖蛋白或糖酯中釋放寡糖。本發(fā)明披露中使用的糖苷內(nèi)切酶(例如野生型酶)包括糖苷內(nèi)切酶D、糖苷內(nèi)切酶F、糖苷內(nèi)切酶F1、糖苷內(nèi)切酶F2、糖苷內(nèi)切酶H、和糖苷內(nèi)切酶S。各亞屬示例性糖苷內(nèi)切酶列于下表:
其它適用的糖苷內(nèi)切酶包括與本文所述的酶具有至少85%(如90%,95%,98%,或99%)序列相同性的那些。與上述所列的糖苷內(nèi)切酶任一者具有高度序列同源性(如至少85%序列相同性)的酶被預(yù)期具有相同的生物活性。此類酶(例如野生型)可以使用以上所列酶之一從基因數(shù)據(jù)庫(kù)例如GenBank中查詢獲得。
在一些實(shí)施方式中,本文所述的糖苷內(nèi)切酶為Endo S酶。Endo S是一種糖苷內(nèi)切酶,其特異性切割連接至原始IgG分子Fc結(jié)構(gòu)域Asn糖基化位點(diǎn)的第一個(gè)GlcNAc殘基的N-聚糖,產(chǎn)生單糖基化的IgG分子,即具有單一GlcNAc連接至Asn糖基化位點(diǎn)的IgG分子。氨基酸序列以及編碼核苷酸序列提供如下:
Endo S氨基酸序列(SEQ ID NO:11)
MRVPAQLLGLLLLWLPGARCAQHDSLIRVKAEDKVVQTSPSVSAIDDLHYLSENSKKEFKEGLSKAGEVPEKLKDILSKAQQADKQAKVLAEMKVPEKIAMKPLKGPLYGGYFRTWHDKTSDPAEKDKVNSMGELPKEVDLAFVFHDWTKDYSLFWQELATKHVPTLNKQGTRVIRTIPWRFLAGGDHSGIAEDTQKYPNTPEGNKALAKAIVDEYVYKYNLDGLDVDIERDSIPKVNGKESNENIQRSIAVFEEIGKLIGPKGADKSRLFIMDSTYMADKNPLIERGAPYIDLLLVQVYGIQGEKGDWDPVARKPEKTMEERWESYSKYIRPEQYMVGFSFYEENAGSGNLWYDINERKDDHNPLNSEIAGTRAERYAKWQPKTGGVKGGIFSYAIDRDGVAHQPKKVSDDEKRTNKAIKDITDGIVKSDYKVSKALKKVMENDKSYELIDQKDFPDKALREAVIAQVGSRRGDLERFNGTLRLDNPDIKSLEGLNKLKKLAKLELIGLSQITKLDSSVLPENIKPTKDTLVSVLETYKNDDRKEEAKAIPQVALTISGLTGLKELNLAGFDRDSLAGIDAASLTSLEKVDLSKNKLDLAAGTENRQIFDVMLSTVSNRVGSNEQTVTFDHQKPTGHYPNTYGTTSLRLPVGEGKIDLQSQLLFGTVTNQGTLINSEADYKAYQEQLIAGRRFVDPGYAYKNFAVTYDAYKVRVTDSTLGVTDEKKLSTSKEETYKVEFFSPTNGTKPVHEAKVVVGAEKTMMVNLAAGATVIKSDSHENAKKVFDGAIEYNPLSFSSKTSITFEFKEPGLVKYWRFFNDITRKDDYIKEAKLEAFVGHLEDDSKVKDSLEKSTEWVTVSDYSGEAQEFSQPLDNISAKYWRVTVDTKGGRYSSPSLPELQILGYRLPLTHDYKDDDDK
Endo S核苷酸序列(SEQ ID NO:12;密碼子經(jīng)優(yōu)化)
atgagagtgcctgctcagctgctgggcctgctgctgctgtggctgcctggtgctagatgcgcccagcacgactccctgatcagagtgaaggccgaggacaaggtggtgcagacctccccttccgtgtccgccatcgacgacctgcactacctgtccgagaactccaagaaagagttcaaagagggcctgtccaaggccggcgaggtgcccgaaaagctgaaggacatcctgagcaaggctcagcaggccgacaagcaggccaaggtgctggccgagatgaaggtgccagagaagatcgccatgaagcccctgaagggccctctgtacggcggctacttcagaacctggcacgacaagacctccgaccccgccgagaaggacaaagtgaactccatgggcgagctgcccaaagaggtggacctggccttcgtgttccacgactggaccaaggactactccctgttctggcaggaactggccaccaagcacgtgcccaccctgaacaagcagggcaccagagtgatccggacaatcccctggcggtttctggctggcggcgaccactctggaatcgccgaggatacccagaagtaccccaacacccccgagggcaacaaggccctggctaaggccatcgtggacgagtacgtgtacaagtacaacctggacggcctggacgtggacatcgagcgggactccatccctaaagtgaacggcaaagagtccaacgagaacatccagcggtctatcgccgtgttcgaggaaatcggcaagctgatcggccccaagggcgccgacaagtcccggctgttcatcatggactccacctacatggccgataagaaccccctgatcgagagaggcgccccttacatcgatctgctgctggtgcaggtgtacggcatccagggcgagaagggcgattgggaccctgtggcccggaagcctgaaaagaccatggaagagagatgggagtcctactccaagtacatccggcccgagcagtatatggtgggattcagcttctacgaggaaaacgccggctccggcaacctgtggtacgacatcaacgagcggaaggacgaccacaaccctctgaactccgagatcgccggcacccgggctgagagatacgctaagtggcagcccaagaccggcggagtgaagggcggcatcttctcctacgccatcgatagggatggcgtggcccaccagcctaagaaggtgtccgacgacgagaagcggaccaacaaggctatcaaggacatcaccgacggcatcgtgaagtccgactacaaggtgtccaaagccctgaagaaagtgatggaaaacgacaagagctacgagctgatcgaccagaaggacttccccgataaggccctgcgcgaggccgtgattgctcaagtgggctccagacggggcgacctggaaagattcaacggcaccctgcggctggacaaccccgacatcaagtccctggaaggcctgaacaaactgaagaagctggccaagctggaactgatcggactgtcccagatcacaaagctggactcctccgtgctgcctgagaacatcaagcccaccaaggacaccctggtgtccgtgctggaaacctacaagaacgacgaccggaaagaggaagccaaggccatccctcaggtggccctgaccatctctggcctgaccggcctgaaagagctgaatctggccggcttcgaccgggattccctggctggaatcgatgccgcctctctgacctccctggaaaaagtggacctgtctaagaacaagctggatctggctgccggcaccgagaaccggcagatcttcgacgtgatgctgtccaccgtgtccaacagagtgggcagcaacgagcagaccgtgaccttcgaccaccagaagcccaccggccactaccctaacacctacggcaccacctccctgagactgcctgtgggcgagggcaagatcgacctgcagtcccagctgctgttcggcaccgtgaccaaccagggcacactgatcaactccgaggccgattacaaggcctaccaggaacagctgatcgctgggcggagattcgtggaccctggctacgcttacaagaacttcgccgtgacctacgatgcctacaaagtgcgcgtgaccgactccaccctgggcgtgacagacgaaaagaagctgagcacctccaaagaagagacatacaaggtggaattcttctcccccaccaatggcaccaagcctgtgcatgaggctaaggtggtcgtgggcgccgagaaaaccatgatggtcaacctggccgctggcgccaccgtgatcaagtctgactctcacgagaatgccaaaaaggtgttcgacggcgccatcgagtacaatcctctgagcttctccagcaagaccagcatcaccttcgagtttaaagaacccggcctcgtgaaatactggcggttcttcaacgatatcacccgcaaggacgactacatcaaagaggctaagctggaagccttcgtgggccatctggaagatgactccaaagtgaaggactctctggaaaagtccaccgagtgggtcaccgtgtctgactactctggcgaggcccaggaattctcccagcccctggacaacatctccgccaagtattggagagtgaccgtggacaccaagggcggacggtacagctctcctagcctgcccgagctgcagatcctgggctacagactgcctctgacccacgactataaggacgacgacgacaaatga
一些實(shí)施方式中,本文所述Endo S酶可以是相較于SEQ ID NO:11具有至少85%(例如90%,93%,95%,96%,97%,98%,或99%)序列相同性的酶(例如野生型酶)。實(shí)例包括但不限于:GenBank登錄號(hào).EQB24254.1,WP_037584019.1,WP_012679043.1,和ADC53484.1所描述的那些。
C.基因工程宿主動(dòng)物細(xì)胞
本文所述宿主動(dòng)物細(xì)胞是基因工程化的,以過量表達(dá)一種或多種具有特異性聚糖-修飾活性的酶(例如糖苷酶或乙二醇轉(zhuǎn)移酶)?;蚬こ趟拗鲃?dòng)物細(xì)胞是指攜帶外源(非原生)基因材料的動(dòng)物細(xì)胞,這些外源基因編碼本文所述的一種或多種巖藻糖苷酶和糖苷內(nèi)切酶。過量表達(dá)酶的宿主細(xì)胞是指基因工程宿主細(xì)胞,所述基因工程宿主細(xì)胞表達(dá)酶的水平相較于該宿主細(xì)胞野生型對(duì)應(yīng)物表達(dá)酶的水平高(例如20%,50%,80%,100%,2-倍,5-倍,10-倍,50-倍,100-倍,1,000-倍,104-倍,或105-倍),該宿主細(xì)胞野生型對(duì)應(yīng)物即為未含有與基因工程宿主細(xì)胞相同基因改性的相同類型的細(xì)胞。在一些實(shí)施方式中,編碼本文所述外源酶的基因可被導(dǎo)入到適用的母體動(dòng)物細(xì)胞,以生產(chǎn)本文所述基因工程宿主動(dòng)物細(xì)胞。外源酶是指不存在于用以制造工程化宿主動(dòng)物細(xì)胞的母體細(xì)胞內(nèi)的酶。
本文所述基因工程宿主動(dòng)物細(xì)胞,相較于野生型對(duì)應(yīng)物,能夠生產(chǎn)具有改性糖基化的糖蛋白,其可以通過常規(guī)重組技術(shù)制備。一些情況下,可將強(qiáng)的啟動(dòng)子插入內(nèi)源巖藻糖苷酶和/或糖苷內(nèi)切酶基因上游,以增強(qiáng)其表達(dá)。其它情況下,可將編碼一種或多種巖藻糖苷酶和/或糖苷內(nèi)切酶的基因材料導(dǎo)入母體宿主細(xì)胞,以生產(chǎn)本文所述的基因工程宿主動(dòng)物細(xì)胞。
利用本領(lǐng)域熟知的方法,可將編碼巖藻糖苷酶或糖苷內(nèi)切酶的基因插入適用的表達(dá)載體(例如病毒載體或非病毒載體)。Sambrook等人.,分子克隆(Molecular Cloning),實(shí)驗(yàn)手冊(cè)(A Laboratory Mannual),第三版.,冷泉港實(shí)驗(yàn)室出版社。舉例而言,基因與載體在合適的條件下可互相接觸,用限制性內(nèi)切酶在各分子上創(chuàng)造互補(bǔ)端,可互補(bǔ)配對(duì),且以連接酶相連接?;蛘?,可將合成的核酸連接物連接至基因末端。這些合成的連接物含有相對(duì)應(yīng)于載體上特定限制位點(diǎn)的核酸序列。在一些實(shí)施方式中,巖藻糖苷酶或糖苷內(nèi)切酶的基因包含于表達(dá)盒內(nèi),其包含一種或多種下述元件:科扎克序列以及信號(hào)肽序列,其位于酶的N末端,以及蛋白質(zhì)標(biāo)簽(例如FLAG,His標(biāo)簽,包括幾丁質(zhì)結(jié)合蛋白(CBP)、麥芽糖結(jié)合蛋白(MBP)、和谷胱甘肽-S-轉(zhuǎn)移酶(GST))。所述蛋白標(biāo)簽可以位于酶的N末端或者C末端。見例如圖2中B。
此外,所述表達(dá)載體可包含,例如,下述部分或全部:選擇性標(biāo)記基因,例如新霉素基因,用于哺乳動(dòng)物細(xì)胞篩選穩(wěn)定或瞬時(shí)轉(zhuǎn)染子;源于人類CMV立即早期基因的增強(qiáng)子/啟動(dòng)子序列,用于高水平轉(zhuǎn)錄;用于mRNA穩(wěn)定性,源自SV40的轉(zhuǎn)錄終止和RNA加工信號(hào)(基因);SV40多瘤復(fù)制起始(序列)和ColE1,用于適度游離型復(fù)制;通用多克隆位點(diǎn);以及T7與SP6RNA啟動(dòng)子,用于體外轉(zhuǎn)錄正義與反義RNA。合適的載體以及用于生產(chǎn)含轉(zhuǎn)基因載體的方法為本領(lǐng)域公知且可及。Sambrook等人.,分子克隆,實(shí)驗(yàn)手冊(cè),第三版.,冷泉港實(shí)驗(yàn)室出版社。
若需用二種或二種以上酶構(gòu)建本文所述的宿主動(dòng)物細(xì)胞,例如二種或多種巖藻糖苷酶,二種或多種糖苷內(nèi)切酶,或巖藻糖苷酶與糖苷內(nèi)切酶組合,則編碼二種或多種酶的基因可插入各自表達(dá)載體或插入到經(jīng)設(shè)計(jì)以表達(dá)多種蛋白的共用表達(dá)載體。
用于生產(chǎn)巖藻糖苷酶和/或糖苷內(nèi)切酶表達(dá)載體可導(dǎo)入適用的母體宿主細(xì)胞,包括但不限于:鼠骨髓瘤細(xì)胞(如NSO細(xì)胞)、中國(guó)倉(cāng)鼠卵巢(CHO)細(xì)胞、人類胚胎腎細(xì)胞(如HEK293)、和人類視網(wǎng)膜母細(xì)胞瘤細(xì)胞(如PER.C6)。選擇適用的宿主細(xì)胞系,其屬于本領(lǐng)域技術(shù)人員公知范疇,將取決于高產(chǎn)率需求及生產(chǎn)具有所需性質(zhì)產(chǎn)物的需求之間的平衡。
一些情況下,表達(dá)載體可經(jīng)設(shè)計(jì),以致它們能通過本領(lǐng)域公知方法,以同源或非同源重組并入細(xì)胞基因中。用于將表達(dá)載體轉(zhuǎn)移至母宿主細(xì)胞的方法包括但不限于:病毒介導(dǎo)的基因轉(zhuǎn)移、脂質(zhì)體介導(dǎo)的轉(zhuǎn)移、轉(zhuǎn)化、轉(zhuǎn)染和轉(zhuǎn)導(dǎo),例如,病毒介導(dǎo)的基因轉(zhuǎn)移,如使用基于DNA病毒如腺病毒、腺相關(guān)病毒以及皰疹病毒的載體,以及基于反轉(zhuǎn)錄病毒的載體。基因轉(zhuǎn)移模式的示例包括,如:裸DNA、CaPO4沉淀、DEAE葡聚糖、電穿孔、原生質(zhì)體融合、脂轉(zhuǎn)染、細(xì)胞顯微注射以及病毒載體、佐劑輔助型DNA、基因槍、導(dǎo)管。在一個(gè)示例當(dāng)中,使用病毒載體。為了增加非病毒載體至細(xì)胞的輸送,核酸或蛋白可偶聯(lián)結(jié)合至抗體或它的結(jié)合片段,其結(jié)合細(xì)胞表面抗原。也包含靶標(biāo)抗體或其片段的脂質(zhì)體可用于本文所述的方法。
本文所述“病毒載體”是指以重組方式生產(chǎn)的病毒或病毒顆粒,其包含待輸送至宿主細(xì)胞的多核苷酸,不論體內(nèi)、離體或體外。病毒載體的示例包括:反轉(zhuǎn)錄病毒載體例如慢病毒載體、腺病毒載體、腺相關(guān)病毒載體及其類似物。對(duì)于反轉(zhuǎn)錄病毒載體介導(dǎo)的基因轉(zhuǎn)移方面,載體構(gòu)建物是指包含反轉(zhuǎn)錄病毒基因組或其部分的多核苷酸,以及治療性基因。
基因工程宿主動(dòng)物細(xì)胞可包含使用為所導(dǎo)入的一種或多用基因組成型表達(dá)或誘導(dǎo)型表達(dá)所創(chuàng)建的表達(dá)盒。此表達(dá)盒可包括調(diào)節(jié)元件,如啟動(dòng)子、起始密碼子、終止密碼子及多腺苷化信號(hào)。所述元件能夠可操作連接至編碼感興趣的表面蛋白的基因,如此,該基因在宿主細(xì)胞中可操作(例如表達(dá))。
許多啟動(dòng)子可用于表達(dá)巖藻糖苷酶和/或糖苷內(nèi)切酶(以及本文所述的任何外源糖蛋白)。可用于表達(dá)蛋白的啟動(dòng)子為本領(lǐng)域公知,包括但不限于:巨細(xì)胞病毒(CMV)中間早期啟動(dòng)子、病毒LTR如勞斯肉瘤(Rous sarcoma)病毒LTR、HIV-LTR、HTLV-1LTR、猿猴(simian)病毒40(SV40)早期啟動(dòng)子、大腸桿菌lac UV5啟動(dòng)子及單純皰疹tk病毒啟動(dòng)子。
也可以使用調(diào)節(jié)性啟動(dòng)子。此類調(diào)節(jié)性啟動(dòng)子包括使用四環(huán)素抑制子(tetR)[Gossen,M.和Bujard,H.,Proc.Natl.Acad.Sci.USA 89:5547-5551(1992);Yao,F.等人,人類基因治療,9:1939-1950(1998);Shockelt,P.等人,Proc.Natl.Acad.Sci.USA,92:6522-6526(1995)]的那些。其它系統(tǒng)包括FK506二聚體,使用雌二醇的VP16或p65,RU486,二酚米樂甾酮(diphenol murislerone)或雷帕霉素(rapamycin)。誘導(dǎo)性系統(tǒng)可向英杰(Invitrogen)、克隆泰克(Clontech)和阿瑞雅德(Ariad)購(gòu)買。
一些誘導(dǎo)性啟動(dòng)子的效果可隨時(shí)間增加。在這種情況下,通過串聯(lián)方式插入多重抑制子,如通過內(nèi)部核糖體插入位點(diǎn)(internal ribosome entry site;IRES)將TetR連接至TetR,可以增加此類系統(tǒng)的效果?;蛘?,可以于篩選所需功能前,等待至少3天。雖然可能發(fā)生某些沉默現(xiàn)象,可以通過使用適當(dāng)數(shù)目細(xì)胞而最小化該現(xiàn)象,所述細(xì)胞數(shù)目較佳為至少1x104、更佳為至少1x105、又更佳為至少1x106、甚至更佳為至少1x107??梢酝ㄟ^公知方法增強(qiáng)所需蛋白表達(dá),以增進(jìn)所述系統(tǒng)的功效。例如,使用土拔鼠肝炎病毒轉(zhuǎn)錄后調(diào)控元件(Woodchuck Hepatitis Virus Posttranscriptional Regulatory Element;WPRE)。參見Loeb,V.E.,等人,人類基因治療(Human Gene Therapy)10:2295-2305(1999);Zufferey,R.,等人,J.of Virol.73:2886-2892(1999);Donello,J.E.,等人,J.of Virol.72:5085-5092(1998).
用于實(shí)施本文所述方法的聚腺苷酸化信號(hào)的示例包括但不限于:人類I型膠原蛋白聚腺苷酸化信號(hào)、人類II型膠原蛋白聚腺苷酸化信號(hào)及SV40聚腺苷酸化信號(hào)。
包括可操作地連接至調(diào)控元件的巖藻糖苷酶基因和/或糖苷內(nèi)切酶基因(以及本文所述的糖蛋白基因)的外源基因材料,可維持存在于細(xì)胞以作為功能胞質(zhì)分子、功能游離分子或其可整合入細(xì)胞的染色體DNA。外源基因材料可導(dǎo)入細(xì)胞,其以質(zhì)粒形式維持為獨(dú)立基因材料?;蛘撸烧先肴旧w的線型DNA,可被導(dǎo)入細(xì)胞。當(dāng)導(dǎo)入DNA至細(xì)胞內(nèi)時(shí),可加入促進(jìn)DNA整合入染色體的試劑。適用于促進(jìn)整合的DNA序列也可包括于DNA分子中?;蛘撸蓪NA導(dǎo)入細(xì)胞。
可使用選擇性標(biāo)記監(jiān)控本文所述宿主動(dòng)物細(xì)胞對(duì)所需轉(zhuǎn)入基因的攝取。這些標(biāo)記基因可以受控于任何啟動(dòng)子或誘導(dǎo)性啟動(dòng)子。這些是本領(lǐng)域公知的,且包括改變細(xì)胞對(duì)刺激,如營(yíng)養(yǎng)物、抗生素等的敏感性的基因。基因包括那些neo、puro、tk、多重抗藥性(multiple drug resistance;MDR)等基因。其它基因表達(dá)可易于進(jìn)行篩選的蛋白,如綠色熒光蛋白(GFP)、藍(lán)色熒光蛋白(BFP)、熒光素酶及LacZ。
D.生產(chǎn)具有改性糖基化的糖蛋白
基因工程宿主動(dòng)物細(xì)胞可用于生產(chǎn)具有改性糖基化模式的糖蛋白(如內(nèi)源性或外源性)。一些具體實(shí)施方式中,用于生產(chǎn)上述基因工程宿主動(dòng)物細(xì)胞的母宿主細(xì)胞已攜帶編碼外源性糖蛋白的一或多個(gè)基因。在其它實(shí)施方式中,編碼感興趣糖蛋白的一個(gè)或多個(gè)基因可被導(dǎo)入基因工程宿主動(dòng)物細(xì)胞,所述宿主動(dòng)物細(xì)胞可由本領(lǐng)域公知或本文所述方法表達(dá)一種或多種巖藻糖苷酶和/或糖苷內(nèi)切酶。
能產(chǎn)生感興趣糖蛋白和一種或多種巖藻糖苷酶和/或糖苷內(nèi)切酶兩者的基因工程宿主動(dòng)物細(xì)胞可于容許蛋白質(zhì)表達(dá)的適當(dāng)條件下培養(yǎng)??墒占?xì)胞和/或培養(yǎng)基,可使用常規(guī)技術(shù)自細(xì)胞和/或培養(yǎng)基分離和純化感興趣的糖蛋白??梢猿R?guī)技術(shù)如LC/MS/MS測(cè)定所生產(chǎn)的糖蛋白的糖基化模式,以確認(rèn)糖基化改性。
一些實(shí)施例中,感興趣的糖蛋白為抗體。示例性抗體包括但不限于:阿昔單抗(糖蛋白IIb/IIIa;心血管疾病),阿達(dá)木單抗(TNF-α;各種自身免疫性疾病,如類風(fēng)濕性關(guān)節(jié)炎),阿侖單抗(CD52;慢性淋巴細(xì)胞白血病),巴利昔單抗(IL-2Rα受體(CD25);移植排斥),貝伐單抗(血管內(nèi)皮生長(zhǎng)因子A;各種癌癥,如結(jié)腸直腸癌、非小細(xì)胞肺癌、膠質(zhì)母細(xì)胞瘤、腎癌、濕性年齡相關(guān)性黃斑退化),卡妥索單抗,西妥昔單抗(EGF受體,各種癌癥,如結(jié)腸直腸癌、頭頸癌),賽妥珠單抗(如聚乙二醇結(jié)合賽妥珠單抗certolizumab pegol)(TNFα;克隆氏癥,類風(fēng)濕性關(guān)節(jié)炎),達(dá)利珠單抗(IL-2Rα受體(CD25);移植排斥),依庫(kù)珠單抗(補(bǔ)體蛋白C5;陣發(fā)性睡眠性血紅蛋白尿癥(paroxysmal nocturnal hemoglobinuria)),依法利珠單抗(CD11a;牛皮癬),吉妥珠單抗(CD33;急性骨髓性白血病(如伴隨卡里奇霉素(calicheamicin))),替伊莫單抗(ibritumomab tiuxetan)(CD20;非霍奇金淋巴瘤(如伴釔-90或銦-111)),英夫利昔單抗(TNF-α;各種自身免疫性疾病,如類風(fēng)濕性關(guān)節(jié)炎),莫羅單抗-CD3(T細(xì)胞CD3受體;移植排斥),那他珠單抗(α-4(α4)整合素;多發(fā)性硬化癥,克隆氏癥),奧馬珠單抗(IgE;過敏性哮喘),帕利珠單抗(RSV F蛋白表位;呼吸道融合病毒感染),帕尼單抗(EGF受體;癌癥,如結(jié)腸直腸癌),蘭尼單抗(血管內(nèi)皮生長(zhǎng)因子A;濕性年齡相關(guān)性黃斑退化),利妥昔單抗(CD20;非霍奇金淋巴瘤),托西莫單抗(CD20,非霍奇金淋巴瘤),曲妥珠單抗(ErbB2;乳腺癌)。
一些實(shí)施例中,感興趣糖蛋白是細(xì)胞因子。示例包括但不限于:干擾素(如IFN-α、INF-β或INF-γ)、白介素(如IL-2,IL-3,IL-4,IL-5,IL-6,IL-7,IL-12),及集落刺激因子(如G-CSF、GM-CSF、M-CSF)。IFN可為如干擾素α2a或干擾素α2b。參見如Mott HR和Campbell ID.“四螺旋束生長(zhǎng)因子及其受體:蛋白-蛋白相互作用(Four-helix bundle growth factors and their receptors:protein-protein interactions).”Curr Opin Struct Biol.1995 2月;5(1):114-21;Chaiken IM,Williams WV.“識(shí)別四螺旋束細(xì)胞因子結(jié)構(gòu)-功能關(guān)系:重新模擬設(shè)計(jì)(Identifying structure-function relationships in four-helix bundle cytokines:towards de novo mimetics design).”Trends Biotechnol.1996,10月;14(10):369-75;Klaus W等人,“由異核核磁共振光譜學(xué)測(cè)定溶解狀態(tài)中人類干擾素α-2a的三維高分辨率結(jié)構(gòu)(The three-dimensional high resolution structure of human interferon alpha-2a determined by heteronuclear NMR spectroscopy in solution)”.J.Mol Biol.,274(4):661-75,1997,對(duì)于某些此類細(xì)胞因子的進(jìn)一步討論。
感興趣的蛋白質(zhì)可為細(xì)胞因子蛋白,其具有與一種或多種前述細(xì)胞因子類似的結(jié)構(gòu)。例如,所述細(xì)胞因子可為IL-6類細(xì)胞因子例如白血病抑制因子(leukemia inhibitory factor;LIF)或抑瘤素M(oncostatin M)。一些實(shí)施方式中,所述細(xì)胞因子自然結(jié)合至包含GP130信號(hào)轉(zhuǎn)導(dǎo)亞基的受體。其它感興趣的四螺旋束蛋白包括生長(zhǎng)激素(growth hormone;GH)、泌乳素(prolactin;PRL)及胎盤生乳素(placental lactogen)。一些具體實(shí)施方式中,靶標(biāo)蛋白為紅血球生成刺激劑,如(EPO),其也為四螺旋束細(xì)胞因子。一些具體實(shí)施方式中,紅血球生成刺激劑為EPO變體,如阿法達(dá)貝泊汀,也稱作新型紅血球生成刺激蛋白(novel erythropoiesis stimulating protein;NESP),其被工程化以含有5條N-連接的糖鏈(比重組HuEPO多2條)。一些具體實(shí)施方式中,所述蛋白包含5個(gè)螺旋。例如,所述蛋白可為干擾素β,如干擾素β-1a或β-1b,其(如所理解)通常歸類為四螺旋束細(xì)胞因子。一些具體實(shí)施方式中,靶標(biāo)蛋白為IL-9,IL-10,IL-11,IL-13,或IL-15。參見如Hunter,CA,免疫學(xué)自然綜述(Nature Reviews Immunology)5,521-531,2005,對(duì)特定細(xì)胞因子的探討。還參見Paul,WE(編輯),基礎(chǔ)免疫學(xué)(Fundamental Immunology),Lippincott Williams和Wilkins;第6版,2008。
此外,感興趣的蛋白可以為用于治療人體疾病或失調(diào),經(jīng)美國(guó)食品藥品管理局(或?qū)Φ确ㄒ?guī)機(jī)構(gòu)如歐洲藥物評(píng)審局)批準(zhǔn)的蛋白。此類蛋白可為或可不為這樣的蛋白,聚乙二醇化(PEGylated)形式已經(jīng)經(jīng)臨床試驗(yàn)和/或上市許可。在某些情況下,所述感興趣蛋白為Fc融合蛋白,包括但不限于:阿巴西普、依那西普、IL-2-Fc融合蛋白、CD80-Fc融合蛋白,及PDL1-1-Fc融合蛋白。
進(jìn)一步地,感興趣的蛋白可為神經(jīng)營(yíng)養(yǎng)因子,即促進(jìn)神經(jīng)譜系細(xì)胞(該詞如本文所使用包括神經(jīng)祖細(xì)胞、神經(jīng)元、及神經(jīng)膠質(zhì)細(xì)胞,如星狀膠質(zhì)細(xì)胞、少突膠質(zhì)細(xì)胞、小膠質(zhì)細(xì)胞)存活、發(fā)育和/或功能的因子。例如,一些具體實(shí)施方式中,靶標(biāo)蛋白為促進(jìn)軸突生長(zhǎng)的因子。一些具體實(shí)施方式中,所述蛋白為睫狀神經(jīng)營(yíng)養(yǎng)因子(ciliary neurotrophic factor;CNTF,一種四螺旋束蛋白)或其類似物如阿索開(Axokine),其為人類睫狀神經(jīng)營(yíng)養(yǎng)因子的修飾版本,其具有C端15個(gè)氨基酸截?cái)嘁约?個(gè)氨基酸取代,其體外與體內(nèi)試驗(yàn)的功效為CNTF的3至5倍,且具有改進(jìn)的穩(wěn)定性。
或者,感興趣的蛋白可為酶,如代謝或其他生理過程中重要的酶。如本領(lǐng)域所知,酶或者其他蛋白質(zhì)的缺乏可造成多種疾病。此類疾病包括與糖類代謝、氨基酸代謝、有機(jī)酸代謝、卟啉(porphyrin)代謝、嘌呤或嘧啶代謝、溶酶體儲(chǔ)存失調(diào)、血液凝聚等缺陷相關(guān)的疾病。實(shí)例包括法布瑞癥(Fabry disease)、高歇氏癥(Gaucher disease)、龐貝氏癥(Pompe disease)、腺苷脫胺酶缺失、天冬酰胺酶缺失、卟啉病(porphyria)、血友病以及遺傳性血管水腫。一些具體實(shí)施方式中,蛋白為凝血或凝聚因子(如因子VII、VIIa、VIII或IX)。其他具體實(shí)施方式中,蛋白是一種酶,其在糖類代謝、氨基酸代謝、有機(jī)酸代謝、卟啉代謝、嘌呤或嘧啶代謝、和/或溶酶體儲(chǔ)存中發(fā)揮作用,其中外源性注射所述酶可至少部分緩解疾病。
此外,感興趣的蛋白可為激素,如胰島素、生長(zhǎng)激素、黃體生成素、卵泡刺激素、及促甲狀腺激素。感興趣的蛋白可為生長(zhǎng)因子,包括但不限于:腎上腺髓質(zhì)素(adrenomedullin;AM)、血管生成素(angiopoietin;Ang)、自分泌運(yùn)動(dòng)因子(autocrine motility factor)、骨形成蛋白(bone morphogenetic proteins;BMPS)、腦源神經(jīng)營(yíng)養(yǎng)因子(brain-derived neurotrophic factor;BDNF)、表皮生長(zhǎng)因子(EGF)、紅血球生成素(EPO)、成纖維細(xì)胞生長(zhǎng)因子(fibroblast growth factor;FGF)、膠質(zhì)源性神經(jīng)營(yíng)養(yǎng)因子(glial cell line-derived neurotrophic factor;GDNF)、粒細(xì)胞集落刺激因子(granulocyte colony-stimulating factor;G-CSF)、粒細(xì)胞巨噬細(xì)胞集落刺激因子(granulocyte macrophage colony-stimulating factor;GM-CSF)、生長(zhǎng)分化因子-9(growth differentiation factor-9;GDF9)、愈合因子、肝細(xì)胞生長(zhǎng)因子(HGF)、肝癌源性生長(zhǎng)因子(hepatoma-derived growth factor;HDGF)、類胰島素生長(zhǎng)因子(IGF)、角質(zhì)細(xì)胞生長(zhǎng)因子(keratinocyte growth factor;KGF)、促移行因子(migration-stimulating factor;MSF)、肌肉生長(zhǎng)抑制素(myostatin;GDF-8)、神經(jīng)生長(zhǎng)因子(NGF)與其他神經(jīng)營(yíng)養(yǎng)因子、血小板衍生生長(zhǎng)因子(PDGF)、血小板生成素(TPO)、轉(zhuǎn)化生長(zhǎng)因子α(TGF-α)、轉(zhuǎn)化生長(zhǎng)因子β(TGF-β)、腫瘤壞死因子α(TNF-α)、血管內(nèi)皮生長(zhǎng)因子(VEGF)及胎盤生長(zhǎng)因子(PGF)。
據(jù)信無需進(jìn)一步詳盡說明,本領(lǐng)域普通技術(shù)人員可根據(jù)前面描述,最大程度利用本發(fā)明。因此,下列具體實(shí)施方式應(yīng)理解為僅具有說明性,而非以任何方式局限本發(fā)明。通過引用并入本文引用的所有出版物用于本文引用的主題或目的。
實(shí)施例
方法
i.構(gòu)建用于生產(chǎn)巖藻糖苷酶或糖苷內(nèi)切酶的表達(dá)載體
為了構(gòu)建巖藻糖苷酶或糖苷內(nèi)切酶的表達(dá)載體,以常規(guī)技術(shù)分離巖藻糖苷酶或糖苷內(nèi)切酶基因,并基于倉(cāng)鼠細(xì)胞的密碼子使用,進(jìn)行密碼子優(yōu)化。合成基因由GeneArt Corp制備,并在限制酶位點(diǎn)Bgl II/EcoR I插入到pcDNA3.1B(-)Myc-His載體(英杰,US)(圖2中A)。α巖藻糖苷酶的表達(dá)盒,從5’到3’,包含科扎克序列、Igk前導(dǎo)序列、巖藻糖苷酶編碼序列及His-標(biāo)簽編碼序列。圖2中B。糖苷內(nèi)切酶表達(dá)盒,從5’到3’,包含科扎克序列、Igk前導(dǎo)序列、糖苷內(nèi)切酶編碼序列、及His-標(biāo)簽編碼序列。圖2中B。
ii.制備去巖藻糖基化抗體
抗體產(chǎn)生細(xì)胞株以0.3~3.0×106存活細(xì)胞/毫升的濃度維持于完全培養(yǎng)基,以8mM L-谷氨酰胺與抗結(jié)塊劑1:100稀釋補(bǔ)充的CD FortiCHOTM培養(yǎng)基(生命科技(Life Technologies),美國(guó))。將細(xì)胞維持在含8%CO2培養(yǎng)箱中的振蕩臺(tái),其設(shè)定為130-150rpm。
為了制備去巖藻糖基化抗體,利用FreeStyleMAX試劑(生命科技,美國(guó))根據(jù)制造商的方案,以編碼前述α-巖藻糖苷酶的表達(dá)載體轉(zhuǎn)染前述抗體產(chǎn)生細(xì)胞。將轉(zhuǎn)染細(xì)胞培養(yǎng)于含有4g/L葡萄糖的培養(yǎng)基,且每隔一天置換培養(yǎng)基。當(dāng)細(xì)胞存活率降至70%時(shí),收取細(xì)胞。收集澄清的培養(yǎng)上清液,并以蛋白A層析法純化。
iii.分析抗體糖基化
根據(jù)本文所述方法制備的重組抗體是經(jīng)過還原、烷基化、及于37℃25mM碳酸氫銨緩沖液(pH~8)存在下以胰酶隔夜分解。將PNG酶F溶液(3μL,羅氏(Roche))加至200μL經(jīng)分解的樣本,且將混合物于37℃下再孵育16小時(shí)。使用C18柱(沃特斯(Waters))從肽中分離釋放出的聚糖。以乙腈洗脫Sep-Pak C18,之后以水清洗。將PNG酶F分解樣本填入該柱,并以1%乙醇沖洗釋放的聚糖,同時(shí)肽仍結(jié)合于Sep-Pak C18。釋放的蛋白質(zhì)寡糖首先用多孔性石墨碳管柱(PhyNexus)純化,之后經(jīng)全甲基化(permethylated)。所有質(zhì)譜實(shí)驗(yàn)以O(shè)rbitrap Fusion Tribrid質(zhì)譜儀進(jìn)行,其經(jīng)由直接輸注至納米電噴霧源(nano-electrospray source)內(nèi)。
結(jié)果
1.生產(chǎn)具有單糖(GlcNAc)糖型的抗體h4B12、利妥昔單抗和奧馬珠單抗
通過巖藻糖苷酶切割反應(yīng),可由前述二糖變體制備單糖的糖變體。由許多可得酶的檢索及通過LC/MS/MS的乙二醇-肽分析顯示,經(jīng)過優(yōu)化切割反應(yīng)條件,使用α-1,6巖藻糖苷酶可達(dá)到有效去巖藻糖苷化,且較高的切割效率與較低的NF/N比例有關(guān)?;蛘?,可依次結(jié)合兩種反應(yīng)酶以獲得單糖的糖變體,所述兩種酶包括糖苷內(nèi)切酶(Endo S)和α-1,6-巖藻糖苷酶。所得的單-GlcNAc糖變體示于圖3中A。
結(jié)果顯示,Endo S移除大于90%本文所述的工程CHO細(xì)胞中產(chǎn)生的h4B12、利妥昔單抗和奧馬珠單抗的重鏈的N-連接聚糖。關(guān)于表達(dá)各巖藻糖苷酶的CHO細(xì)胞中產(chǎn)生的h4B12抗體,五種不同類型的巖藻糖苷酶FUCA1、FUCA2、灰倉(cāng)鼠巖藻糖苷酶α-L-1、腦膜膿毒性金黃桿菌α-1,6-巖藻糖苷酶以及BF3242的去巖藻糖基化能力分別為5.8%,9.1%,17.7%,11.5和68%。圖3中B。酶表達(dá)以Western blot法檢測(cè),如圖3中C所示。
2-去氧-2-氟L-巖藻糖是經(jīng)氟化的巖藻糖類似物。其可以在宿主細(xì)胞內(nèi)代謝,以產(chǎn)生基于底物的巖藻糖基轉(zhuǎn)移酶抑制劑。當(dāng)培養(yǎng)瞬時(shí)表達(dá)巖藻糖苷酶BF3242和Endo S的產(chǎn)生抗體的CHO細(xì)胞時(shí),連接至CHO細(xì)胞產(chǎn)生的抗體的N-聚糖中,99.89%為單糖基化(GlcNAc-Ig-Fc)。圖3中D。
CHO-35D6細(xì)胞,其產(chǎn)生利妥昔單抗,以表達(dá)載體穩(wěn)定轉(zhuǎn)染,以生產(chǎn)BF3242和Endo S。細(xì)胞于100~400g/ml G418存在下培養(yǎng)。因此,所產(chǎn)生的抗體含有17-19%的GlcNAc-Ig-Fc。圖4中A。
酶表達(dá)以Western blot法檢測(cè),如圖4中B所示。
在工程化以表達(dá)巖藻糖苷酶和Endo S兩者的CHO細(xì)胞中所生產(chǎn)的奧馬珠單抗中,觀察到類似結(jié)果。
對(duì)于制備均一的聚糖型抗體中的轉(zhuǎn)糖基化反應(yīng)而言,這樣的效率代表了至為重要的步驟。
其它實(shí)施方式
本說明書所揭示的所有特征可以任意組合結(jié)合。本說明書揭示的各特征可基于相同、等同或類似目的,以另一特征所替代。因此,除非另有明確指明,所揭示的各特征僅為一通用系列的等同或類似特征的實(shí)例。
由上述陳述,本領(lǐng)域普通技術(shù)人員可易于確定本發(fā)明的必要特征,且在不背離其精神及范疇之下,可進(jìn)行本發(fā)明的各種變更及改進(jìn),以使其適應(yīng)于各種不同的用途及條件。因此,其它實(shí)施方式也在權(quán)利要求范圍內(nèi)。
序列表
<110> 泉盛生物科技公司
<120> 生產(chǎn)具有可控多糖形式的重組蛋白的方法
<130> F0720.70001WO00
<150> US 61/954,337
<151> 2014-03-17
<160> 12
<170> PatentIn version 3.5
<210> 1
<211> 461
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 1
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys Gln Pro Pro Arg Arg Tyr Thr Pro Asp Trp Pro Ser
20 25 30
Leu Asp Ser Arg Pro Leu Pro Ala Trp Phe Asp Glu Ala Lys Phe Gly
35 40 45
Val Phe Ile His Trp Gly Val Phe Ser Val Pro Ala Trp Gly Ser Glu
50 55 60
Trp Phe Trp Trp His Trp Gln Gly Glu Gly Arg Pro Gln Tyr Gln Arg
65 70 75 80
Phe Met Arg Asp Asn Tyr Pro Pro Gly Phe Ser Tyr Ala Asp Phe Gly
85 90 95
Pro Gln Phe Thr Ala Arg Phe Phe His Pro Glu Glu Trp Ala Asp Leu
100 105 110
Phe Gln Ala Ala Gly Ala Lys Tyr Val Val Leu Thr Thr Lys His His
115 120 125
Glu Gly Phe Thr Asn Trp Pro Ser Pro Val Ser Trp Asn Trp Asn Ser
130 135 140
Lys Asp Val Gly Pro His Arg Asp Leu Val Gly Glu Leu Gly Thr Ala
145 150 155 160
Leu Arg Lys Arg Asn Ile Arg Tyr Gly Leu Tyr His Ser Leu Leu Glu
165 170 175
Trp Phe His Pro Leu Tyr Leu Leu Asp Lys Lys Asn Gly Phe Lys Thr
180 185 190
Gln His Phe Val Ser Ala Lys Thr Met Pro Glu Leu Tyr Asp Leu Val
195 200 205
Asn Ser Tyr Lys Pro Asp Leu Ile Trp Ser Asp Gly Glu Trp Glu Cys
210 215 220
Pro Asp Thr Tyr Trp Asn Ser Thr Asn Phe Leu Ser Trp Leu Tyr Asn
225 230 235 240
Asp Ser Pro Val Lys Asp Glu Val Val Val Asn Asp Arg Trp Gly Gln
245 250 255
Asn Cys Ser Cys His His Gly Gly Tyr Tyr Asn Cys Glu Asp Lys Phe
260 265 270
Lys Pro Gln Ser Leu Pro Asp His Lys Trp Glu Met Cys Thr Ser Ile
275 280 285
Asp Lys Phe Ser Trp Gly Tyr Arg Arg Asp Met Ala Leu Ser Asp Val
290 295 300
Thr Glu Glu Ser Glu Ile Ile Ser Glu Leu Val Gln Thr Val Ser Leu
305 310 315 320
Gly Gly Asn Tyr Leu Leu Asn Ile Gly Pro Thr Lys Asp Gly Leu Ile
325 330 335
Val Pro Ile Phe Gln Glu Arg Leu Leu Ala Val Gly Lys Trp Leu Ser
340 345 350
Ile Asn Gly Glu Ala Ile Tyr Ala Ser Lys Pro Trp Arg Val Gln Trp
355 360 365
Glu Lys Asn Thr Thr Ser Val Trp Tyr Thr Ser Lys Gly Ser Ala Val
370 375 380
Tyr Ala Ile Phe Leu His Trp Pro Glu Asn Gly Val Leu Asn Leu Glu
385 390 395 400
Ser Pro Ile Thr Thr Ser Thr Thr Lys Ile Thr Met Leu Gly Ile Gln
405 410 415
Gly Asp Leu Lys Trp Ser Thr Asp Pro Asp Lys Gly Leu Phe Ile Ser
420 425 430
Leu Pro Gln Leu Pro Pro Ser Ala Val Pro Ala Glu Phe Ala Trp Thr
435 440 445
Ile Lys Leu Thr Gly Val Lys His His His His His His
450 455 460
<210> 2
<211> 1386
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 2
atgagagtgc ctgctcagct gctgggactg ctgctgctgt ggctgcctgg tgctagatgc 60
cagccccctc ggagatacac ccctgactgg ccttccctgg actccagacc tctgcccgct 120
tggtttgacg aggccaagtt cggcgtgttc atccactggg gcgtgttctc cgtgcctgcc 180
tggggctctg agtggttctg gtggcattgg cagggcgagg gcagacctca gtaccagcgg 240
ttcatgcggg acaactaccc ccctggcttc tcctacgccg acttcggccc tcagttcacc 300
gcccggttct tccaccctga ggaatgggcc gatctgttcc aggccgctgg cgccaaatac 360
gtggtgctga ccaccaagca ccacgagggc ttcaccaact ggccctcccc cgtgtcctgg 420
aactggaact ctaaggacgt gggcccccac cgggatctcg tgggagaact gggaaccgcc 480
ctgcggaagc ggaacatcag atacggcctg taccactccc tgctggaatg gttccacccc 540
ctgtacctgc tggacaagaa gaacggcttc aagacccagc acttcgtgtc cgccaagacc 600
atgcccgagc tgtacgacct cgtgaactcc tacaagcccg acctgatttg gagcgacggc 660
gagtgggagt gccccgacac ctattggaac tccaccaact ttctgtcctg gctgtacaac 720
gactcccctg tgaaggacga ggtggtcgtg aacgacagat ggggccagaa ctgctcctgt 780
caccacggcg gctactacaa ctgcgaggac aagttcaagc cccagtccct gcccgaccac 840
aagtgggaga tgtgcacctc tatcgacaag ttctcctggg gctaccggcg ggacatggcc 900
ctgtctgatg tgaccgagga atccgagatc atctccgagc tggtgcagac cgtgtccctg 960
ggcggcaact acctgctgaa catcggccct accaaggacg gcctgatcgt gcccatcttc 1020
caggaacggc tgctggccgt gggcaagtgg ctgtctatca acggcgaggc catctacgcc 1080
tccaagcctt ggcgagtgca gtgggagaag aacaccacct ccgtgtggta cacctccaag 1140
ggctctgccg tgtacgccat cttcctgcac tggcccgaga acggcgtgct gaacctggaa 1200
tcccccatca ccacctctac caccaagatc accatgctgg gcatccaggg cgacctgaag 1260
tggtccaccg accctgacaa gggcctgttc atctccctgc cccagctgcc tccttccgct 1320
gtgcctgctg agttcgcctg gaccatcaag ctgaccggcg tgaagcacca ccaccatcac 1380
cattga 1386
<210> 3
<211> 465
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 3
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys His Ser Ala Thr Arg Phe Asp Pro Thr Trp Glu Ser
20 25 30
Leu Asp Ala Arg Gln Leu Pro Ala Trp Phe Asp Gln Ala Lys Phe Gly
35 40 45
Ile Phe Ile His Trp Gly Val Phe Ser Val Pro Ser Phe Gly Ser Glu
50 55 60
Trp Phe Trp Trp Tyr Trp Gln Lys Glu Lys Ile Pro Lys Tyr Val Glu
65 70 75 80
Phe Met Lys Asp Asn Tyr Pro Pro Ser Phe Lys Tyr Glu Asp Phe Gly
85 90 95
Pro Leu Phe Thr Ala Lys Phe Phe Asn Ala Asn Gln Trp Ala Asp Ile
100 105 110
Phe Gln Ala Ser Gly Ala Lys Tyr Ile Val Leu Thr Ser Lys His His
115 120 125
Glu Gly Phe Thr Leu Trp Gly Ser Glu Tyr Ser Trp Asn Trp Asn Ala
130 135 140
Ile Asp Glu Gly Pro Lys Arg Asp Ile Val Lys Glu Leu Glu Val Ala
145 150 155 160
Ile Arg Asn Arg Thr Asp Leu Arg Phe Gly Leu Tyr Tyr Ser Leu Phe
165 170 175
Glu Trp Phe His Pro Leu Phe Leu Glu Asp Glu Ser Ser Ser Phe His
180 185 190
Lys Arg Gln Phe Pro Val Ser Lys Thr Leu Pro Glu Leu Tyr Glu Leu
195 200 205
Val Asn Asn Tyr Gln Pro Glu Val Leu Trp Ser Asp Gly Asp Gly Gly
210 215 220
Ala Pro Asp Gln Tyr Trp Asn Ser Thr Gly Phe Leu Ala Trp Leu Tyr
225 230 235 240
Asn Glu Ser Pro Val Arg Gly Thr Val Val Thr Asn Asp Arg Trp Gly
245 250 255
Ala Gly Ser Ile Cys Lys His Gly Gly Phe Tyr Thr Cys Ser Asp Arg
260 265 270
Tyr Asn Pro Gly His Leu Leu Pro His Lys Trp Glu Asn Cys Met Thr
275 280 285
Ile Asp Lys Leu Ser Trp Gly Tyr Arg Arg Glu Ala Gly Ile Ser Asp
290 295 300
Tyr Leu Thr Ile Glu Glu Leu Val Lys Gln Leu Val Glu Thr Val Ser
305 310 315 320
Cys Gly Gly Asn Leu Leu Met Asn Ile Gly Pro Thr Leu Asp Gly Thr
325 330 335
Ile Ser Val Val Phe Glu Glu Arg Leu Arg Gln Met Gly Ser Trp Leu
340 345 350
Lys Val Asn Gly Glu Ala Ile Tyr Glu Thr His Thr Trp Arg Ser Gln
355 360 365
Asn Asp Thr Val Thr Pro Asp Val Trp Tyr Thr Ser Lys Pro Lys Glu
370 375 380
Lys Leu Val Tyr Ala Ile Phe Leu Lys Trp Pro Thr Ser Gly Gln Leu
385 390 395 400
Phe Leu Gly His Pro Lys Ala Ile Leu Gly Ala Thr Glu Val Lys Leu
405 410 415
Leu Gly His Gly Gln Pro Leu Asn Trp Ile Ser Leu Glu Gln Asn Gly
420 425 430
Ile Met Val Glu Leu Pro Gln Leu Thr Ile His Gln Met Pro Cys Lys
435 440 445
Trp Gly Trp Ala Leu Ala Leu Thr Asn Val Ile His His His His His
450 455 460
His
465
<210> 4
<211> 1398
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 4
atgagagtgc ctgctcagct gctgggactg ctgctgctgt ggctgcctgg cgctagatgc 60
cactccgcca ccagattcga ccccacctgg gagtctctgg acgccagaca gctgcccgct 120
tggtttgacc aggccaagtt cggcatcttc atccactggg gcgtgttctc cgtgcccagc 180
ttcggctctg agtggttctg gtggtactgg cagaaagaga agatccccaa atacgtggag 240
ttcatgaagg acaactaccc ccccagcttt aagtacgagg acttcggccc cctgttcacc 300
gccaagttct tcaacgccaa ccagtgggcc gacatcttcc aggcctctgg cgccaagtac 360
atcgtgctga cctccaagca ccacgagggc ttcaccctgt ggggctccga gtactcctgg 420
aactggaacg ccatcgacga gggccccaag cgggacatcg tgaaagaact ggaagtggcc 480
atccggaacc ggaccgacct gagattcggc ctgtactact ccctgttcga gtggttccac 540
cccctgtttc tggaagatga gtcctccagc ttccacaagc ggcagttccc cgtgtccaag 600
accctgcccg agctgtacga gctcgtgaac aactaccagc ccgaggtgct gtggagtgac 660
ggggatggtg gtgcccccga tcagtactgg aactctaccg gcttcctggc ctggctgtac 720
aacgagtctc ctgtgcgggg caccgtcgtg accaacgata gatggggcgc tggctccatc 780
tgcaagcacg gcggcttcta cacctgttcc gaccggtaca accccggcca tctgctgcct 840
cacaagtggg agaactgcat gaccatcgac aagctgtcct ggggctacag aagagaggcc 900
ggcatctccg actacctgac aatcgaggaa ctcgtgaagc agctggtgga aaccgtgtcc 960
tgcggcggca acctgctgat gaacatcggc cctaccctgg acggcaccat ctccgtggtg 1020
ttcgaggaac ggctgcggca gatgggctcc tggctgaaag tgaacggcga ggccatctac 1080
gagacacaca cctggcggtc ccagaacgac accgtgaccc ctgacgtgtg gtacaccagc 1140
aagcccaaag aaaagctggt gtatgccatc ttcctgaagt ggcctacctc cggccagctg 1200
ttcctgggcc accctaaggc tatcctgggc gccaccgaag tgaaactgct gggccatgga 1260
cagcccctga actggatctc cctggaacag aacggcatca tggtggaact gccccagctg 1320
accatccatc agatgccctg caaatggggc tgggccctgg ccctgaccaa cgtgatccac 1380
catcaccacc accactga 1398
<210> 5
<211> 465
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 5
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys Lys Ser Ser Arg Arg Tyr Asp Pro Thr Trp Glu Ser
20 25 30
Leu Asp Arg Arg Pro Leu Pro Ser Trp Phe Asp Gln Ala Lys Phe Gly
35 40 45
Ile Phe Ile His Trp Gly Val Phe Ser Val Pro Ser Phe Gly Ser Glu
50 55 60
Trp Phe Trp Trp Tyr Trp Gln Lys Glu Lys Arg Pro Lys Phe Val Asp
65 70 75 80
Phe Met Asn Asn Asn Tyr Pro Pro Gly Phe Lys Tyr Glu Asp Phe Gly
85 90 95
Val Leu Phe Thr Ala Lys Phe Phe Asn Ala Ser Gln Trp Ala Asp Ile
100 105 110
Leu Gln Ala Ser Gly Ala Lys Tyr Leu Val Leu Thr Ser Lys His His
115 120 125
Glu Gly Phe Thr Leu Trp Gly Ser Glu Tyr Ser Trp Asn Trp Asn Ala
130 135 140
Val Asp Glu Gly Pro Lys Arg Asp Ile Val Lys Glu Leu Lys Val Ala
145 150 155 160
Ile Thr Lys Asn Thr Asp Leu Arg Phe Gly Leu Tyr Tyr Ser Leu Phe
165 170 175
Glu Trp Phe His Pro Leu Phe Leu Glu Asp Lys Leu Ser Ser Phe Gln
180 185 190
Lys Arg Gln Phe Pro Ile Ser Lys Met Leu Pro Glu Leu Tyr Glu Leu
195 200 205
Val Asn Lys Tyr Gln Pro Asp Ile Leu Trp Thr Asp Gly Asp Gly Gly
210 215 220
Ala Pro Asp Arg Tyr Trp Asn Ser Thr Gly Phe Leu Ala Trp Leu Tyr
225 230 235 240
Asn Glu Ser Pro Val Arg Asn Thr Val Val Thr Asn Asp Arg Trp Gly
245 250 255
Ala Gly Ser Ile Cys Lys His Gly Gly Tyr Tyr Thr Cys Ser Asp Arg
260 265 270
Tyr Asn Pro Gly His Leu Leu Pro His Lys Trp Glu Asn Cys Met Thr
275 280 285
Ile Asp Gln Phe Ser Trp Gly Tyr Arg Arg Glu Ala Val Ile Ser Asp
290 295 300
Tyr Leu Thr Ile Glu Glu Leu Val Lys Gln Leu Val Glu Thr Val Ala
305 310 315 320
Cys Gly Gly Asn Leu Leu Met Asn Ile Gly Pro Thr Leu Asp Gly Ile
325 330 335
Ile Pro Val Ile Phe Glu Glu Arg Leu Arg Gln Met Gly Met Trp Leu
340 345 350
Lys Val Asn Gly Glu Ala Ile Tyr Glu Thr Gln Pro Trp Arg Ser Gln
355 360 365
Asn Asp Thr Ala Thr Pro Asp Val Trp Tyr Thr Tyr Lys Pro Glu Glu
370 375 380
Lys Ile Val Tyr Ala Ile Phe Leu Lys Trp Pro Val Ser Arg Glu Leu
385 390 395 400
Phe Leu Glu Gln Pro Ile Gly Ser Leu Gly Glu Thr Glu Val Ala Leu
405 410 415
Leu Gly Glu Gly Lys Pro Leu Thr Trp Thr Ser Leu Lys Pro Asn Gly
420 425 430
Ile Ile Val Glu Leu Pro Gln Leu Thr Leu His Gln Met Pro Cys Lys
435 440 445
Trp Gly Trp Thr Leu Ala Leu Thr Asn Val Thr His His His His His
450 455 460
His
465
<210> 6
<211> 1398
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 6
atgagagtgc ctgctcagct gctgggactg ctgctgctgt ggctgcctgg cgctagatgc 60
aagtcctctc ggagatacga ccccacctgg gagtccctgg acagaaggcc tctgcccagt 120
tggttcgacc aggccaagtt cggcatcttc atccactggg gcgtgttctc cgtgcccagc 180
ttcggctctg agtggttctg gtggtactgg cagaaagaga agcggcccaa gttcgtggac 240
ttcatgaaca acaactaccc ccctggcttt aagtacgagg acttcggcgt gctgttcacc 300
gccaagttct tcaacgcctc ccagtgggcc gacatcctgc aggcttccgg cgctaagtac 360
ctggtgctga cctccaagca ccacgagggc tttaccctgt ggggctccga gtactcctgg 420
aactggaacg ccgtggacga gggccctaag cgggacatcg tgaaagaact gaaggtggcc 480
atcaccaaga acaccgacct gagattcggc ctgtactact ccctgttcga gtggttccac 540
cccctgtttc tggaagataa gctgtccagc ttccagaagc ggcagttccc catctccaag 600
atgctgcccg agctgtacga gctcgtgaac aagtaccagc ctgacatcct gtggaccgac 660
ggggatggtg gcgcccctga cagatactgg aactctaccg gcttcctggc ctggctgtac 720
aacgagtccc ctgtgcggaa caccgtcgtg accaacgaca gatggggcgc tggctccatc 780
tgcaagcacg gcggctacta cacctgttcc gaccggtaca accccggcca tctgctgcct 840
cacaagtggg agaactgcat gacaatcgac cagttctcct ggggctaccg gcgcgaggcc 900
gtgatctctg actacctgac catcgaggaa ctcgtgaagc agctggtgga aaccgtggcc 960
tgtggcggca acctgctgat gaacatcggc cctaccctgg acggcatcat ccccgtgatc 1020
ttcgaggaac ggctgcggca gatgggcatg tggctgaaag tgaacggcga ggccatctac 1080
gagacacagc cttggcggtc ccagaacgac accgccacac ctgacgtgtg gtacacctac 1140
aagcccgaag agaagatcgt gtacgccatc ttcctgaagt ggcccgtgtc cagagagctg 1200
tttctggaac agcccatcgg ctccctgggc gagacagaag tggctctgct gggcgagggc 1260
aagcctctga cctggacctc cctgaagccc aatggcatca tcgtggaact gccccagctg 1320
accctgcacc agatgccctg taaatggggc tggaccctgg ccctgaccaa cgtgacccac 1380
caccaccatc accactga 1398
<210> 7
<211> 481
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 7
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys His Asn Val Ser Glu Gly Tyr Glu Lys Pro Ala Asp
20 25 30
Pro Leu Val Val Gln Asn Leu Glu Gln Trp Gln Asp Leu Lys Phe Gly
35 40 45
Leu Phe Met His Trp Gly Thr Tyr Ser Gln Trp Gly Ile Val Glu Ser
50 55 60
Trp Ser Leu Cys Pro Glu Asp Glu Ser Trp Thr Gln Arg Lys Pro Glu
65 70 75 80
His Gly Lys Ser Tyr Asn Glu Tyr Val Lys Asn Tyr Glu Asn Leu Gln
85 90 95
Thr Thr Phe Asn Pro Val Gln Phe Asn Pro Gln Lys Trp Ala Asp Ala
100 105 110
Thr Lys Lys Ala Gly Met Lys Tyr Val Val Phe Thr Thr Lys His His
115 120 125
Asp Gly Phe Ala Met Phe Asp Thr Lys Gln Ser Asp Tyr Lys Ile Thr
130 135 140
Ser Ser Lys Thr Pro Phe Ser Lys Asn Pro Lys Ala Asp Val Ala Lys
145 150 155 160
Glu Ile Phe Asn Thr Phe Arg Asp Asn Gly Phe Arg Ile Gly Ala Tyr
165 170 175
Phe Ser Lys Pro Asp Trp His Ser Asp Asp Tyr Trp Trp Ser Tyr Phe
180 185 190
Pro Pro Lys Asp Arg Asn Val Asn Tyr Asp Pro Gln Lys Tyr Pro Ala
195 200 205
Arg Trp Glu Asn Phe Lys Lys Phe Thr Phe Asn Gln Leu Asn Glu Ile
210 215 220
Thr Ser Asn Tyr Gly Lys Ile Asp Ile Leu Trp Leu Asp Gly Gly Trp
225 230 235 240
Val Arg Pro Phe His Thr Ile Asp Pro Asn Ile Glu Trp Gln Arg Thr
245 250 255
Ile Lys Val Glu Gln Asp Ile Asp Met Asp Lys Ile Gly Thr Met Ala
260 265 270
Arg Lys Asn Gln Pro Gly Ile Ile Ile Val Asp Arg Thr Val Pro Gly
275 280 285
Lys Trp Glu Asn Tyr Val Thr Pro Glu Gln Ala Val Pro Glu His Ala
290 295 300
Leu Ser Ile Pro Trp Glu Ser Cys Ile Thr Met Gly Asp Ser Phe Ser
305 310 315 320
Tyr Val Pro Asn Asp Asn Tyr Lys Ser Ser Gln Lys Ile Ile Glu Thr
325 330 335
Leu Ile Arg Ile Ile Ser Arg Gly Gly Asn Tyr Leu Met Asn Ile Ala
340 345 350
Pro Gly Pro Asn Gly Asp Tyr Asp Ala Val Val Tyr Glu Arg Leu Lys
355 360 365
Glu Ile Ser Gly Trp Met Asp Lys Asn Gln Ser Ala Val Phe Thr Thr
370 375 380
Arg Ala Leu Ala Pro Tyr His Glu Ser Asp Phe Tyr Tyr Thr Gln Ser
385 390 395 400
Lys Asp Gly Lys Ile Val Asn Val Phe His Ile Ser Glu Lys Ser Asn
405 410 415
Tyr Gln Ala Pro Ser Glu Leu Ser Phe Ser Ile Pro Glu Asn Ile Asn
420 425 430
Pro Lys Thr Val Lys Val Leu Gly Ile Ser Ser Gln Ile Lys Trp Lys
435 440 445
Lys Lys Gly Asn Lys Ile His Val Gln Leu Pro Glu Glu Arg Thr Lys
450 455 460
Leu Asn Tyr Ser Thr Val Ile Gln Ile Thr Gln His His His His His
465 470 475 480
His
<210> 8
<211> 1446
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 8
atgagagtgc ctgctcagct gctgggactg ctgctgctgt ggctgcctgg cgctagatgc 60
cacaatgtgt ccgagggcta cgagaagccc gccgaccctc tggtggtgca gaacctggaa 120
cagtggcagg acctgaagtt cggcctgttc atgcactggg gcacctactc ccagtggggc 180
atcgtggaat cctggtccct gtgccctgag gacgagtctt ggacccagcg gaagcctgag 240
cacggcaagt cctacaacga gtacgtgaag aactacgaga acctgcagac caccttcaac 300
cccgtgcagt tcaaccccca gaagtgggcc gacgccacca agaaagccgg catgaaatac 360
gtggtgttca ccaccaagca ccacgacggc ttcgccatgt tcgacaccaa gcagtccgac 420
tacaagatca cctcctccaa gacccccttc agcaagaacc ccaaggccga cgtggccaaa 480
gagattttca acaccttccg ggacaacggc ttccggatcg gcgcctactt ctccaagcct 540
gactggcact ccgacgacta ctggtggtcc tacttcccac ccaaggaccg gaacgtgaac 600
tacgaccctc agaaataccc cgccagatgg gagaacttca agaagttcac cttcaatcag 660
ctgaacgaga tcaccagcaa ctacggcaag atcgacatcc tgtggctgga cggcggatgg 720
gtgcgaccct tccacaccat cgaccccaac atcgagtggc agcggaccat caaggtggaa 780
caggacatcg acatggacaa gatcggcacc atggcccgga agaaccagcc cggcatcatc 840
atcgtggacc ggaccgtgcc tggcaagtgg gagaattacg tgacccccga gcaggccgtg 900
cctgagcatg ccctgtctat cccttgggag tcctgtatca caatgggcga cagcttctcc 960
tacgtgccca acgacaacta caagtcctcc cagaagatca tcgagacact gatcaggatc 1020
atctccagag gcggcaacta cctgatgaat atcgcccctg gccccaacgg cgactacgac 1080
gctgtggtgt acgagcggct gaaagaaatc tccggctgga tggataagaa ccagtccgcc 1140
gtgtttacca cccgggctct ggccccttac cacgagtccg acttctacta cacccagtcc 1200
aaggacggaa agatcgtgaa cgtgttccac atctccgaga agtccaacta ccaggccccc 1260
tccgagctgt ccttcagcat ccccgagaac atcaacccca agaccgtgaa ggtgctgggc 1320
atctccagcc agatcaagtg gaagaagaag ggcaacaaga tccacgtgca gctgcccgag 1380
gaacggacca agctgaacta ctccaccgtg atccagatca cccagcacca ccaccatcac 1440
cactga 1446
<210> 9
<211> 440
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 9
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys Gln Gln Lys Tyr Gln Pro Thr Glu Ala Asn Leu Lys
20 25 30
Ala Arg Ser Glu Phe Gln Asp Asn Lys Phe Gly Ile Phe Leu His Trp
35 40 45
Gly Leu Tyr Ala Met Leu Ala Thr Gly Glu Trp Thr Met Thr Asn Asn
50 55 60
Asn Leu Asn Tyr Lys Glu Tyr Ala Lys Leu Ala Gly Gly Phe Tyr Pro
65 70 75 80
Ser Lys Phe Asp Ala Asp Lys Trp Val Ala Ala Ile Lys Ala Ser Gly
85 90 95
Ala Lys Tyr Ile Cys Phe Thr Thr Arg His His Glu Gly Phe Ser Met
100 105 110
Phe Asp Thr Lys Tyr Ser Asp Tyr Asn Ile Val Lys Ala Thr Pro Phe
115 120 125
Lys Arg Asp Val Val Lys Glu Leu Ala Asp Ala Cys Ala Lys His Gly
130 135 140
Ile Lys Leu His Phe Tyr Tyr Ser His Ile Asp Trp Tyr Arg Glu Asp
145 150 155 160
Ala Pro Gln Gly Arg Thr Gly Arg Arg Thr Gly Arg Pro Asn Pro Lys
165 170 175
Gly Asp Trp Lys Ser Tyr Tyr Gln Phe Met Asn Asn Gln Leu Thr Glu
180 185 190
Leu Leu Thr Asn Tyr Gly Pro Ile Gly Ala Ile Trp Phe Asp Gly Trp
195 200 205
Trp Asp Gln Asp Ile Asn Pro Asp Phe Asp Trp Glu Leu Pro Glu Gln
210 215 220
Tyr Ala Leu Ile His Arg Leu Gln Pro Ala Cys Leu Val Gly Asn Asn
225 230 235 240
His His Gln Thr Pro Phe Ala Gly Glu Asp Ile Gln Ile Phe Glu Arg
245 250 255
Asp Leu Pro Gly Glu Asn Thr Ala Gly Leu Ser Gly Gln Ser Val Ser
260 265 270
His Leu Pro Leu Glu Thr Cys Glu Thr Met Asn Gly Met Trp Gly Tyr
275 280 285
Lys Ile Thr Asp Gln Asn Tyr Lys Ser Thr Lys Thr Leu Ile His Tyr
290 295 300
Leu Val Lys Ala Ala Gly Lys Asp Ala Asn Leu Leu Met Asn Ile Gly
305 310 315 320
Pro Gln Pro Asp Gly Glu Leu Pro Glu Val Ala Val Gln Arg Leu Lys
325 330 335
Glu Val Gly Glu Trp Met Ser Lys Tyr Gly Glu Thr Ile Tyr Gly Thr
340 345 350
Arg Gly Gly Leu Val Ala Pro His Asp Trp Gly Val Thr Thr Gln Lys
355 360 365
Gly Asn Lys Leu Tyr Val His Ile Leu Asn Leu Gln Asp Lys Ala Leu
370 375 380
Phe Leu Pro Ile Val Asp Lys Lys Val Lys Lys Ala Val Val Phe Ala
385 390 395 400
Asp Lys Thr Pro Val Arg Phe Thr Lys Asn Lys Glu Gly Ile Val Leu
405 410 415
Glu Leu Ala Lys Val Pro Thr Asp Val Asp Tyr Val Val Glu Leu Thr
420 425 430
Ile Asp His His His His His His
435 440
<210> 10
<211> 1323
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 10
atgagagtgc ctgctcagct gctgggactg ctgctgctgt ggctgcctgg tgctagatgc 60
cagcagaagt accagcccac cgaggccaac ctgaaggcca gatccgagtt ccaggacaac 120
aagttcggca tcttcctgca ctggggcctg tacgccatgc tggctactgg cgagtggacc 180
atgaccaaca acaacctgaa ctacaaagag tacgctaagc tggctggcgg cttctacccc 240
tccaagttcg acgccgacaa atgggtggcc gccatcaagg cctctggcgc caagtacatc 300
tgcttcacca cccggcacca cgagggcttc tccatgttcg acaccaagta ctccgactac 360
aacatcgtga aggccacccc cttcaagcgg gacgtcgtga aagagctggc cgacgcctgc 420
gctaagcacg gcatcaagct gcacttctac tactcccaca tcgactggta cagagaggac 480
gccccccagg gcagaaccgg cagaagaaca ggcagaccca accccaaggg cgactggaag 540
tcctactacc agtttatgaa caaccagctg accgagctgc tgaccaacta cggccccatc 600
ggcgccattt ggttcgacgg gtggtgggac caggacatca accccgactt cgactgggag 660
ctgcccgagc agtacgccct gatccacaga ctgcagcccg cctgtctcgt gggcaacaac 720
caccaccaga ccccctttgc cggcgaggac atccagattt tcgagcggga tctgcccggc 780
gagaacaccg ctggactgtc tggccagtcc gtgtcccatc tgcccctgga aacctgcgag 840
acaatgaacg gcatgtgggg ctacaagatc accgaccaga actacaagtc caccaagaca 900
ctgatccact acctcgtgaa agccgctggc aaggacgcca acctgctgat gaacatcggc 960
ccccagcctg acggcgagct gcctgaagtg gctgtgcagc ggctgaaaga agtgggagag 1020
tggatgtcta agtacggcga gactatctac ggcaccagag gcggcctggt ggcccctcat 1080
gattggggcg tgaccaccca gaagggcaac aagctgtacg tgcacatcct gaacctgcag 1140
gacaaggccc tgttcctgcc catcgtggac aagaaagtga agaaagccgt ggtgttcgcc 1200
gacaagaccc ccgtgcggtt caccaagaac aaagagggca tcgtgctgga actggccaag 1260
gtgcccaccg acgtggacta cgtggtggaa ctgaccatcg accaccatca tcaccaccac 1320
tga 1323
<210> 11
<211> 920
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 11
Met Arg Val Pro Ala Gln Leu Leu Gly Leu Leu Leu Leu Trp Leu Pro
1 5 10 15
Gly Ala Arg Cys Ala Gln His Asp Ser Leu Ile Arg Val Lys Ala Glu
20 25 30
Asp Lys Val Val Gln Thr Ser Pro Ser Val Ser Ala Ile Asp Asp Leu
35 40 45
His Tyr Leu Ser Glu Asn Ser Lys Lys Glu Phe Lys Glu Gly Leu Ser
50 55 60
Lys Ala Gly Glu Val Pro Glu Lys Leu Lys Asp Ile Leu Ser Lys Ala
65 70 75 80
Gln Gln Ala Asp Lys Gln Ala Lys Val Leu Ala Glu Met Lys Val Pro
85 90 95
Glu Lys Ile Ala Met Lys Pro Leu Lys Gly Pro Leu Tyr Gly Gly Tyr
100 105 110
Phe Arg Thr Trp His Asp Lys Thr Ser Asp Pro Ala Glu Lys Asp Lys
115 120 125
Val Asn Ser Met Gly Glu Leu Pro Lys Glu Val Asp Leu Ala Phe Val
130 135 140
Phe His Asp Trp Thr Lys Asp Tyr Ser Leu Phe Trp Gln Glu Leu Ala
145 150 155 160
Thr Lys His Val Pro Thr Leu Asn Lys Gln Gly Thr Arg Val Ile Arg
165 170 175
Thr Ile Pro Trp Arg Phe Leu Ala Gly Gly Asp His Ser Gly Ile Ala
180 185 190
Glu Asp Thr Gln Lys Tyr Pro Asn Thr Pro Glu Gly Asn Lys Ala Leu
195 200 205
Ala Lys Ala Ile Val Asp Glu Tyr Val Tyr Lys Tyr Asn Leu Asp Gly
210 215 220
Leu Asp Val Asp Ile Glu Arg Asp Ser Ile Pro Lys Val Asn Gly Lys
225 230 235 240
Glu Ser Asn Glu Asn Ile Gln Arg Ser Ile Ala Val Phe Glu Glu Ile
245 250 255
Gly Lys Leu Ile Gly Pro Lys Gly Ala Asp Lys Ser Arg Leu Phe Ile
260 265 270
Met Asp Ser Thr Tyr Met Ala Asp Lys Asn Pro Leu Ile Glu Arg Gly
275 280 285
Ala Pro Tyr Ile Asp Leu Leu Leu Val Gln Val Tyr Gly Ile Gln Gly
290 295 300
Glu Lys Gly Asp Trp Asp Pro Val Ala Arg Lys Pro Glu Lys Thr Met
305 310 315 320
Glu Glu Arg Trp Glu Ser Tyr Ser Lys Tyr Ile Arg Pro Glu Gln Tyr
325 330 335
Met Val Gly Phe Ser Phe Tyr Glu Glu Asn Ala Gly Ser Gly Asn Leu
340 345 350
Trp Tyr Asp Ile Asn Glu Arg Lys Asp Asp His Asn Pro Leu Asn Ser
355 360 365
Glu Ile Ala Gly Thr Arg Ala Glu Arg Tyr Ala Lys Trp Gln Pro Lys
370 375 380
Thr Gly Gly Val Lys Gly Gly Ile Phe Ser Tyr Ala Ile Asp Arg Asp
385 390 395 400
Gly Val Ala His Gln Pro Lys Lys Val Ser Asp Asp Glu Lys Arg Thr
405 410 415
Asn Lys Ala Ile Lys Asp Ile Thr Asp Gly Ile Val Lys Ser Asp Tyr
420 425 430
Lys Val Ser Lys Ala Leu Lys Lys Val Met Glu Asn Asp Lys Ser Tyr
435 440 445
Glu Leu Ile Asp Gln Lys Asp Phe Pro Asp Lys Ala Leu Arg Glu Ala
450 455 460
Val Ile Ala Gln Val Gly Ser Arg Arg Gly Asp Leu Glu Arg Phe Asn
465 470 475 480
Gly Thr Leu Arg Leu Asp Asn Pro Asp Ile Lys Ser Leu Glu Gly Leu
485 490 495
Asn Lys Leu Lys Lys Leu Ala Lys Leu Glu Leu Ile Gly Leu Ser Gln
500 505 510
Ile Thr Lys Leu Asp Ser Ser Val Leu Pro Glu Asn Ile Lys Pro Thr
515 520 525
Lys Asp Thr Leu Val Ser Val Leu Glu Thr Tyr Lys Asn Asp Asp Arg
530 535 540
Lys Glu Glu Ala Lys Ala Ile Pro Gln Val Ala Leu Thr Ile Ser Gly
545 550 555 560
Leu Thr Gly Leu Lys Glu Leu Asn Leu Ala Gly Phe Asp Arg Asp Ser
565 570 575
Leu Ala Gly Ile Asp Ala Ala Ser Leu Thr Ser Leu Glu Lys Val Asp
580 585 590
Leu Ser Lys Asn Lys Leu Asp Leu Ala Ala Gly Thr Glu Asn Arg Gln
595 600 605
Ile Phe Asp Val Met Leu Ser Thr Val Ser Asn Arg Val Gly Ser Asn
610 615 620
Glu Gln Thr Val Thr Phe Asp His Gln Lys Pro Thr Gly His Tyr Pro
625 630 635 640
Asn Thr Tyr Gly Thr Thr Ser Leu Arg Leu Pro Val Gly Glu Gly Lys
645 650 655
Ile Asp Leu Gln Ser Gln Leu Leu Phe Gly Thr Val Thr Asn Gln Gly
660 665 670
Thr Leu Ile Asn Ser Glu Ala Asp Tyr Lys Ala Tyr Gln Glu Gln Leu
675 680 685
Ile Ala Gly Arg Arg Phe Val Asp Pro Gly Tyr Ala Tyr Lys Asn Phe
690 695 700
Ala Val Thr Tyr Asp Ala Tyr Lys Val Arg Val Thr Asp Ser Thr Leu
705 710 715 720
Gly Val Thr Asp Glu Lys Lys Leu Ser Thr Ser Lys Glu Glu Thr Tyr
725 730 735
Lys Val Glu Phe Phe Ser Pro Thr Asn Gly Thr Lys Pro Val His Glu
740 745 750
Ala Lys Val Val Val Gly Ala Glu Lys Thr Met Met Val Asn Leu Ala
755 760 765
Ala Gly Ala Thr Val Ile Lys Ser Asp Ser His Glu Asn Ala Lys Lys
770 775 780
Val Phe Asp Gly Ala Ile Glu Tyr Asn Pro Leu Ser Phe Ser Ser Lys
785 790 795 800
Thr Ser Ile Thr Phe Glu Phe Lys Glu Pro Gly Leu Val Lys Tyr Trp
805 810 815
Arg Phe Phe Asn Asp Ile Thr Arg Lys Asp Asp Tyr Ile Lys Glu Ala
820 825 830
Lys Leu Glu Ala Phe Val Gly His Leu Glu Asp Asp Ser Lys Val Lys
835 840 845
Asp Ser Leu Glu Lys Ser Thr Glu Trp Val Thr Val Ser Asp Tyr Ser
850 855 860
Gly Glu Ala Gln Glu Phe Ser Gln Pro Leu Asp Asn Ile Ser Ala Lys
865 870 875 880
Tyr Trp Arg Val Thr Val Asp Thr Lys Gly Gly Arg Tyr Ser Ser Pro
885 890 895
Ser Leu Pro Glu Leu Gln Ile Leu Gly Tyr Arg Leu Pro Leu Thr His
900 905 910
Asp Tyr Lys Asp Asp Asp Asp Lys
915 920
<210> 12
<211> 2763
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 12
atgagagtgc ctgctcagct gctgggcctg ctgctgctgt ggctgcctgg tgctagatgc 60
gcccagcacg actccctgat cagagtgaag gccgaggaca aggtggtgca gacctcccct 120
tccgtgtccg ccatcgacga cctgcactac ctgtccgaga actccaagaa agagttcaaa 180
gagggcctgt ccaaggccgg cgaggtgccc gaaaagctga aggacatcct gagcaaggct 240
cagcaggccg acaagcaggc caaggtgctg gccgagatga aggtgccaga gaagatcgcc 300
atgaagcccc tgaagggccc tctgtacggc ggctacttca gaacctggca cgacaagacc 360
tccgaccccg ccgagaagga caaagtgaac tccatgggcg agctgcccaa agaggtggac 420
ctggccttcg tgttccacga ctggaccaag gactactccc tgttctggca ggaactggcc 480
accaagcacg tgcccaccct gaacaagcag ggcaccagag tgatccggac aatcccctgg 540
cggtttctgg ctggcggcga ccactctgga atcgccgagg atacccagaa gtaccccaac 600
acccccgagg gcaacaaggc cctggctaag gccatcgtgg acgagtacgt gtacaagtac 660
aacctggacg gcctggacgt ggacatcgag cgggactcca tccctaaagt gaacggcaaa 720
gagtccaacg agaacatcca gcggtctatc gccgtgttcg aggaaatcgg caagctgatc 780
ggccccaagg gcgccgacaa gtcccggctg ttcatcatgg actccaccta catggccgat 840
aagaaccccc tgatcgagag aggcgcccct tacatcgatc tgctgctggt gcaggtgtac 900
ggcatccagg gcgagaaggg cgattgggac cctgtggccc ggaagcctga aaagaccatg 960
gaagagagat gggagtccta ctccaagtac atccggcccg agcagtatat ggtgggattc 1020
agcttctacg aggaaaacgc cggctccggc aacctgtggt acgacatcaa cgagcggaag 1080
gacgaccaca accctctgaa ctccgagatc gccggcaccc gggctgagag atacgctaag 1140
tggcagccca agaccggcgg agtgaagggc ggcatcttct cctacgccat cgatagggat 1200
ggcgtggccc accagcctaa gaaggtgtcc gacgacgaga agcggaccaa caaggctatc 1260
aaggacatca ccgacggcat cgtgaagtcc gactacaagg tgtccaaagc cctgaagaaa 1320
gtgatggaaa acgacaagag ctacgagctg atcgaccaga aggacttccc cgataaggcc 1380
ctgcgcgagg ccgtgattgc tcaagtgggc tccagacggg gcgacctgga aagattcaac 1440
ggcaccctgc ggctggacaa ccccgacatc aagtccctgg aaggcctgaa caaactgaag 1500
aagctggcca agctggaact gatcggactg tcccagatca caaagctgga ctcctccgtg 1560
ctgcctgaga acatcaagcc caccaaggac accctggtgt ccgtgctgga aacctacaag 1620
aacgacgacc ggaaagagga agccaaggcc atccctcagg tggccctgac catctctggc 1680
ctgaccggcc tgaaagagct gaatctggcc ggcttcgacc gggattccct ggctggaatc 1740
gatgccgcct ctctgacctc cctggaaaaa gtggacctgt ctaagaacaa gctggatctg 1800
gctgccggca ccgagaaccg gcagatcttc gacgtgatgc tgtccaccgt gtccaacaga 1860
gtgggcagca acgagcagac cgtgaccttc gaccaccaga agcccaccgg ccactaccct 1920
aacacctacg gcaccacctc cctgagactg cctgtgggcg agggcaagat cgacctgcag 1980
tcccagctgc tgttcggcac cgtgaccaac cagggcacac tgatcaactc cgaggccgat 2040
tacaaggcct accaggaaca gctgatcgct gggcggagat tcgtggaccc tggctacgct 2100
tacaagaact tcgccgtgac ctacgatgcc tacaaagtgc gcgtgaccga ctccaccctg 2160
ggcgtgacag acgaaaagaa gctgagcacc tccaaagaag agacatacaa ggtggaattc 2220
ttctccccca ccaatggcac caagcctgtg catgaggcta aggtggtcgt gggcgccgag 2280
aaaaccatga tggtcaacct ggccgctggc gccaccgtga tcaagtctga ctctcacgag 2340
aatgccaaaa aggtgttcga cggcgccatc gagtacaatc ctctgagctt ctccagcaag 2400
accagcatca ccttcgagtt taaagaaccc ggcctcgtga aatactggcg gttcttcaac 2460
gatatcaccc gcaaggacga ctacatcaaa gaggctaagc tggaagcctt cgtgggccat 2520
ctggaagatg actccaaagt gaaggactct ctggaaaagt ccaccgagtg ggtcaccgtg 2580
tctgactact ctggcgaggc ccaggaattc tcccagcccc tggacaacat ctccgccaag 2640
tattggagag tgaccgtgga caccaagggc ggacggtaca gctctcctag cctgcccgag 2700
ctgcagatcc tgggctacag actgcctctg acccacgact ataaggacga cgacgacaaa 2760
tga 2763