亚洲成年人黄色一级片,日本香港三级亚洲三级,黄色成人小视频,国产青草视频,国产一区二区久久精品,91在线免费公开视频,成年轻人网站色直接看

具有抑制癌細胞生長功能的新的人蛋白及其編碼序列的制作方法

文檔序號:3475020閱讀:5796來源:國知局
專利名稱:具有抑制癌細胞生長功能的新的人蛋白及其編碼序列的制作方法
技術領域
本發(fā)明屬于生物技術領域,具體地說,本發(fā)明涉及新的編碼具有抑癌功能(即抑制癌細胞生長)的人蛋白的多核苷酸,以及此多核苷酸編碼的多肽。本發(fā)明還涉及此多核苷酸和多肽的用途和制備。
人基因組學研究目前是國際上的熱點,除人染色體DNA大規(guī)模測序,表達序列測序(EST)的方法外,還缺少從功能開始的篩選具有功能基因的高通量的方法。
癌癥是危害人類健康的主要疾病之一。為了有效地治療和預防腫瘤,目前人們已越來越關注腫瘤的基因治療。因此,本領域迫切需要開發(fā)研究具有抑癌功能的人蛋白及其激動劑/抑制劑。
本發(fā)明的目的是提供一類新的具有抑癌功能的人蛋白多肽以及其片段、類似物和衍生物。
本發(fā)明的另一目的是提供編碼這些多肽的多核苷酸。
本發(fā)明的另一目的是提供生產(chǎn)這些多肽的方法以及該多肽和編碼序列的用途。
在本發(fā)明的第一方面,提供新穎的分離出的具有抑癌功能的蛋白多肽,它包含具有選自下組的氨基酸序列的多肽SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ IDNO:11、SEQ ID NO:14、SEQ ID NO:17、SEQ ID NO:20、SEQ ID NO:23、SEQ ID NO:26、SEQ ID NO:29、SEQ ID NO:32、SEQ ID NO:35、SEQ ID NO:38、SEQ ID NO:41、SEQ IDNO:44、SEQ ID NO:47;或其保守性變異多肽、或其活性片段、或其活性衍生物。
較佳地,該多肽是具有選自下組的氨基酸序列的多肽SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ ID NO:11、SEQ ID NO:14、SEQ ID NO:17、SEQ ID NO:20、SEQ IDNO:23、SEQ ID NO:26、SEQ ID NO:29、SEQ ID NO:32、SEQ ID NO:35、SEQ ID NO:38、SEQ ID NO:41、SEQ ID NO:44、SEQ ID NO:47。
在本發(fā)明的第二方面,提供了一種分離的多核苷酸,它包含一核苷酸序列,該核苷酸序列與選自下組的一種核苷酸序列有至少85%相同性(a)編碼上述的具有抑癌功能的蛋白多肽的多核苷酸;(b)與多核苷酸(a)互補的多核苷酸。較佳地,該多核苷酸編碼的多肽具有選自下組的氨基酸序列SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ IDNO:11、SEQ ID NO:14、SEQ ID NO:17、SEQ ID NO:20、SEQ ID NO:23、SEQID NO:26、SEQ ID NO:29、SEQ ID NO:32、SEQ ID NO:35、SEQ ID NO:38、SEQ ID NO:41、SEQ IDNO:44、SEQ ID NO:47。更佳地,該多核苷酸的序列選自下組SEQ ID NO:3、SEQ IDNO:6、SEQ ID NO:9、SEQ ID NO:12、SEQ ID NO:15、SEQ ID NO:18、SEQ ID NO:21、SEQ ID NO:24、SEQ ID NO:27、SEQ ID NO:30、SEQ ID NO:33、SEQ ID NO:36、SEQ IDNO:39、SEQ ID NO:42、SEQ ID NO:45、SEQ ID NO:48的編碼區(qū)序列或全長序列。
在本發(fā)明的第三方面,提供了含有上述多核苷酸的載體,以及被該載體轉(zhuǎn)化或轉(zhuǎn)導的宿主細胞或者被上述多核苷酸直接轉(zhuǎn)化或轉(zhuǎn)導的宿主細胞。
在本發(fā)明的第四方面,提供了制備具有具有抑癌功能的蛋白活性的多肽的制備方法,該方法包含(a)在適合表達具有抑癌功能的蛋白的條件下,培養(yǎng)上述被轉(zhuǎn)化或轉(zhuǎn)導的宿主細胞;(b)從培養(yǎng)物中分離出具有具有抑癌功能的蛋白活性的多肽。
在本發(fā)明的第五方面,提供了與上述的具有抑癌功能的蛋白多肽特異性結(jié)合的抗體。還提供了可用于檢測的核酸分子,它含有上述的多核苷酸中連續(xù)的10-800個核苷酸。
在本發(fā)明的第六方面,提供了一種藥物組合物,它含有安全有效量的本發(fā)明的具有抑癌功能的蛋白多肽以及藥學上可接受的載體。這些藥物組合物可治療癌癥以及細胞異常增殖等病癥。
本發(fā)明的其它方面由于本文的技術的公開,對本領域的技術人員而言是顯而易見的。
本發(fā)明采用大規(guī)模cDNA克隆轉(zhuǎn)染癌細胞,在獲得具有抑癌作用的基礎上,經(jīng)測序證明為新的基因,進一步得到全長cDNA克隆。DNA轉(zhuǎn)染試驗證明,本發(fā)明的具有抑癌功能的蛋白對癌細胞(肝癌細胞)具有抑制克隆形成的作用,其抑制率在50%或50%以上。
如本文所用,“分離的”是指物質(zhì)從其原始環(huán)境中分離出來(如果是天然的物質(zhì),原始環(huán)境即是天然環(huán)境)。如活體細胞內(nèi)的天然狀態(tài)下的多聚核苷酸和多肽是沒有分離純化的,但同樣的多聚核苷酸或多肽如從天然狀態(tài)中同存在的其他物質(zhì)中分開,則為分離純化的。
如本文所用,“分離的具有抑癌功能的蛋白或多肽”是指具有抑癌功能的蛋白多肽基本上不含天然與其相關的其它蛋白、脂類、糖類或其它物質(zhì)。本領域的技術人員能用標準的蛋白質(zhì)純化技術純化具有抑癌功能的蛋白?;旧霞兊亩嚯脑诜沁€原聚丙烯酰胺凝膠上能產(chǎn)生單一的主帶。具有抑癌功能的蛋白多肽的純度能用氨基酸序列分析。
本發(fā)明的多肽可以是重組多肽、天然多肽、合成多肽,優(yōu)選重組多肽。本發(fā)明的多肽可以是天然純化的產(chǎn)物,或是化學合成的產(chǎn)物,或使用重組技術從原核或真核宿主(例如,細菌、酵母、高等植物、昆蟲和哺乳動物細胞)中產(chǎn)生。根據(jù)重組生產(chǎn)方案所用的宿主,本發(fā)明的多肽可以是糖基化的,或可以是非糖基化的。本發(fā)明的多肽還可包括或不包括起始的甲硫氨酸殘基。
本發(fā)明還包括具有抑癌功能的人蛋白的片段、衍生物和類似物。如本文所用,術語“片段”、“衍生物”和“類似物”是指基本上保持本發(fā)明的天然具有抑癌功能的人蛋白相同的生物學功能或活性的多肽。本發(fā)明的多肽片段、衍生物或類似物可以是(ⅰ)有一個或多個保守或非保守性氨基酸殘基(優(yōu)選保守性氨基酸殘基)被取代的多肽,而這樣的取代的氨基酸殘基可以是也可以不是由遺傳密碼編碼的,或(ⅱ)在一個或多個氨基酸殘基中具有取代基團的多肽,或(ⅲ)成熟多肽與另一個化合物(比如延長多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(ⅳ)附加的氨基酸序列融合到此多肽序列而形成的多肽(如前導序列或分泌序列或用來純化此多肽的序列或蛋白原序列)。根據(jù)本文的教導,這些片段、衍生物和類似物屬于本領域熟練技術人員公知的范圍。
本發(fā)明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因組DNA或人工合成的DNA。DNA可以是單鏈的或是雙鏈的。DNA可以是編碼鏈或非編碼鏈。以PP1224蛋白(在本申請中,蛋白質(zhì)的命名采用其克隆編號)(在本申請中,蛋白質(zhì)的命名采用其克隆編號)為例,編碼成熟多肽的編碼區(qū)序列可以與SEQ ID NO:3所示的編碼區(qū)序列相同或者是簡并的變異體。如本文所用,“簡并的變異體”在本發(fā)明中是指編碼具有SEQ ID NO:2的蛋白質(zhì),但與SEQ ID NO:3所示的編碼區(qū)序列有差別的核酸序列。以PP265蛋白(在本申請中,蛋白質(zhì)的命名采用其克隆編號)(在本申請中,蛋白質(zhì)的命名采用其克隆編號)為例,編碼成熟多肽的編碼區(qū)序列可以與SEQ ID NO:6所示的編碼區(qū)序列相同或者是簡并的變異體。如本文所用,“簡并的變異體”在本發(fā)明中是指編碼具有SEQ ID NO:5的蛋白質(zhì),但與SEQ ID NO:6所示的編碼區(qū)序列有差別的核酸序列。對于其他具有抑癌功能的蛋白,可依此類推。對于其他具有抑癌功能的蛋白,可依此類推。
編碼成熟多肽的多核苷酸包括只編碼成熟多肽的編碼序列;成熟多肽的編碼序列和各種附加編碼序列;成熟多肽的編碼序列(和任選的附加編碼序列)以及非編碼序列。
術語“編碼多肽的多核苷酸”可以是包括編碼此多肽的多核苷酸,也可以是還包括附加編碼和/或非編碼序列的多核苷酸。
本發(fā)明還涉及上述多核苷酸的變異體,其編碼與本發(fā)明有相同的氨基酸序列的多肽或多肽的片段、類似物和衍生物。此多核苷酸的變異體可以是天然發(fā)生的等位變異體或非天然發(fā)生的變異體。這些核苷酸變異體包括取代變異體、缺失變異體和插入變異體。如本領域所知的,等位變異體是一個多核苷酸的替換形式,它可能是一個或多個核苷酸的取代、缺失或插入,但不會從實質(zhì)上改變其編碼的多肽的功能。
本發(fā)明還涉及與上述的序列雜交且兩個序列之間具有至少50%,較佳地至少70%,更佳地至少80%相同性的多核苷酸。本發(fā)明特別涉及在嚴格條件下與本發(fā)明所述多核苷酸可雜交的多核苷酸。在本發(fā)明中,“嚴格條件”是指(1)在較低離子強度和較高溫度下的雜交和洗脫,如0.2×SSC,0.1%SDS,60℃;或(2)雜交時加有變性劑,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)僅在兩條序列之間的相同性至少在95%以上,更好是97%以上時才發(fā)生雜交。并且,可雜交的多核苷酸編碼的多肽與SEQ IDNO:2所示的成熟多肽有相同的生物學功能和活性。
本發(fā)明還涉及與上述的序列雜交的核酸片段。如本文所用,“核酸片段”的長度至少含15個核苷酸,較好是至少30個核苷酸,更好是至少50個核苷酸,最好是至少100個核苷酸以上。核酸片段可用于核酸的擴增技術(如PCR)以確定和/或分離編碼具有抑癌功能的蛋白的多聚核苷酸。
本發(fā)明中的多肽和多核苷酸優(yōu)選以分離的形式提供,更佳地被純化至均質(zhì)。
本發(fā)明的DNA序列能用幾種方法獲得。例如,用本領域熟知的雜交技術分離DNA。這些技術包括但不局限于1)用探針與基因組或cDNA文庫雜交以檢出同源性核苷酸序列,和2)表達文庫的抗體篩選以檢出具有共同結(jié)構特征的克隆的DNA片段。
編碼具有抑癌功能的蛋白的特異DNA片段序列產(chǎn)生也能用下列方法獲得1)從基因組DNA分離雙鏈DNA序列;2)化學合成DNA序列以獲得所需多肽的雙鏈DNA。
上述提到的方法中,分離基因組DNA最不常用。當需要的多肽產(chǎn)物的整個氨基酸序列已知時,DNA序列的直接化學合成是經(jīng)常選用的方法。如果所需的氨基酸的整個序列不清楚時,DNA序列的直接化學合成是不可能的,選用的方法是cDNA序列的分離。分離感興趣的cDNA的標準方法是從高表達該基因的供體細胞分離mRNA并進行逆轉(zhuǎn)錄,形成質(zhì)粒或噬菌體cDNA文庫。提取mRNA的方法已有多種成熟的技術,試劑盒也可從商業(yè)途徑獲得(Qiagene)。而構建cDNA文庫也是通常的方法(Sambrook,et al.,Molecular Cloning,A Laboratory Manual,Cold Spring Harbor Laboratory.New York,1989)。還可得到商業(yè)供應的cDNA文庫,如Clontech公司的不同cDNA文庫。當結(jié)合使用聚合酶反應技術時,即使極少的表達產(chǎn)物也能克隆。
可用常規(guī)方法從這些cDNA文庫中篩選本發(fā)明的基因。這些方法包括(但不限于)(1)DNA-DNA或DNA-RNA雜交;(2)標志基因的功能出現(xiàn)或喪失;(3)測定具有抑癌功能的蛋白的轉(zhuǎn)錄本的水平;(4)通過免疫學技術或測定生物學活性,來檢測基因表達的蛋白產(chǎn)物。上述方法可單用,也可多種方法聯(lián)合應用。
在第(1)種方法中,雜交所用的探針是與本發(fā)明的多核苷酸的任何一部分同源,其長度至少15個核苷酸,較好是至少30個核苷酸,更好是至少50個核苷酸,最好是至少100個核苷酸。此外,探針的長度通常在2kb之內(nèi),較佳地為1kb之內(nèi)。此處所用的探針通常是在本發(fā)明的基因DNA序列信息的基礎上化學合成的DNA序列。本發(fā)明的基因本身或者片段當然可以用作探針。DNA探針的標記可用放射性同位素,熒光素或酶(如堿性磷酸酶)等。
在第(4)種方法中,檢測具有抑癌功能的蛋白基因表達的蛋白產(chǎn)物可用免疫學技術如Western印跡法,放射免疫沉淀法,酶聯(lián)免疫吸附法(ELISA)等。
應用PCR技術擴增DNA/RNA的方法(Saiki,et al.Science 1985;230:1350-1354)被優(yōu)選用于獲得本發(fā)明的基因。特別是很難從文庫中得到全長的cDNA時,可優(yōu)選使用RACE法(RACE-cDNA末端快速擴增法),用于PCR的引物可根據(jù)本文所公開的本發(fā)明的序列信息適當?shù)剡x擇,并可用常規(guī)方法合成??捎贸R?guī)方法如通過凝膠電泳分離和純化擴增的DNA/RNA片段。
如上所述得到的本發(fā)明的基因,或者各種DNA片段等的核苷酸序列的測定可用常規(guī)方法如雙脫氧鏈終止法(Sanger et al.PNAS,1977,74:5463-5467)。這類核苷酸序列測定也可用商業(yè)測序試劑盒等。為了獲得全長的cDNA序列,測序需反復進行。有時需要測定多個克隆的cDNA序列,才能拼接成全長的cDNA序列。
本發(fā)明也涉及包含本發(fā)明的多核苷酸的載體,以及用本發(fā)明的載體或具有抑癌功能的蛋白編碼序列經(jīng)基因工程產(chǎn)生的宿主細胞,以及經(jīng)重組技術產(chǎn)生本發(fā)明所述多肽的方法。
通過常規(guī)的重組DNA技術,可利用本發(fā)明的多聚核苷酸序列可用來表達或生產(chǎn)重組的具有抑癌功能的蛋白多肽(Science,1984;224:1431)。一般來說有以下步驟
(1).用本發(fā)明的編碼具有抑癌功能的人蛋白的多核苷酸(或變異體),或用含有該多核苷酸的重組表達載體轉(zhuǎn)化或轉(zhuǎn)導合適的宿主細胞;(2).在合適的培養(yǎng)基中培養(yǎng)的宿主細胞;(3).從培養(yǎng)基或細胞中分離、純化蛋白質(zhì)。
本發(fā)明中,具有抑癌功能的人蛋白多核苷酸序列可插入到重組表達載體中。術語“重組表達載體”指本領域熟知的細菌質(zhì)粒、噬菌體、酵母質(zhì)粒、植物細胞病毒、哺乳動物細胞病毒如腺病毒、逆轉(zhuǎn)錄病毒或其他載體。在本發(fā)明中適用的載體包括但不限于在細菌中表達的基于T7的表達載體(Rosenberg,et al.Gene,1987,56:125);在哺乳動物細胞中表達的pMSXND表達載體(Lee and Nathans,J Bio Chem.263:3521,1988)和在昆蟲細胞中表達的來源于桿狀病毒的載體??傊灰茉谒拗黧w內(nèi)復制和穩(wěn)定,任何質(zhì)粒和載體都可以用。表達載體的一個重要特征是通常含有復制起點、啟動子、標記基因和翻譯控制元件。
本領域的技術人員熟知的方法能用于構建含具有抑癌功能的人蛋白編碼DNA序列和合適的轉(zhuǎn)錄/翻譯控制信號的表達載體。這些方法包括體外重組DNA技術、DNA合成技術、體內(nèi)重組技術等(Sambroook,et al.Molecular Cloning,a Laboratory Manual,coldSpring Harbor Laboratory.New York,1989)。所述的DNA序列可有效連接到表達載體中的適當啟動子上,以指導mRNA合成。這些啟動子的代表性例子有大腸桿菌的lac或trp啟動子;入噬菌體PL啟動子;真核啟動子包括CMV立即早期啟動子、HSV胸苷激酶啟動子、早期和晚期SV40啟動子、反轉(zhuǎn)錄病毒的LTRs和其他一些已知的可控制基因在原核或真核細胞或其病毒中表達的啟動子。表達載體還包括翻譯起始用的核糖體結(jié)合位點和轉(zhuǎn)錄終止子。
此外,表達載體優(yōu)選地包含一個或多個選擇性標記基因,以提供用于選擇轉(zhuǎn)化的宿主細胞的表型性狀,如真核細胞培養(yǎng)用的二氫葉酸還原酶、新霉素抗性以及綠色熒光蛋白(GFP),或用于大腸桿菌的四環(huán)素或氨芐青霉素抗性。
包含上述的適當DNA序列以及適當啟動子或者控制序列的載體,可以用于轉(zhuǎn)化適當?shù)乃拗骷毎?,以使其能夠表達蛋白質(zhì)。
宿主細胞可以是原核細胞,如細菌細胞;或是低等真核細胞,如酵母細胞;或是高等真核細胞,如哺乳動物細胞。代表性例子有大腸桿菌,鏈霉菌屬;鼠傷寒沙門氏菌的細菌細胞;真菌細胞如酵母;植物細胞;果蠅S2或Sf9的昆蟲細胞;CHO、COS或Bowes黑素瘤細胞的動物細胞等。
本發(fā)明的多核苷酸在高等真核細胞中表達時,如果在載體中插入增強子序列時將會使轉(zhuǎn)錄得到增強。增強子是DNA的順式作用因子,通常大約有10到300個堿基對,作用于啟動子以增強基因的轉(zhuǎn)錄??膳e的例子包括在復制起始點晚期一側(cè)的100到270個堿基對的SV40增強子、在復制起始點晚期一側(cè)的多瘤增強子以及腺病毒增強子等。
本領域一般技術人員都清楚如何選擇適當?shù)妮d體、啟動子、增強子和宿主細胞。
用重組DNA轉(zhuǎn)化宿主細胞可用本領域技術人員熟知的常規(guī)技術進行。當宿主為原核生物如大腸桿菌時,能吸收DNA的感受態(tài)細胞可在指數(shù)生長期后收獲,用CaCl2法處理,所用的步驟在本領域眾所周知??晒┻x擇的是用MgCl2。如果需要,轉(zhuǎn)化也可用電穿孔的方法進行。當宿主是真核生物,可選用如下的DNA轉(zhuǎn)染方法磷酸鈣共沉淀法,常規(guī)機械方法如顯微注射、電穿孔、脂質(zhì)體包裝等。
獲得的轉(zhuǎn)化子可以用常規(guī)方法培養(yǎng),表達本發(fā)明的基因所編碼的多肽。根據(jù)所用的宿主細胞,培養(yǎng)中所用的培養(yǎng)基可選自各種常規(guī)培養(yǎng)基。在適于宿主細胞生長的條件下進行培養(yǎng)。當宿主細胞生長到適當?shù)募毎芏群?,用合適的方法(如溫度轉(zhuǎn)換或化學誘導)誘導選擇的啟動子,將細胞再培養(yǎng)一段時間。
在上面的方法中的重組多肽可包被于細胞內(nèi)、細胞外或在細胞膜上表達或分泌到細胞外。如果需要,可利用其物理的、化學的和其它特性通過各種分離方法分離和純化重組的蛋白。這些方法是本領域技術人員所熟知的。這些方法的例子包括但并不限于常規(guī)的復性處理、用蛋白沉淀劑處理(鹽析方法)、離心、滲透破菌、超處理、超離心、分子篩層析(凝膠過濾)、吸附層析、離子交換層析、高效液相層析(HPLC)和其它各種液相層析技術及這些方法的結(jié)合。
重組的具有抑癌功能的人蛋白或多肽有多方面的用途。這些用途包括(但不限于)直接做為藥物治療具有抑癌功能的蛋白功能低下或喪失所致的疾病,和用于篩選促進或?qū)咕哂幸职┕δ艿牡鞍坠δ艿目贵w、多肽或其它配體。例如,抗體可用于激活或抑制具有抑癌功能的人蛋白的功能。用表達的重組具有抑癌功能的人蛋白篩選多肽庫可用于尋找有治療價值的能抑制或刺激具有抑癌功能的人蛋白功能的多肽分子。
本發(fā)明也提供了篩選藥物以鑒定提高(激動劑)或阻遏(拮抗劑)具有抑癌功能的人蛋白的藥劑的方法。激動劑提高具有抑癌功能的人蛋白刺激細胞增殖等生物功能,而拮抗劑阻止和治療與細胞過度增殖有關的紊亂如各種癌癥。例如,能在藥物的存在下,將哺乳動物細胞或表達具有抑癌功能的人蛋白的膜制劑與標記的具有抑癌功能的人蛋白一起培養(yǎng)。然后測定藥物提高或阻遏此相互作用的能力。
具有抑癌功能的人蛋白的拮抗劑包括篩選出的抗體、化合物、受體缺失物和類似物等。具有抑癌功能的人蛋白的拮抗劑可以與具有抑癌功能的人蛋白結(jié)合并消除其功能,或是抑制具有抑癌功能的人蛋白的產(chǎn)生,或是與多肽的活性位點結(jié)合使多肽不能發(fā)揮生物學功能。具有抑癌功能的人蛋白的拮抗劑可用于治療用途。
在篩選作為拮抗劑的化合物時,可以將具有抑癌功能的蛋白加入生物分析測定中,通過測定化合物影響具有抑癌功能的蛋白和其受體之間的相互作用來確定化合物是否是拮抗劑。用上述篩選化合物的同樣方法,可以篩選出起拮抗劑作用的受體缺失物和類似物。
本發(fā)明的多肽可直接用于疾病治療,例如,各種惡性腫瘤、和細胞異常增殖等。
本發(fā)明的多肽,及其片段、衍生物、類似物或它們的細胞可以用來作為抗原以生產(chǎn)抗體。這些抗體可以是多克隆或單克隆抗體。多克隆抗體可以通過將此多肽直接注射動物的方法得到。制備單克隆抗體的技術包括雜交瘤技術,三瘤技術,人B-細胞雜交瘤技術,EBV-雜交瘤技術等。
可以將本發(fā)明的多肽和拮抗劑與合適的藥物載體組合后使用。這些載體可以是水、葡萄糖、乙醇、鹽類、緩沖液、甘油以及它們的組合。組合物包含安全有效量的多肽或拮抗劑以及不影響藥物效果的載體和賦形劑。這些組合物可以作為藥物用于疾病治療。
本發(fā)明還提供含有一種或多種容器的藥盒或試劑盒,容器中裝有一種或多種本發(fā)明的藥用組合物成分。與這些容器一起,可以有由制造、使用或銷售藥品或生物制品的政府管理機構所給出的指示性提示,該提示反映出生產(chǎn)、使用或銷售的政府管理機構許可其在人體上施用。此外,本發(fā)明的多肽可以與其它的治療化合物結(jié)合使用。
藥物組合物可以以方便的方式給藥,如通過局部、靜脈內(nèi)、腹膜內(nèi)、肌內(nèi)、皮下、鼻內(nèi)或皮內(nèi)的給藥途徑。具有抑癌功能的蛋白以有效地治療和/或預防具體的適應癥的量來給藥。施用于患者的具有抑癌功能的蛋白的量和劑量范圍將取決于許多因素,如給藥方式、待治療者的健康條件和診斷醫(yī)生的判斷。
具有抑癌功能的人蛋白的多聚核苷酸也可用于多種治療目的?;蛑委熂夹g可用于治療由于具有抑癌功能的蛋白的無表達或異常/無活性的具有抑癌功能的蛋白的表達所致的細胞增殖、發(fā)育或代謝異常。重組的基因治療載體(如病毒載體)可設計成表達變異的具有抑癌功能的蛋白,以抑制內(nèi)源性的具有抑癌功能的蛋白活性。例如,一種變異的具有抑癌功能的蛋白可以是縮短的、缺失了信號傳導功能域的具有抑癌功能的蛋白,雖可與下游的底物結(jié)合,但缺乏信號傳導活性。因此重組的基因治療載體可用于治療具有抑癌功能的蛋白表達或活性異常所致的疾病。來源于病毒的表達載體如逆轉(zhuǎn)錄病毒、腺病毒、腺病毒相關病毒、單純皰疹病毒、細小病毒等可用于將具有抑癌功能的蛋白基因轉(zhuǎn)移至細胞內(nèi)。構建攜帶具有抑癌功能的蛋白基因的重組病毒載體的方法可見于已有文獻(Sambrook,et al.)。另外重組具有抑癌功能的人蛋白基因可包裝到脂質(zhì)體中轉(zhuǎn)移至細胞內(nèi)。
抑制具有抑癌功能的人蛋白mRNA的寡聚核苷酸(包括反義RNA和DNA)以及核酶也在本發(fā)明的范圍之內(nèi)。核酶是一種能特異性分解特定RNA的酶樣RNA分子,其作用機制是核酶分子與互補的靶RNA特異性雜交后進行核酸內(nèi)切作用。反義的RNA和DNA及核酶可用已有的任何RNA或DNA合成技術獲得,如固相磷酸酰胺化學合成法合成寡核苷酸的技術已廣泛應用。反義RNA分子可通過編碼該RNA的DNA序列在體外或體內(nèi)轉(zhuǎn)錄獲得。這種DNA序列已整合到載體的RNA聚合酶啟動子的下游。為了增加核酸分子的穩(wěn)定性,可用多種方法對其進行修飾,如增加兩側(cè)的序列長度,核糖核苷之間的連接應用磷酸硫酯鍵或肽鍵而非磷酸二酯鍵。
多聚核苷酸導入組織或細胞內(nèi)的方法包括將多聚核苷酸直接注入到體內(nèi)組織中;或在體外通過載體(如病毒、噬菌體或質(zhì)粒等)先將多聚核苷酸導入細胞中,再將細胞移植到體內(nèi)等。
本發(fā)明的多肽還可用作肽譜分析,例如,多肽可用物理的、化學或酶進行特異性切割,并進行一維或二維或三維的凝膠電泳分析。
本發(fā)明還提供了針對具有抑癌功能的人蛋白抗原決定簇的抗體。這些抗體包括(但不限于)多克隆抗體、單克隆抗體、嵌合抗體、單鏈抗體、Fab片段和Fab表達文庫產(chǎn)生的片段。
抗具有抑癌功能的人蛋白的抗體可用于免疫組織化學技術中,檢測活檢標本中的具有抑癌功能的人蛋白。
與具有抑癌功能的人蛋白結(jié)合的單克隆抗體也可用放射性同位素標記,注入體內(nèi)可跟蹤其位置和分布。這種放射性標記的抗體可作為一種非創(chuàng)傷性診斷方法用于腫瘤細胞的定位和判斷是否有轉(zhuǎn)移。
本發(fā)明中的抗體可用于治療或預防與具有抑癌功能的人蛋白相關的疾病。給予適當劑量的抗體可以刺激或阻斷具有抑癌功能的人蛋白的產(chǎn)生或活性。
抗體也可用于設計針對體內(nèi)某一特殊部位的免疫毒素。如具有抑癌功能的人蛋白高親和性的單克隆抗體可與細菌或植物毒素(如白喉毒素,蓖麻蛋白,紅豆堿等)共價結(jié)合。一種通常的方法是用巰基交聯(lián)劑如SPDP,攻擊抗體的氨基,通過二硫鍵的交換,將毒素結(jié)合于抗體上,這種雜交抗體可用于殺滅具有抑癌功能的人蛋白陽性的細胞。
多克隆抗體的生產(chǎn)可用具有抑癌功能的人蛋白或多肽免疫動物,如家兔,小鼠,大鼠等。多種佐劑可用于增強免疫反應,包括但不限于弗氏佐劑等。
具有抑癌功能的人蛋白單克隆抗體可用雜交瘤技術生產(chǎn)(Kohler and Milstein.Nature,1975,256:495-497)。將人恒定區(qū)和非人源的可變區(qū)結(jié)合的嵌合抗體可用已有的技術生產(chǎn)(Morrison et al,PNAS,1985,81:6851)。而已有的生產(chǎn)單鏈抗體的技術(U.S.PatNo.4946778)也可用于生產(chǎn)抗具有抑癌功能的人蛋白的單鏈抗體。
能與具有抑癌功能的人蛋白結(jié)合的多肽分子可通過篩選由各種可能組合的氨基酸結(jié)合于固相物組成的隨機多肽庫而獲得。篩選時,必須對具有抑癌功能的人蛋白分子進行標記。
本發(fā)明還涉及定量和定位檢測具有抑癌功能的人蛋白水平的診斷試驗方法。這些試驗是本領域所熟知的,且包括FISH測定和放射免疫測定。試驗中所檢測的具有抑癌功能的人蛋白水平,可以用作解釋具有抑癌功能的人蛋白在各種疾病中的重要性和用于診斷具有抑癌功能的蛋白起作用的疾病。
具有抑癌功能的蛋白的多聚核苷酸可用于具有抑癌功能的蛋白相關疾病的診斷和治療。在診斷方面,具有抑癌功能的蛋白的多聚核苷酸可用于檢測具有抑癌功能的蛋白的表達與否或在疾病狀態(tài)下具有抑癌功能的蛋白的異常表達。如具有抑癌功能的蛋白DNA序列可用于對活檢標本的雜交以判斷具有抑癌功能的蛋白的表達異常。雜交技術包括Southern印跡法,Northern印跡法、原位雜交等。這些技術方法都是公開的成熟技術,相關的試劑盒都可從商業(yè)途徑得到。本發(fā)明的多核苷酸的一部分或全部可作為探針固定在微陣列(Microarray)或DNA芯片(又稱為“基因芯片”)上,用于分析組織中基因的差異表達分析和基因診斷。用具有抑癌功能的蛋白特異的引物進行RNA-聚合酶鏈反應(RT-PCR)體外擴增也可檢測具有抑癌功能的蛋白的轉(zhuǎn)錄產(chǎn)物。
檢測具有抑癌功能的蛋白基因的突變也可用于診斷具有抑癌功能的蛋白相關的疾病。具有抑癌功能的蛋白突變的形式包括與正常野生型具有抑癌功能的蛋白DNA序列相比的點突變、易位、缺失、重組和其它任何異常等??捎靡延械募夹g如Southern印跡法、DNA序列分析、PCR和原位雜交檢測突變。另外,突變有可能影響蛋白的表達,因此用Northern印跡法、Western印跡法可間接判斷基因有無突變。
本發(fā)明的序列對染色體鑒定也是有價值的。該序列會特異性地針對某條人染色體具體位置且并可以與其雜交。目前,需要鑒定染色體上的各基因的具體位點。現(xiàn)在,只有很少的基于實際序列數(shù)據(jù)(重復多態(tài)性)的染色體標記物可用于標記染色體位置。根據(jù)本發(fā)明,為了將這些序列與疾病相關基因相關聯(lián),其重要的第一步就是將這些DNA序列定位于染色體上。
簡而言之,根據(jù)cDNA制備PCR引物(優(yōu)選15-35bp),可以將序列定位于染色體上。然后,將這些引物用于PCR篩選含各條人染色體的體細胞雜合細胞。只有那些含有相應于引物的人基因的雜合細胞會產(chǎn)生擴增的片段。
體細胞雜合細胞的PCR定位法,是將DNA定位到具體染色體的快捷方法。使用本發(fā)明的的寡核苷酸引物,通過類似方法,可利用一組來自特定染色體的片段或大量基因組克隆而實現(xiàn)亞定位??捎糜谌旧w定位的其它類似策略包括原位雜交、用標記的流式分選的染色體預篩選和雜交預選,從而構建染色體特異的cDNA庫。
將cDNA克隆與中期染色體進行熒光原位雜交(FISH),可以在一個步驟中精確地進行染色體定位。此技術的綜述,參見Verma等,Human Chromosomes:a Manual of BasicTechniques,Pergamon Press,New York(1988)。
一旦序列被定位到準確的染色體位置,此序列在染色體上的物理位置就可以與基因圖數(shù)據(jù)相關聯(lián)。這些數(shù)據(jù)可見于例如,V.Mckusick,Mendelian Inheritance in Man(可通過與Johns Hopkins University Welch Medical Library聯(lián)機獲得)。然后可通過連鎖分析,確定基因與業(yè)已定位到染色體區(qū)域上的疾病之間的關系。
接著,需要測定患病和未患病個體間的cDNA或基因組序列差異。如果在一些或所有的患病個體中觀察到某突變,而該突變在任何正常個體中未觀察到,則該突變可能是疾病的病因。比較患病和未患病個體,通常涉及首先尋找染色體中結(jié)構的變化,如從染色體水平可見的或用基于cDNA序列的PCR可檢測的缺失或易位。根據(jù)目前的物理作圖和基因定位技術的分辨能力,被精確定位至與疾病有關的染色體區(qū)域的cDNA,可以是50至500個潛在致病基因間之一種(假定1兆堿基作圖分辨能力和每20kb對應于一個基因)。
本發(fā)明的具有抑癌功能的蛋白核苷酸全長序列或其片段通??梢杂肞CR擴增法、重組法或人工合成的方法獲得。對于PCR擴增法,可根據(jù)本發(fā)明所公開的有關核苷酸序列,尤其是開放閱讀框序列來設計引物,并用市售的cDNA庫或按本領域技術人員已知的常規(guī)方法所制備的cDNA庫作為模板,擴增而得有關序列。當序列較長時,常常需要進行兩次或多次PCR擴增,然后再將各次擴增出的片段按正確次序拼接在一起。
一旦獲得了有關的序列,就可以用重組法來大批量地獲得有關序列。這通常是將其克隆入載體,再轉(zhuǎn)入細胞,然后通過常規(guī)方法從增殖后的宿主細胞中分離得到有關序列。
此外,還可用人工合成的方法來合成有關序列,尤其是片段長度較短時。通常,通過先合成多個小片段,然后再進行連接可獲得序列很長的片段。
目前,已經(jīng)可以完全通過化學合成來編碼本發(fā)明蛋白(或其片段,或其衍生物)的DNA序列。然后可將該DNA序列引入本領域中的各種DNA分子(如載體)和細胞中。此外,還可通過化學合成將突變引入本發(fā)明蛋白序列中。
此外,由于本發(fā)明的具有抑癌功能的蛋白具有源自人的天然氨基酸序列,因此,與來源于其他物種的同族蛋白相比,預計在施用于人時將具有更高的活性和/或更低的副作用(例如在人體內(nèi)的免疫原性更低或沒有)。
下面結(jié)合具體實施例,進一步闡述本發(fā)明。應理解,這些實施例僅用于說明本發(fā)明而不用于限制本發(fā)明的范圍。下列實施例中未注明具體條件的實驗方法,通常按照常規(guī)條件如Sambrook等人,分子克隆實驗室手冊(New York:Cold Spring Harbor LaboratoryPress,1989)中所述的條件,或按照制造廠商所建議的條件。
實施例1cDNA基因的獲得及對癌細胞克隆形成的抑制作用SP1224來自于從GIBCO BRL公司購得的肝cDNA文庫(cat,No.10422-012),PP265,PP384,PP432,PP552,PP591,PP603,PP632,PP844,PP928,PP1200,PP1226,PP1292,PP1396,PP1563和PP1746是通過用常規(guī)方法構建人胎盤cDNA文庫獲得的。取3、6、10月齡的胎盤組織,用Trizol試劑(GIBCO BRL公司)按廠方說明書提取總RNA,用mRNA提純試劑盒(pharmacia公司)提取mRNA。用pCMV-script TMXR cDNA文庫構建試劑盒(Stratagene公司)構建上述mRNA的cDNA文庫。其中反轉(zhuǎn)錄酶改用MMLV-RT-SuperscriptⅡ(GIBCO BRL),反轉(zhuǎn)錄反應在42℃進行。轉(zhuǎn)化XL10-Gold感受細胞,獲得了1×106cfu/μg cDNA滴度的cDNA文庫。第一輪隨機挑取cDNA克隆,其后以高豐度cDNA克隆和已證明有抑癌細胞生長功能的cDNA克隆為探針,雜交篩選cDNA文庫,挑取弱陽性及陰性克隆。用Qiagen 96孔板質(zhì)粒抽提試劑盒,按廠家說明書進行質(zhì)粒DNA的提取。質(zhì)粒DNA和空載體同時轉(zhuǎn)染肝癌細胞系7721。100ngDNA酒精沉淀干燥后,加6μl H2O溶解,待轉(zhuǎn)染。每份DNA樣品中加0.74μl脂質(zhì)體及9.3μl無血清培液,混勻后,室溫放置10分鐘。每管中加150μl無血清培液,均分加入3孔生長于96孔板的7721細胞中,37℃放置2小時,每孔再加50μl無血清培液,37℃24小時。每孔換100μl全培液,37℃24小時,換含G418的全培液100μl,37℃24~48小時,邊觀察,邊換G418濃度不等的培液。約2~3次后,直到鏡檢細胞有克隆形成,計數(shù)。發(fā)現(xiàn)上述克隆有抑制細胞克隆形成作用,結(jié)果如下表所示。
cDNA克隆轉(zhuǎn)染細胞(7721)克隆形成情況
對cDNA克隆采用雙脫氧終止法,在ABI377 DNA自動測序儀上測定其一端近500bp的核苷酸序列。分析后,確定為新基因克隆,進行另一端測序,仍未獲得全長cDNA序列,設計引物,再次進行測序,直到獲得全長序列(SEQ ID NO:1、4、7、10、13、16、19、22、25、28、31、34、37、40、43、46)。
實施例2從胎盤cDNA中PCR獲得基因克隆取3、6、10月齡的胎盤組織,用Trizol試劑(GIBCO BRL公司)按廠方說明書提取總RNA,用mRNA提純試劑盒(pharmacia公司)提取mRNA。用MMLV-RT-SuperscriptⅡ(GIBCO BRL)反轉(zhuǎn)錄酶在42℃進行反轉(zhuǎn)錄反應,獲得胎盤cDNA。利用各個基因的轉(zhuǎn)異引物(如下表所示),按90℃3分鐘1個循環(huán)94℃30秒,60℃30秒,72℃1分鐘,共35個循環(huán);72℃10分鐘,1個循環(huán)進行PCR擴增,獲得含有完整開放閱讀框序列的各蛋白基因的擴增產(chǎn)物。擴增產(chǎn)物經(jīng)測序驗證,與實施例1測得的序列相符,隨后用常規(guī)技術將擴增產(chǎn)物轉(zhuǎn)入宿主細胞,以獲得重組蛋白。
基因特異引物
實施例3cDNA克隆序列分析1.SP1224A核苷酸序列(SEQ ID NO:1)長度2492bp1 CCCACGCGTC CGGGAATCCA GTCCGGGGGC CGAGCTGGCT GCGCCCTCCG51 CCAAGCGCCG GCAGCGCGGG GCGAGCTCCG GACGGCGCGC GGCCCAGGCA101 GCGGCTCCCG CTCGGCCCGC CCTCCGAGCC GCAGGGGCCG CCACCGCCGC151 GGCGCCTCCC CTGGCGACCG CGCCCCCGGG CCCCGGCTCC GGCCCGGGAC201 GGAGGAGCCG GCGCTCGACA CAGAGAGCTC TTCAGAAACC AGGCTGCTTT251 CAGGAACATT GCTGTGGATT CCCAGGGCCT ATTCCACTAG AAGCAAGATG301 GCTGAACTCA ATACTCATGT GAATGTCAAG GAAAAGATCT ATGCAGTTAG351 ATCAGTTGTT CCCAACAAAA GCAATAATGA AATAGTCCTG GTGCTCCAAC401 AGTTTGATTT TAATGTGGAT AAAGCCGTGC AAGCCTTTGT GGATGGCAGT451 GCAATTCAAG TTCTAAAAGA ATGGAATATG ACAGGAAAAA AGAAGAACAA501 TAAAAGAAAA AGAAGCAAGT CCAAGCAGCA TCAAGGCAAC AAAGATGCTA551 AAGACAAGGT GGAGAGGCCT TGAGGCAGGG CCCCTGCAGC CGCAGCCACC601 ACAGATTCAA AACGGCCCCA TGAATGGCTG CGAGAAGGAC AGCTCGTCCA651 CAGATTCTGC TAACGAAAAA CCAGCCCTTA TCCCTCGTGA GAAAAAGATC701 TCGATACTTG AGGAACCTTC AAAGGCACTT CGTGGGGTCA CAGAAGGCAA751 CAGACTACTG CAACAGAAAC TATCCTTAGA TGGGAACCCC AAACCTATAC801 ATGGAACAAC AGAGAGGTCA GATGGCCTAC AGTGGTCAGC TGAGCAGCCT851 TGTAACCCAA GCAAGCCTAA GGCAAAAACA TCTCCTGTTA AGTCCAATAC901 CCCTGCAGCT CATCTTGAAA TAAAGCCAGA TGAGTTGGCA AAGAAAAGAG951 GCCCAAATAT TGAGAAATCA GTGAAGGATT TGCAACGCTG CACCGTTTCT1001 CTAACTAGAT ATCGCGTCAT GATTAAGGAA GAAGTGGATA GTTCCGTGAA1051 GAAGATCAAA GCTGCCTTTG CTGAATTACA CAACTGCATC ATTGACAAAG1101 AAGTTTCATT AATGGCAGAA ATGGATAAAG TTAAAGAAGA AGCCATGGAA1151 ATCCTGACTG CTCGTCAGAA GAAAGCAGAA GAACTAAAGA GACTCACTGA1201 CCTTGCCAGT CAGATGGCAG AGATGCAGCT GGCCGAACTC AGGGCAGAAA1251 TTAAGCACTT TGTCAGCGAG CGTAAATATG ACGAGGAGCT CGGGAAAGCT1301 GCCCGGTTTT CCTGTGACAT CGAACAGCTG AAGGCCCAAA TCATGCTCTG1351 CGGAGAAATT ACACATCCAA AGAACAACTA TTCCTCAAGA ACTCCCTGCA1401 GCTCCCTGCT GCCTCTGCTG AATGCGCACG CAGCAACCTC TGGGAAACAG1451 AGTAACTTTT CCCGAAAATC ATCCACTCAC AATAAGCCCT CTGAAGGCAA1501 AGCGGCAAAC CCCAAAATGG TGAGCAGTCT CCCCAGCACC GCCGACCCCT1551 CTCACCAGAC CATGCCGGCC AACAAGCAGA ATGGATCTTC TAACCAAAGA1601 CGGAGATTTA ATCCACAGTA TCATAACAAC AGGCTAAATG GGCCTGCCAA1651 GTCGCAGGGC AGTGGGAATG AAGCCGAGCC ACTGGGAAAG GGCAACAGCC1701 GCCACGAACA CAGAAGACAG CCGCACAACG GCTTCCGGCC CAAAAACAAA1751 GGCGGTGCCA AAAATCAAGA GGCTTCCTTG GGGATGAAGA CCCCCGAGGC1801 CCCGGCCCAT TCTGAAAAGC CCCGGCGAAG GCAGCACGCT GCAGACACCT1851 CGGAGGCCAG GCCCTTCCGG GGTAGTGTCG GTAGGGTTTC ACAGTGCAAT1901 CTCTGCCCCA CGAGAATAGA AGTTTCCACA GATGCAGCAG TTCTCTCAGT1951 CCCGGCTGTG ACGTTGGTGG CCTGAGCTAG GAGGAAAAAG AGCAGTTTTC2001 ACTCAGTTTT GGTTCCCTGC CCGAGGTGCT GACCCAATTC GCTGCCAAAA2051 GAGTGTCAAT CAGAATATAC AAATCCCGTA TGGTTGTGTC ATCCTCTCTT2101 AATCATTTTT ACTAATTCTA ATAATCAGCT CTAGCTTGCT TCATAATTTT2151 CATGGCTTTG CTTGATCTGT TGATGCTTTC TCTCATCAAG ACTTTGCAGC2201 ATTTTAGCCA GGCAGTATTT ACTCATTATT AGGAAAATCA AGATGTGGCT2251 GAAGATCAGA GGCTCAGTTA GCAACCTGTG TTGTAGCAGT GATGTCAGTC2301 CATTGATTGT CTTTAGAGAG TTAATGTTAC AAAAAAGAAT TCTTAATAAT2351 CAGACAAACA TGATCTGCTG AGGACACATG CGCTTTTGTA GAATTTAACA2401 TCTGGTGTTT TTCTGAAAAA ATATATATAC ATATATTGCT TTATTTGAAA2451 CAAATTAAAA TATGCTGCAT TTGAAAAAAA AAAAAAAAAA AAB氨基酸序列(SFQ ID NO:2)長度476個氨基酸1 MLKTRWRGLE AGPLQPQPPQ IQNGPMNGCE KDSSSTDSAN EKPALIPREK51 KISILEEPSK ALRGVTEGNR LLQQKLSLDG NPKPIHGTTE RSDGLQWSAE101 QPCNPSKPKA KTSPVKSNTP AAHLEIKPDE LAKKRGPNIE KSVKDLQRCT151 VSLTRYRVMI KEEVDSSVKK IKAAFAELHN CIIDKEVSLM AEMDKVKEEA201 MEILTARQKK AEELKRLTDL ASQMAEMQLA ELRAEIKHFV SERKYDEELG251 KAARFSCDIE QLKAQIMLCG EITHPKNNYS SRTPCSSLLP LLNAHAATSG301 KQSNFSRKSS THNKPSEGKA ANPKMVSSLP STADPSHQTM PANKQNGSSN351 QRRRFNPQYH NNRLNGPAKS QGSGNEAEPL GKGNSRHEHR RQPHNGFRPK401 NKGGAKNQEA SLGMKTPEAP AHSEKPRRRQ HAADTSEARP FRGSVGRVSQ451 CNLCPTRIEV STDAAVLSVP AVTLVAC核苷酸及氨基酸組合序列(SEQ ID NO:3)克隆號和蛋白名稱SP1224起始編碼子545 ATG終止編碼子1975 TGA蛋白質(zhì)分子量52451.741C CCA CGC GTC CGG GAA TCC AGT CCG GGG GCC GAG CTG GCT GCG CCC 4647 TCC GCC AAG CGC CGG CAG CGC GGG GCG AGC TCC GGA CGG CGC GCG GCC 9495 CAG GCA GCG GCT CCC GCT CGG CCC GCC CTC CGA GCC GCA GGG GCC GCC 142143 ACC GCC GCG GCG CCT CCC CTG GCG ACC GCG CCC CCG GGC CCC GGC TCC 190191 GGC CCG GGA CGG AGG AGC CGG CGC TCG ACA CAG AGA GCT CTT CAG AAA 238239 CCA GGC TGC TTT CAG GAA CAT TGC TGT GGA TTC CCA GGG CCT ATT CCA 286287 CTA GAA GCA AGA TGG CTG AAC TCA ATA CTC ATG TGA ATG TCA AGG AAA 334335 AGA TCT ATG CAG TTA GAT CAG TTG TTC GCA ACA AAA GCA ATA ATG AAA 382383 TAG TCC TGG TGC TCC AAC AGT TTG ATT TTA ATG TGG ATA AAG CCG TGC 430431 AAG CCT TTG TGG ATG GCA GTG CAA TTC AAG TTC TAA AAG AAT GGA ATA 478479 TGA CAG GAA AAA AGA AGA ACA ATA AAA GAA AAA GAA GCA AGT CCA AGC 526527 AGC ATC AAG GCA ACA AAG ATG CTA AAG ACA AGG TGG AGA GGC CTT GAG 5741 Met Leu Lys Thr Arg Trp Arg Gly Leu Glu 10575 GCA GGG CCC CTG CAG CCG CAG CCA CCA CAG ATT CAA AAC GGC CCC ATG 62211 Ala Gly Pro Leu Gln Pro Gln Pro Pro Gln Ile Gln Asn Gly Pro Met 26623 AAT GGC TGC GAG AAG GAC AGC TCG TCC ACA GAT TCT GCT AAC GAA AAA 67027 Asn Gly Cys Glu Lys Asp Ser Ser Ser Thr Asp Ser Ala Asn Glu Lys 42671 CCA GCC CTT ATC CCT CGT GAG AAA AAG ATC TCG ATA CTT GAG GAA CCT 71843 Pro Ala Leu Ile Pro Arg Glu Lys Lys Ile Ser Ile Leu Glu Glu Pro 58719 TCA AAG GCA CTT CGT GGG GTC ACA GAA GGC AAC AGA CTA CTG CAA CAG 76659 Ser Lys Ala Leu Arg Gly Val Thr Glu Gly Asn Arg Leu Leu Gln Gln 74767 AAA CTA TCC TTA GAT GGG AAC CCC AAA CCT ATA CAT GGA ACA ACA GAG 81475 Lys Leu Ser Leu Asp Gly Asn Pro Lys Pro Ile His Gly Thr Thr Glu 90815 AGG TCA GAT GGC CTA CAG TGG TCA GCT GAG CAG CCT TGT AAC CCA AGC 86291 Arg Ser Asp Gly Leu Gln Trp Ser Ala Glu Gln Pro Cys Asn Pro Ser 106863 AAG CCT AAG GCA AAA ACA TCT CCT GTT AAG TCC AAT ACC CCT GCA GCT 910107 Lys Pro Lys Ala Lys Thr Ser Pro Val Lys Ser Asn Thr Pro Ala Ala 122911 CAT CTT GAA ATA AAG CCA GAT GAG TTG GCA AAG AAA AGA GGC CCA AAT 958123 His Leu Glu Ile Lys Pro Asp Glu Leu Ala Lys Lys Arg Gly Pro Asn 138959 ATT GAG AAA TCA GTG AAG GAT TTG CAA CGC TGC ACC GTT TCT CTA ACT1006139 Ile Glu Lys Ser Val Lys Asp Leu Gln Arg Cys Thr Val Ser Leu Thr 1541007 AGA TAT CGC GTC ATG ATT AAG GAA GAA GTG GAT AGT TCC GTG AAG AAG1054155 Arg Tyr Arg Val Met Ile Lys Glu Glu Val Asp Ser Ser Val Lys Lys 1701055 ATC AAA GCT GCC TTT GCT GAA TTA CAC AAC TGC ATC ATT GAC AAA GAA1102171 Ile Lys Ala Ala Phe Ala Glu Leu His Asn Cys Ile Ile Asp Lys Glu 1861103 GTT TCA TTA ATG GCA GAA ATG GAT AAA GTT AAA GAA GAA GCC ATG GAA1150187 Val Ser Leu Met Ala Glu Met Asp Lys Val Lys Glu Glu Ala Met Glu 2021151 ATC CTG ACT GCT CGT CAG AAG AAA GCA GAA GAA CTA AAG AGA CTC ACT1198203 Ile Leu Thr Ala Arg Gln Lys Lys Ala Glu Glu Leu Lys Arg Leu Thr 2181199 GAC CTT GCC AGT CAG ATG GCA GAG ATG CAG CTG GCC GAA CTC AGG GCA1246219 Asp Leu Ala Ser Gln Met Ala Glu Met Gln Leu Ala Glu Leu Arg Ala 2341247 GAA ATT AAG CAC TTT GTC AGC GAG CGT AAA TAT GAC GAG GAG CTC GGG1294235 Glu Ile Lys His Phe Val Ser Glu Arg Lys Tyr Asp Glu Glu Leu Gly 2501295 AAA GCT GCC CGG TTT TCC TGT GAC ATC GAA CAG CTG AAG GCC CAA ATC1342251 Lys Ala Ala Arg Phe Ser Cys Asp Ile Glu Gln Leu Lys Ala Gln Ile 2661343 ATG CTC TGC GGA GAA ATT ACA CAT CCA AAG AAC AAC TAT TCC TCA AGA1390267 Met Leu Cys Gly Glu Ile Thr His Pro Lys Asn Asn Tyr Ser Ser Arg 2821391 ACT CCC TGC AGC TCC CTG CTG CCT CTG CTG AAT GCG CAC GCA GCA ACC1438283 Thr Pro Cys Ser Ser Leu Leu Pro Leu Leu Asn Ala His Ala Ala Thr 2981439 TCT GGG AAA CAG AGT AAC TTT TCC CGA AAA TCA TCC ACT CAC AAT AAG1486299 Ser Gly Lys Gln Ser Asn Phe Ser Arg Lys Ser Ser Thr His Asn Lys 3141487 CCC TCT GAA GGC AAA GCG GCA AAC CCC AAA ATG GTG AGC AGT CTC CCC1534315 Pro Ser Glu Gly Lys Ala Ala Asn Pro Lys Met Val Ser Ser Leu Pro 3301535 AGC ACC GCC GAC CCC TCT CAC CAG ACC ATG CCG GCC AAC AAG CAG AAT1582331 Ser Thr Ala Asp Pro Ser His Gln Thr Met Pro Ala Asn Lys Gln Asn 3461583 GGA TCT TCT AAC CAA AGA CGG AGA TTT AAT CCA CAG TAT CAT AAC AAC1630347 Gly Ser Ser Asn Gln Arg Arg Arg Phe Asn Pro Gln Tyr His Asn Asn 3621631 AGG CTA AAT GGG CCT GCC AAG TCG CAG GGC AGT GGG AAT GAA GCC GAG1678363 Arg Leu Asn Gly Pro Ala Lys Ser Gln Gly Ser Gly Asn Glu Ala Glu 3781679 CCA CTG GGA AAG GGC AAC AGC CGC CAC GAA CAC AGA AGA CAG CCG CAC1726379 Pro Leu Gly Lys Gly Asn Ser Arg His Glu His Arg Arg Gln Pro His 3941727 AAC GGC TTC CGG CCC AAA AAC AAA GGC GGT GCC AAA AAT CAA GAG GCT1774395 Asn Gly Phe Arg Pro Lys Asn Lys Gly Gly Ala Lys Asn Gln Glu Ala 4101775 TCC TTG GGG ATG AAG ACC CCC GAG GCC CCG GCC CAT TCT GAA AAG CCC1822411 Ser Leu Gly Met Lys Thr Pro Glu Ala Pro Ala His Ser Glu Lys Pro 4261823 CGG CGA AGG CAG CAC GCT GCA GAC ACC TCG GAG GCC AGG CCC TTC CGG1870427 Arg Arg Arg Gln His Ala Ala Asp Thr Ser Glu Ala Arg Pro Phe Arg 4421871 GGT AGT GTC GGT AGG GTT TCA CAG TGC AAT CTC TGC CCC ACG AGA ATA1918443 Gly Ser Val Gly Arg Val Ser Gln Cys Asn Leu Cys Pro Thr Arg Ile 4581919 GAA GTT TCC ACA GAT GCA GCA GTT CTC TCA GTC CCG GCT GTG ACG TTG1966459 Glu Val Ser Thr Asp Ala Ala Val Leu Ser Val Pro Ala Val Thr Leu 4741967 GTG GCC TGA GCT AGG AGG AAA AAG AGC AGT TTT CAC TCA GTT TTG GTT2014475 Val Ala *** 4772015 CCC TGC CCG AGG TGC TGA CCC AAT TCG CTG CCA AAA GAG TGT CAA TCA20622063 GAA TAT ACA AAT CCC GTA TGG TTG TGT CAT CCT CTC TTA ATC ATT TTT21102111 ACT AAT TCT AAT AAT CAG CTC TAG CTT GCT TCA TAA TTT TCA TGG CTT21582159 TGC TTG ATC TGT TGA TGC TTT CTC TCA TCA AGA CTT TGC AGC ATT TTA22062207 GCC AGG CAG TAT TTA CTC ATT ATT AGG AAA ATC AAG ATG TGG CTG AAG22542255 ATC AGA GGC TCA GTT AGC AAC CTG TGT TGT AGC AGT GAT GTC AGT CCA23022303 TTG ATT GTC TTT AGA GAG TTA ATG TTA CAA AAA AGA ATT CTT AAT AAT23502351 CAG ACA AAC ATG ATC TGC TGA GGA CAC ATG CGC TTT TGT AGA ATT TAA23982399 CAT CTG GTG TTT TTC TGA AAA AAT ATA TAT ACA TAT ATT GCT TTA TTT24462447 GAA ACA AAT TAA AAT ATG CTG CAT TTG AAA AAA AAA AAA AAA AAA A 2492DBlastp結(jié)果Query=SP1224[基因=SP1224](476個氨基酸)>SP IN:046309 046309 drosophila melanogaster(fruit fly).eg:8d8.6protein.5/1999長度=402分值=45.7 bits(106),預計值=8e-04相同性=42/186(22%),相似性=80/186(42%),缺口=16/186(8%)Query:291 LLNAHAATSGKQSNFSRKSSTHNKPSEGKAANPKMVSSLPSTADPSHQTMPANKQNGSSN 350+ +A AA +GK+ ++K+SP+++ ++ + PS HQ A ++Sbjct:1 MADAQAAAAGKKKYKNKKNSAEKNPNHNPNSSGQVEAQTPSNGHVQHQEEEATEDQEPAQ 60Query:351 QRRRFNPQYHNNRLNGPAKSQGSGNEAEPLGKGNSRHEHRRQPHNGFRPKNKGGAKNQEA 410+R + HNG+EA PLG+ + H H+N RG + N +Sbjct:61 ELRGLLKKMH--LCNGHGHKE---QEARPLGEVVNGHAHGHSNNNHIR-CTSGSSNNNNS 114Query:411 SLGMKTPEAPAHSEKPRRR---QHAADTSEARPFRGSVGRVSQ--CNLCPT-----RIEV 460++ ++ ++ K RR +D++ +P+ S+ N+ PT + +VSbjct:115 THNNNSVDSSNNNRKQRREGGDGGGSDSNSLKPEEKPITATSKTTANIHPTTTTDPKPKV 174Query:461 STDAAV 466S D AVSbjct:175 SEDVAV 180>SW:YG6P CAEEL P90970 caenorhabditis elegans.hypothetical 60.7 kdprotein t23g11.8 in chromosome ⅰ.11/1997長度=530分值=44.9 bits(104),預計值=0.001相同性=51/201(25%),相似性=98/201(48%),缺口=18/201(8%)Query:115 VKSNTPAAHLEIKPDELAKKRGPNIEKSVKDLQRCTVSLTRYRVMIKEEVDSSVKKIKAA 174V++ ++H EL R++ K ++ C+R V ++EE+ +V++++ ASbjct:180 VENQKVSSHEMDSLQELKLARQKAQDQKEKAVEECNMH-KRKIVGLEEEIRAMVEQLRLA 238Query:175 FAELHNCIIDKEVSLMAEMDKVKEEAMEILTARQKKAEELKRLTDLASQMAEMQLAELRA 234L++K+ E D+ K +A +ILTA++K E LK +S +L L+ASbjct:239 KFNLNE---NKK-----EFDEYKNKAQKILTAKEKLVESLKSEQGIGSSDRPVHL--LQA 288Query:235 EIKHFVSER---KYDEELGKAARFS--CDIEQLKAQIM-LCGEITHPKNNY-SSRTPCSS 287E++ER K D E + ++ D+E+L+AQI L +++ K + +SSbjct:289 EVEEIRVERDLTKADLESAQLQVYTLRSDMEELEAQIRDLQSQLSDQKRTHLEEKQTWDS 348Query:288 LLPLLNAHAATSGKQSNFSRK 308+LLN S ++ F+++Sbjct:349 TIGLLNEKVECSRIENEFTKQ 3692.PP265A核苷酸序列(SEQ ID NO:4)長度1969bp1 CGGCCGCGAG GTGGCCATGA AGATCCAGTA CCCTGGCGTG GCCCAGAGCA51 TCAACAGTGA TGTCAACAAC CTCATGGCCG TGTTGAACAT GAGCAACATG101 CTTCCAGAAG GCCTGTTCCC CGAGCACCTG ATCGACGTGC TGAGGCGGGA151 GCTGGCCCTG GAGTGTGACT ACCAGCGAGA GGCCGCCTGT GCCCGCAAGT201 TCAGGTGTGG CCCCCGGCCG GGCCCCTTGC GTGTTTGCAC CAGGGAGGCA251 GAAGGGACCA TGTTCAGCAG CTGGTGAAGG CCCCTCCAGC TCTGAGGGGC301 AGAGGGCTGG GGTTGCAGCC TGGGCCGAGG CCATATCCTG CCTGGGGTGA351 AGGAGGGCCC TCTGCCTGGT TGGGGGGTGT GTGTGGGGGG GGGGACGGTG401 TGGAGGGCCT GTGGCTAGGG CGTGACCTCC CTCCCCTACC CAGGGACCTG451 CTGAAGGGCC ACCCCTTCTT CTATGTGCCT GAGATTGTGG ATGAGCTCTG501 CAGCCCACAT GTGCTGACCA CAGAGCTGGT GTCTGGCTTC CCCCTGGACC551 AGGCCGAAGG GCTCAGCCAG GAGATTCGGA ACGAGATCTG CTACAACATC601 CTGGTTCTGT GCCTGAGGGA GCTGTTCGAG TTCCACTTCA TGCAAACAGA651 CCCCAACTGG TCCAACTTCT TCTATGACCC CCAGCAGCAC AAGGTGGCTC701 TTTTGGATTT TGGGGCAACG CGGGAATATG ACAGATCCTT CACCGACCTC751 TACATTCAGA TCATCAGGGC TGCTGCCGAC AGGGACAGGG AGACTGTGCG801 GGCGAAATCC ATAGAGATGA AGTTCCTCAC CGGCTACGAG GTCAAGGTCA851 TGGAAGACGC CCACTTGGAT GCCATCCTCA TCCTGGGGGA GGCCTTCGCC901 TCTGATGAGC CTTTTGATTT TGGCACTCAG AGCACCACCG AGAAGATCCA951 CAACCTGATT CCCGTCATGC TGAGGCACCG TCTCGTCCCC CCACCCGAGG1001 AAACCTACTC CCTGCACAGG AAGATGGGGG GCTCCTTCCT CATCTGCTCC1051 AAGCTGAAGG CCCGCTTCCC CTGCAAGGCC ATGTTCGAGG AGGCCTACAG1101 CAACTACTGC AAGAGGCAGG CCCAGCAGTA GGGCTGCGGG CCACGCCCAG1151 GCCGGCTCCG CGGGAACTCT CTCCCTCAGA CAGGCCAAAA ACCAGTAGCG1201 AGGTCGTGGT GATGCTCTTT TTAACTCCTT TGCCCAATAA GGGGGGTGGC1251 TGCCTGGAGC CCCGTAGCCA GCGCTTTCCA CGGTTTCTGT TGCTAAATGG1301 TTGTAGGGTG AGAAGTGCAA GAATGAAGAT GAAGCCCCAC TGCTCGGTCA1351 GTCTGCCTCC GTGTGTCCTC TGAAATAAGC AGATGAAGAT GAAAGGGCAA1401 CTTTGTTTTC TTCTTTTTCC TGATGTGAAT GTTAAGCAGA AGGGAGAGAG1451 TCCTTACTCC CTTCCAATCT CTGTTCAGTG CAAAACCCAG AAACATGACA1501 GATACGATTG TGGGATTTTA TCATCTGTGT AGTAGGTGTG TGTATGTGTT1551 TCTAGAGTGA GATTTGTGTT TTCTGCCCTT TTCCTCTCCA GCCAATGGGC1601 TGGAGCTGGG AGAGGTGCTG AGCTAACAGT GCCAACAAGT GCTCCTTAAG1651 CCTGCGAGGC CCAGGCCTGT GGGGCTGGTT CTCACCTTTG ACAGCTGAAT1701 GTTCCTAAAG AACTGCTGCC CCACAGTGAG GGTGGGAGCA GCGGAACAGG1751 GAATGCCAGA CACAGGCTCG CTGCTGCTGG AAGGCGGGGT GGGACTTCCT1801 TCCTCTGTCC AGAGAGGCAC AGGTGTCACC AGTTCCAGCC AAAGGCTCCT1851 CACAGGCGCT GTGAATTTTT GTACAAGTCT TGTAATTATC GAATCAACAA1901 CTTGTTTCAA TTTAATAAAA ATGCTCATGG GAAGTGCAAA AAAAAAAAAA1951 AAAAAAAAAA AAAAAAAAAB氨基酸序列(SEQ ID NO:5)長度163個氨基酸1 MQTDPNWSNF FYDPQQHKVA LLDFGATREY DRSFTDLYIQ IIRAAADRDR51 ETVRAKSIEM KFLTGYEVKV MEDAHLDAIL ILGEAFASDE PFDFGTQSTT101 EKIHNLIPVM LRHRLVPPPE ETYSLHRKMG GSFLICSKLK ARFPCKAMFE151 EAYSNYCKRQ AQQC核苷酸及氨基酸組合序列(SEQ ID NO:6)克隆號和蛋白名稱PP265起始編碼子640 ATG終止編碼子1131 TAG蛋白質(zhì)分子量18935.631 CGG CCG CGA GGT GGC CAT GAA GAT CCA GTA CCC TGG CGT GGC CCA GAG 4849 CAT CAA CAG TGA TGT CAA CAA CCT CAT GGC CGT GTT GAA CAT GAG CAA 9697 CAT GCT TCC AGA AGG CCT GTT CCC CGA GCA CCT GAT CGA CGT GCT GAG 144145 GCG GGA GCT GGC CCT GGA GTG TGA CTA CCA GCG AGA GGC CGC CTG TGC 192193 CCG CAA GTT CAG GTG TGG CCC CCG GCC GGG CCC CTT GCG TGT TTG CAC 240241 CAG GGA GGC AGA AGG GAC CAT GTT CAG CAG CTG GTG AAG GCC CCT CCA 288289 GCT CTG AGG GGC AGA GGG CTG GGG TTG CAG CCT GGG CCG AGG CCA TAT 336337 CCT GCC TGG GGT GAA GGA GGG CCC TCT GCC TGG TTG GGG GGT GTG TGT 384385 GGG GGG GGG GAC GGT GTG GAG GGC CTG TGG CTA GGG CGT GAC CTC CCT 432433 CCC CTA CCC AGG GAC CTG CTG AAG GGC CAC CCC TTC TTC TAT GTG CCT 480481 GAG ATT GTG GAT GAG CTC TGC AGC CCA CAT GTG CTG ACC ACA GAG CTG 528529 GTG TCT GGC TTC CCC CTG GAC CAG GCC GAA GGG CTC AGC CAG GAG ATT 576577 CGG AAC GAG ATC TGC TAC AAC ATC CTG GTT CTG TGC CTG AGG GAG CTG 624625 TTC GAG TTC CAC TTC ATG CAA ACA GAC CCC AAC TGG TCC AAC TTC TTC 6721 Met Gln Thr Asp Pro Asn Trp Ser Asn Phe Phe 11673 TAT GAC CCC CAG CAG CAC AAG GTG GCT CTT TTG GAT TTT GGG GCA ACG 72012 Tyr Asp Pro Gln Gln His Lys Val Ala Leu Leu Asp Phe Gly Ala Thr 27721 CGG GAA TAT GAC AGA TCC TTC ACC GAC CTC TAC ATT CAG ATC ATC AGG 76828 Arg Glu Tyr Asp Arg Ser Phe Thr Asp Leu Tyr Ile Gln Ile Ile Arg 43769 GCT GCT GCC GAC AGG GAC AGG GAG ACT GTG CGG GCG AAA TCC ATA GAG 81644 Ala Ala Ala Asp Arg Asp Arg Glu Thr Val Arg Ala Lys Ser Ile Glu 59817 ATG AAG TTC CTC ACC GGC TAC GAG GTC AAG GTC ATG GAA GAC GCC CAC 86460 Met Lys Phe Leu Thr Gly Tyr Glu Val Lys Val Met Glu Asp Ala His 75865 TTG GAT GCC ATC CTC ATC CTG GGG GAG GCC TTC GCC TCT GAT GAG CCT 91276 Leu Asp Ala Ile Leu Ile Leu Gly Glu Ala Phe Ala Ser Asp Glu Pro 91913 TTT GAT TTT GGC ACT CAG AGC ACC ACC GAG AAG ATC CAC AAC CTG ATT 96092 Phe Asp Phe Gly Thr Gln Ser Thr Thr Glu Lys Ile His Asm Leu Ile 107961 CCC GTC ATG CTG AGG CAC CGT CTC GTC CCC CCA CCC GAG GAA ACC TAC1008108 Pro Va1 Met Leu Arg His Arg Leu Val Pro Pro Pro Glu Glu Thr Tyr 1231009 TCC CTG CAC AGG AAG ATG GGG GGC TCC TTC CTC ATC TGC TCC AAG CTG1056124 Ser Leu His Arg Lys Met Gly Gly Ser Phe Leu Ile Cys Ser Lys Leu 1391057 AAG GCC CGC TTC CCC TGC AAG GCC ATG TTC GAG GAG GCC TAC AGC AAC1104140 Lys Ala Arg Phe Pro Cys Lys Ala Met Phe Glu Glu Ala Tyr Ser Asn 1551105 TAC TGC AAG AGG CAG GCC CAG CAG TAG GGC TGC GGG CCA CGC CCA GGC1152156 Tyr Cys Lys Arg Gln Ala Gln Gln *** 1641153 CGG CTC CGC GGG AAC TCT CTC CCT CAG ACA GGC CAA AAA CCA GTA GCG12001201 AGG TCG TGG TGA TGC TCT TTT TAA CTC CTT TGC CCA ATA AGG GGG GTG12481249 GCT GCC TGG AGC CCC GTA GCC AGC GCT TTC CAC GGT TTC TGT TGC TAA12961297 ATG GTT GTA GGG TGA GAA GTG CAA GAA TGA AGA TGA AGC CCC ACT GCT13441345 CGG TCA GTC TGC CTC CGT GTG TCC TCT GAA ATA AGC AGA TGA AGA TGA13921393 AAG GGC AAC TTT GTT TTC TTC TTT TTC CTG ATG TGA ATG TTA AGC AGA14401441 AGG GAG AGA GTC CTT ACT CCC TTC CAA TCT CTG TTC AGT GCA AAA CCC14881489 AGA AAC ATG ACA GAT ACG ATT GTG GGA TTT TAT CAT CTG TGT AGT AGG15361537 TGT GTG TAT GTG TTT CTA GAG TGA GAT TTG TGT TTT CTG CCC TTT TCC15841585 TCT CCA GCC AAT GGG CTG GAG CTG GGA GAG GTG CTG AGC TAA CAG TGC16321633 CAA CAA GTG CTC CTT AAG CCT GCG AGG CCC AGG CCT GTG GGG CTG GTT16801681 CTC ACC TTT GAC AGC TGA ATG TTC CTA AAG AAC TGC TGC CCC ACA GTG17281729 AGG GTG GGA GCA GCG GAA CAG GGA ATG CCA GAC ACA GGC TCG CTG CTG17761777 CTG GAA GGC GGG GTG GGA CTT CCT TCC TCT GTC CAG AGA GGC ACA GGT18241825 GTC ACC AGT TCC AGC CAA AGG CTC CTC ACA GGC GCT GTG AAT TTT TGT18721873 ACA AGT CTT GTA ATT ATC GAA TCA ACA ACT TGT TTC AAT TTA ATA AAA19201921 ATG CTC ATG GGA AGT GCA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA19681969 A 1969DBlastp結(jié)果Query=PP265[基因=PP265](163個氨基酸)>SW:YLC4_CAEEL Q18486 caenorhabditis elegans.hypothetical 81.0 kdprotein c35d10.4 in chromosome ⅲ.7/1998長度=733分值=176 bits(441),預計值=1e-43相同性=81/160(50%),相似性=110/160(68%),缺口=4/160(2%)Query:1 MQTDPNWSNFFYDPQ----QHKVALLDFGATREYDRSFTDLYIQIIRAAADRDRETVRAK 56MQTDPNWSNFF+ ++ LLDFGA+R Y + F D+Y+ II++A D D++ +Sbjct:562 MQTDPNWSNFFLGKHPKTGEPRLVLLDFGASRAYGKKFVDIYMNIIKSAYDGDKKKIIEY 621Query:57 SIEMKFLTGYEVKVMEDAHLDAILILGEAFASDEPFDFGTQSTTEKIHNLIPVMLRHRLV 116S E+ FLTGYE VMEDAH+++++I+GE AS+ P++F Q T +I LIPVML HRLSbjct:622 SREIGFLTGYETSVMEDAHVESVMIMGETLASNHPYNFANQDVTMRIQKLIPVMLEHRLT 681Query:117 PPPEETYSLHRKMGGSFLICSKLKARFPCKAMFEEAYSNY 156PPEE YSLHRK+ G +L ++KLKA C +F E + NYSbjct:682 SPPEEIYSLHRKLSGCYLLAAKLKATVSCGGLFHEIHENY 721>PIR2:S71110 abcl protein-fission yeast(Schizosaccharomycespombe)長度=610分值=161 bits(404),預計值=3e-39相同性=78/158(49%),相似性=105/158(66%),缺口=2/158(1%)Query:1 MQTDPNWSNFFYDPQQHKVALLDFGATREYDRSFTDLYIQIIRAAADRDRETVRAKSIEM 60MQTDPNWSNF Y+ + K+ LLDFGA+ EYD F Y +++ AAA R+RE + S+E+Sbjct:451 MQTDPNWSNFLYNGKTKKIELLDFGASIEYDEKFIKKYCRLLLAAAHRNREKCKKLSVEL 510Query:61 KFLTGYEVKVMEDAHLDAILILGEAFASDEP--FDFGTQSTTEKIHNLIPVMLRHRLVPP 118+L +E M DAH+++I L E FA D P +DFG Q+ T ++ IPVML RL PPSbjct:511 GYLNNHESAQMIDAHINSIFTLAEPFAFDAPDVYDFGDQTITARVKQQIPVMLDLRLQPP 570Query:119 PEETYSLHRKMGGSFLICSKLKARFPCKAMFEEAYSNY 156PEETYSLHR++ G FL+C+KL A+ CK +F +YSbjct:571 PEETYSLHRRLSGHFLLCAKLGAKVRCKELFSGMLKHY 608>SW:ABCI_SCHPO Q92338 schizosaccharomyces pombe(fission yeast).
abcl protein homolog precursor.12/1998長度=610分值=161 bits(404),預計值=3e-39相同性=78/158(49%),相似性=105/158(66%),缺口=2/158(1%)Query:1 MQTDPNWSNFFYDPQQHKVALLDFGATREYDRSFTDLYIQIIRAAADRDRETVRAKSIEM 60MQTDPNWSNF Y+ + K +LLDFGA+EYD F Y +++ AAA R+RE + S+E+Sbjct:451 MQTDPNWSNFLYNGKTKKIELLDFGASIEYDEKFIKKYCRLLLAAAHRNREKCKKLSVEL 510Query:61 KFLTGYEVKVMEDAHLDAILILGEAFASDEP--FDFGTQSTTEKIHNLIPVMLRHRLVPP 118+L +E M DAH+++I L E FA D P +DFG Q+ T ++ IPVML RL PPSbjct:511 GYLNNHESAQMIDAHINSIFTLAEPFAFDAPDVYDFGDQTITARVKQQIPVMLDLRLQPP 570Query:119 PEETYSLHRKMGGSFLICSKLKARFPCKAMFEEAYSNY 156PEETYSLHR++ G FL+C+KL A+ CK +F +YSbjct:571 PEETYSLHRRLSGHFLLCAKLGAKVRCKELFSGMLKHY 6083.PP384A核苷酸序列(SEQ ID NO:7)長度2357bp1 CAAAGGGCTG TTGCTGACAG TTAATACCAG TAGTCAGAAT GGGAAGGCCT51 GGAAGAACAC TTATTAAAGA AATCCAGAGT CCTCTGTCTA GTATCTGTGA101 TGGCTCCATA GCTCTAGATG CTGAGCCTGT TACCCAGCCA GCATCGCTGC151 CCAGACACAG CAGCACACCA GACCACACCA GCACACTGGA GCCTCCTCGT201 TTGCCTCAAA GAAAGAACTT ACAAAGTGAA AAGGAAACTT ATCAGCTGTC251 TAAGGAAGTG GAAATTTTAT CTAGGAACCT GGTTGAAATG CAACGGTGTC301 TTTCTGAACT TACAAACCGT CTGCATAATG GGAAGAAATC CTCTTCAGTG351 TATCCACTCT CTCAAGATCT TCCTTATGTT CACATCATTT ACCAGAAACC401 TTATTATCTA GGTCCTGTTG TTGAAAAAAG AGCGGTGCTT CTCTGTGATG451 GTAAACTAAG GCTCAGTACA GTTCAGCAGA CTTTTGGCCT TTCTCTCATT501 GAAATGCTAC ATGATTCCCA CTGGATTCTT CTCTCTGCTG ACAGTGAGGG551 CTTTATCCCG TTAACCTTCA CAGCCACACA GGAAATAATC ATAAGAGATG601 GCAGCCTGTC CAGGTCAGAT GTCTTCAGAG ACTCTTTTTC TCACAGTCCA651 GGTGCTGTTT CTTCTCTTAA AGTCTTTACA GGCCTTGCTG CCCCCAGTTT701 AGATACCACT GGCTGTTGTA ACCATGTAGA TGGCATGGCT TGATATCTGC751 AGTGTCCTTG CTGTGTAGCT CTTCAGATGA GACCATTACA AACAAGGCCT801 GCTTGACACT GGACACTCGC CAATGAGACT CCCACTGCAC TCAGGCGAAG851 CGCTTGCCAT GGTCGGCTCT CCTGGTTTCC CCCTGTTTCC CCTGAGCTGA901 GGCTCGCTGC TGTGTAGCAG AGCTCAGTCT TTATTAGATG GCTCCGAAAG951 TGGTGTTTAT GTATTCATGA CTGTGTGGTT TTGACTAAGG GCAGAATTCT1001 CAGAACAAAA CAATATTATG GTGCCATATG GATGGTGTTT TATGGTTTCT1051 CTGAGGCTTT GTGTCCCTTG TCCAAAGCTG CATTGAAGCT GTCTTAGGAG1101 CACTTAAAAG ATACCTTGGC ATTGTTATAG GTCTTTTTCT TGGCTTCAAG1151 AGGAGGTTGA GGAGTCTGCT GGGGGGCATG TGCTCTAGCA TATTAACCTC1201 AAACCAGCAA AGAATTAGCA GAGCTCCAAG GAGGACCAAG AGACCCACTG1251 GCTTCTGCTC TCAGGAACAG GAAGTGGCTC TGATGTTGCC TGGACCTCCC1301 AGAATTTAAA CCAAACCCTC TTGCTTCCTT AACAAATTCT GGCTGACGAA1351 GGTCCAGGTA CTCTTAAAAA CTGGCCCTGG GAAAATTTTG AATGAAATTT1401 CAAGGGAATT TGTCCCCTCT GGGTTCCACT TGAGGTTGTG CCGATGCTGC1451 TACCACACTG TCGAGCCCAG GTAAGTCCTA CTGCAGGATT TTGTGCTGTG1501 GCCACTCATG AGTGTCCCTG AAATAACTTT TTTTTTTTTT AAATCCAGTT1551 TTGGGATCAC GCAACTTTCC TATTTTTCTC CCAGTAGTCA GCTCCCTTAG1601 TTAACTTGTC ACTTTAATTT GATATTTTTA TTTTCTCTCC TTTTAAGTCT1651 TAGAGACCAG CAGAGAATCT GTGAGAGAAA GTATTTCAGG AAGTTAGAAA1701 TTCAACCGAA TCTGAGGTAG TCCTAAAAAG TGCCATTTTG TTTCACTTAT1751 GGGCTAAAGT ACCAGCTTAG TCAGGTAAGA GCCCTGACCC ACTTCAGATG1801 GTAACACCAC TTCTCACTGC CTTCAGATGG AATCACAGAT TTCAGTCACG1851 GCGCATAACA AATTGATCAG TGAGTGGCTA GGCATCTGCA GATAAATTGT1901 TTCAGCCATA GAAGCTCCAT TAGCACATAT GCTTCCTTTT CCCCCCTTCC1951 TTTAAAATCA TCTGGAAAGA AACTATTTTG TGCCCTTGGG GACTCCTGTC2001 TGTCTGTTAC AGTTTACCAA GATGGAGCTG GGTTAGGAAA GAAGTGAGGG2051 CCCATTTTGT GGTTCAAGTG CACTAGACAG CTGCTGGGGT AGGAAGCACA2101 GGCAATGTCT GCAATCAGCT GTGGGAGAGC GGTGACTGAG AACAGTCTGA2151 GGCCTGGCTC CACTTGGAAG TATCTGGGGT GCGATGAAAT CACAATTATC2201 TTGAAGCCTA AAGAGGGAAC TACAAGACTG TTAACTAAGA TCAATGTGGG2251 CACCTAAAAG GGTATGTTAA AATCACCATT TCTCAGGTCA AAATACTGTG2301 AATAAGTCTT CAATAAAATC ACTAATGGTT AAAAAAAAAA AAAAAAAAAA2351 AAAAAAAB氨基酸序列(SEQ ID NO:8)長度234個氨基酸1 MGRPGRTLIK EIQSPLSSIC DGSIALDAEP VTQPASLPRH SSTPDHTSTL51 EPPRLPQRKN LQSEKETYQL SKEVEILSRN LVEMQRCLSE LTNRLHNGKK101 SSSVYPLSQD LPYVHIIYQK PYYLGPVVEK RAVLLCDGKL RLSTVQQTFG151 LSLIEMLHDS HWILLSADSE GFIPLTFTAT QEIIIRDGSL SRSDVFRDSF201 SHSPGAVSSL KVFTGLAAPS LDTTGCCNHV DGMAC核苷酸及氨基酸組合序列(SEQ ID NO:9)克隆號和蛋白名稱PP384起始編碼子39 ATG終止編碼子743 TGA蛋白質(zhì)分子量25844.051 CA AAG GGC TGT TGC TGA CAG TTA ATA CCA GTA GTC AGA ATG GGA AGG 471 Met Gly Arg 348 CCT GGA AGA ACA CTT ATT AAA GAA ATC CAG AGT CCT CTG TCT AGT ATC 954 Pro Gly Arg Thr Leu Ile Lys Glu Ile Gln Ser Pro Leu Ser Ser Ile 1996 TGT GAT GGC TCC ATA GCT CrA GAT GCT GAG CCT GTT ACC CAG CCA GCA 14320 Cys Asp Gly Ser Ile Ala Leu Asp Ala Glu Pro Val Thr Gln Pro Ala 35144 TCG CTG CCC AGA CAC AGC AGC ACA CCA GAC CAC ACC AGC ACA CTG GAG 19136 Ser Leu Pro Arg His Ser Ser Thr Pro Asp His Thr Ser Thr Leu Glu 51192 CCT CCT CGT TTG CCT CAA AGA AAG AAC TTA CAA AGT GAA AAG GAA ACT 23952 Pro Pro Arg Leu Pro Gln Arg Lys Asn Leu Gln Ser Glu Lys Glu Thr 67240 TAT CAG CTG TCT AAG GAA GTG GAA ATT TTA TCT AGG AAC CTG GTT GAA 28768 Tyr Gln Leu Ser Lys Glu Val Glu Ile Leu Ser Arg Asn Leu Val Glu 83288 ATG CAA CGG TGT CTT TCT GAA CTT ACA AAC CGT CTG CAT AAT GGG AAG 33584 Met Gln Arg Cys Leu Ser Glu Leu Thr Ash Arg Leu His Asn Gly Lys 99336 AAA TCC TCT TCA GTG TAT CCA CTC TCT CAA GAT CTT CCT TAT GTT CAC 383100 Lys Ser Ser Ser Val Tyr Pro Leu Ser Gln Asp Leu Pro Tyr Val His 115384 ATC ATT TAC CAG AAA CCT TAT TAT CTA GGT CCT GTT GTT GAA AAA AGA 431116 Ile Ile Tyr Gln Lys Pro Tyr Tyr Leu Gly Pro Val Val Glu Lys Arg 131432 GCG GTG CTT CTC TGT GAT GGT AAA CTA AGG CTC AGT ACA GTT CAG CAG 479132 Ala Val Leu Leu Cys Asp Gly Lys Leu Arg Leu Ser Thr Val Gln Gln 147480 ACT TTT GGC CTT TCT CTC ATT GAA ATG CTA CAT GAT TCC CAC TGG ATT 527148 Thr Phe Gly Leu Ser Leu Ile Glu Met Leu His Asp Ser His Trp Ile 163528 CTT CTC TCT GCT GAC AGT GAG GGC TTT ATC CCG TTA ACC TTC ACA GCC 575164 Leu Leu Ser Ala Asp Ser Glu Gly Phe Ile Pro Leu Thr Phe Thr Ala 179576 ACA CAG GAA ATA ATC ATA AGA GAT GGC AGC CTG TCC AGG TCA GAT GTC 623180 Thr Gln Glu Ile Ile Ile Arg Asp Gly Ser Leu Ser Arg Ser Asp Val 195624 TTC AGA GAC TCT TTT TCT CAC AGT CCA GGT GCT GTT TCT TCT CTT AAA 671196 Phe Arg Asp Ser Phe Ser His Ser Pro Gly Ala Val Ser Ser Leu Lys 211672 GTC TTT ACA GGC CTT GCT GCC CCC AGT TTA GAT ACC ACT GGC TGT TGT 719212 Val Phe Thr Gly Leu Ala Ala Pro Ser Leu Asp Thr Thr Gly Cys Cys 227720 AAC CAT GTA GAT GGC ATG GCT TGA TAT CTG CAG TGT CCT TGC TGT GTA 767228 Asn His Val Asp Gly Met Ala *** 235768 GCT CTT CAG ATG AGA CCA TTA CAA ACA AGG CCT GCT TGA CAC TGG ACA 815816 CTC GCC AAT GAG ACT CCC ACT GCA CTC AGG CGA AGC GCT TGC CAT GGT 863864 CGG CTC TCC TGG TTT CCC CCT GTT TCC CCT GAG CTG AGG CTC GCT GCT 911912 GTG TAG CAG AGC TCA GTC TTT ATT AGA TGG CTC CGA AAG TGG TGT TTA 959960 TGT ATT CAT GAC TGT GTG GTT TTG ACT AAG GGC AGA ATT CTC AGA ACA10071008 AAA CAA TAT TAT GGT GCC ATA TGG ATG GTG TTT TAT GGT TTC TCT GAG10551056 GCT TTG TGT CCC TTG TCC AAA GCT GCA TTG AAG CTG TCT TAG GAG CAC11031104 TTA AAA GAT ACC TTG GCA TTG TTA TAG GTC TTT TTC TTG GCT TCA AGA11511152 GGA GGT TGA GGA GTC TGC TGG GGG GCA TGT GCT CTA GCA TAT TAA CCT11991200 CAA ACC AGC AAA GAA TTA GCA GAG CTC CAA GGA GGA CCA AGA GAC CCA12471248 CTG GCT TCT GCT CTC AGG AAC AGG AAG TGG CTC TGA TGT TGC CTG GAC12951296 CTC CCA GAA TTT AAA CCA AAC CCT CTT GCT TCC TTA ACA AAT TCT GGC13431344 TGA CGA AGG TCC AGG TAC TCT TAA AAA CTG GCC CTG GGA AAA TTT TGA13911392 ATG AAA TTT CAA GGG AAT TTG TCC CCT CTG GGT TCC ACT TGA GGT TGT14391440 GCC GAT GCT GCT ACC ACA CTG TCG AGC CCA GGT AAG TCC TAC TGC AGG14871488 ATT TTG TGC TGT GGC CAC TCA TGA GTG TCC CTG AAA TAA CTT TTT TIT15351536 TTT TTA AAT CCA GTT TTG GGA TCA CGC AAC TTT CCT ATT TTT CTC CCA15831584 GTA GTC AGC TCC CTT AGT TAA CTT GTC ACT TTA ATT TGA TAT TTT TAT16311632 TTT CTC TCC TTT TAA GTC TTA GAG ACC AGC AGA GAA TCT GTG AGA GAA16791680 AGT ATT TCA GGA AGT TAG AAA TTC AAC CGA ATC TGA GGT AGT CCT AAA17271728 AAG TGC CAT TTT GTT TCA CTT ATG GGC TAA AGT ACC AGC TTA GTC AGG17751776 TAA GAG CCC TGA CCC ACT TCA GAT GGT AAC ACC ACT TCT CAC TGC CTT18231824 CAG ATG GAA TCA CAG ATT TCA GTC ACG GCG CAT AAC AAA TTG ATC AGT18711872 GAG TGG CTA GGC ATC TGC AGA TAA ATT GTT TCA GCC ATA GAA GCT CCA19191920 TTA GCA CAT ATG CTT CCT TTT CCC CCC TTC CTT TAA AAT CAT CTG GAA19671968 AGA AAC TAT TTT GTG CCC TTG GGG ACT CCT GTC TGT CTG TTA CAG TTT20152016 ACC AAG ATG GAG CTG GGT TAG GAA AGA AGT GAG GGC CCA TTT TGT GGT20632064 TCA AGT GCA CTA GAC AGC TGC TGG GGT AGG AAG CAC AGG CAA TGT CTG21112112 CAA TCA GCT GTG GGA GAG CGG TGA CTG AGA ACA GTC TGA GGC CTG GCT21592160 CCA CTT GGA AGT ATC TGG GGT GCG ATG AAA TCA CAA TTA TCT TGA AGC22072208 CTA AAG AGG GAA CTA CAA GAC TGT TAA CTA AGA TCA ATG TGG GCA CCT22552256 AAA AGG GTA TGT TAA AAT CAC CAT TTC TCA GGT CAA AAT ACT GTG AAT23032304 AAG TCT TCA ATA AAA TCA CTA ATG GTT AAA AAA AAA AAA AAA AAA AAA23512352 AAA AAA2357DBlastp結(jié)果Query=PP384[基因=PP384](234個氨基酸)>PIR2:S06286 major merozoite surface antigen precursor-Plasmodiumfalciparum(strain RO-33 Ghana)(fragment)長度=1060分值=34.8 bits(78),預計值=0.68相同性=20/70(28%),相似性=36/70(50%),缺口=3/70(4%)Query:55 LPQRKNLQSE---KETYQLSKEVEILSRNLVEMQRCLSELTNRLHNGKKSSSVYPLSQDL 111+ Q KN +E K+ YQ ++ I ++ L E+S L R+ KK+ ++ L+ D+Sbjct:251 IDQNKNADNEEGKKKLYQAQYDLFIYNKQLQEAHNLISVLEKRIDTLKKNENIKKLLEDI 310Query:112 PYVHIIYQKP 121+ I +KPSbjct:311 DKIKIDAEKP 320>SP_IN:P90922 P90922 caenorhabditis elegans.k07a12.4 protein.
5/1999長度=936分值=32.8 bits(73),預計值=2.6相同性=22/56(39%),相似性=31/56(55%),缺口=4/56(7%)Query:34 PASLPRHSSTPDHTSTLEPP-RLPQRKNLQSEKETYQLSK---EVEILSRNLVEMQ 85P +LSTP +S L P R PQ KNLQ+E T +S+ EV++ S++QSbjct:421 PKNLNSRPSTPQTSSNLNTPKRTPQVKNLQAESTTPTVSRPSSEVDLTSFRRNQLQ 476>SW:YQU3_CAEEL Q09550 caenorhabditis elegans.hypothetical 133.5 kdprotein f26c11.3 in chromosome ⅱ.11/1995長度=1251分值=32.5 bits(72),預計值=3.4相同性=27/108(25%),相似性=47/108(43%),缺口=18/108(16%)Query:8 LIKEIQSPLSSICDGSIALDAEPVTQPASLPRHSSTPDHTSTLEPP---------------53L++ I +P+ I DAE + +S P SST H++T PSbjct:462 LMQLIYNPRTKETRTEITSDAEGCKKTSSTPTPSSTSVHSTTATPSTTPGTTTYNWPTGG 521Query:54 ---RLPQRKNLQSEKETY-QLSKEVEILSRNLVEMQRCLSELTNRLHN 97
LP + + + Y Q+ K+++ILS +L+C + L ++NSbjct:522 TTRMLPSGEIVGFDLHLYAQVRKKLQILSESLIAYPNCTTVLMQLIYN 5694.PP432A核苷酸序列(SEQ ID NO:10)長度1615bp1 GGCGCGCCCG CTCCCAAGTC GGCTTCCTCC CCGCCGGGGC CGCTTTGCCT51 CGGGTCTCCC CATTCTCCAG GTCCCCTGAA CTGCACAGTC GGAGGCCGTG101 GGCGGCGGGC TCTGCCTCCG CCGAGGGACA GCCGGATCGC CCCTCTGCTT151 CCCGCAACTG CCCTGATCAC CCCCCGTCCC AGCCCTTGAG TGAACGTCCT201 TCTGAGCGGC TTCCTGGGGT CCTCCCCACG TCCCAAAGGC CGGCAAGATG251 GTGTCCTGGA TGATCTGTCG CCTGGTGGTG CTGGTGTTTG GGATGCTGTG301 TCCAGCTTAT GCTTCCTATA AGGCTGTGAA GACCAAGAAC ATTCGTGAAT351 ATGTGCGGTG GATGATGTAC TGGATTGTTT TTGCACTCTT CATGGCAGCA401 GAGATCGTTA CAGACATTTT TATCTCCTGG TTCCCTTTCT ACTATGAGAT451 CAAGATGGCC TTCGTGCTGT GGCTGCTCTC ACCCTACACC AAGGGCGCCA501 GCCTGCTTTA CCGCAAGTTT GTCCACCCGT CCCTGTCCCG CCATGAGAAG551 GAGATCGACG CGTACATCGT GCAGGCCAAG GAGCGCAGCT ACGAGACCGT601 GCTCAGCTTC GGGAAGCGGG GCCTCAACAT TGCCGCCTCC GCTGCTGTGC651 AGGCTGCCAC CAAGAGTCAG GGGGCGCTGG CCGGCAGGCT GCAGAGCTTC701 TCCATGCAGG ACCTGCGCTC CATCTCTGAC GCACCTGCCC CTGCCTACCA751 TGACCCCCTC TACCTGGAGG ACCAGGTGTC CCACCGGAGG CCACCCATTG801 GGTACCGGGC CGGGGGCCTG CAGGACAGCG ACACCGAGGA TGAGTGTTGG851 TCAGATACTG AGGCAGTCCC CCGGGCGCCA GCCCGGCCCC GAGAGAAGCC901 CCTAATCCGC AGCCAGAGCC TGCGTGTGGT CAAGAGGAAG CCACCGGTGC951 GGGAGGGCAC CTCGCGCTCC CTGAAGGTTC GGACGAGGAA AAAGACTGTG1001 CCCTCAGACG TGGACAGCTA GGGTCTGCTG CATCTGCCCC CTTCTTACCT1051 CGTGCCCTGC AGGGCTCCAG GGCTATTTGG AGGGACCTTG GGCTGCACAT1101 CTGGCCTGCC TGCACCAGCT GCCTGGGCCC CACCCTCCTG ACTCCTGCTG1151 ATGGTTAAGG CCCGGAAGCA GAATGCTGCC AAGGCCACAA TGCAGGAATG1201 CACCCACATT GACCAAAGCA GCTGGGCCCA GGGTTCTATT TATTGCCTTG1251 CTCTGCCTCT CCTTCCCCGG TTGTGGGACA AGAACCCTCC CTTAACCCCT1301 GCAACCCTTC CTGAACCCCT GCAAATGAAA CCAAACGTCC ACCTGGGTGT1351 GTTCATTCCT TCCTGTCCTT CAAAAGTACT TGATAGCCTT TCATAAGGCC1401 TGGCACATGT GTCCTGGTTG TGTGTGTGTG TGTTGGTGAG TGAGGTCAGG1451 TTTGCGAGTG TTTTGATAAA TAAATACATA AAGGGGCAAA AAAAAAAAAA1501 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA1551 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA1601 AAAAAAAAAA AAAAAB氨基酸序列(SEQ ID NO:11)長度257個氨基酸1 MVSWMICRLV VLVFGMLCPA YASYKAVKTK NIREYVRWMM YWIVFALFMA51 AEIVTDIFIS WFPFYYEIKM AFVLWLLSPY TKGASLLYRK FVHPSLSRHE101 KEIDAYIVQA KERSYETVLS FGKRGLNIAA SAAVQAATKS QGALAGRLQS151 FSMQDLRSIS DAPAPAYHDP LYLEDQVSHR RPPIGYRAGG LQDSDTEDEC201 WSDTEAVPRA PARPREKPLI RSQSLRVVKR KPPVREGTSR SLKVRTRKKT251 VPSDVDSC核苷酸及氨基酸組合序列(SEQ ID NO:12)克隆號和蛋白名稱PP432起始編碼子248 ATG終止編碼子1021 TAG蛋白質(zhì)分子量29365.461G GCG CGC CCG CTC CCA AGT CGG CTT CCT CCC CGC CGG GGC CGC TTT 4647 GCC TCG GGT CTC CCC ATT CTC CAG GTC CCC TGA ACT GCA CAG TCG GAG 9495 GCC GTG GGC GGC GGG CTC TGC CTC CGC CGA GGG ACA GCC GGA TCG CCC 142143 CTC TGC TTC CCG CAA CTG CCC TGA TCA CCC CCC GTC CCA GCC CTT GAG 190191 TGA ACG TCC TTC TGA GCG GCT TCC TGG GGT CCT CCC CAC GTC CCA AAG 238239 GCC GGC AAG ATG GTG TCC TGG ATG ATC TGT CGC CTG GTG GTG CTG GTG 2861 Met Val Ser Trp Met Ile Cys Arg Leu Val Val Leu Val 13287 TTT GGG ATG CTG TGT CCA GCT TAT GCT TCC TAT AAG GCT GTG AAG ACC 33414 Phe Gly Met Leu Cys Pro Ala Tyr Ala Ser Tyr Lys Ala Val Lys Thr 29335 AAG AAC ATT CGT GAA TAT GTG CGG TGG ATG ATG TAC TGG ATT GTT TTT 38230 Lys Asn Ile Arg Glu Tyr Val Arg Trp Met Met Tyr Trp Ile Val Phe 45383 GCA CTC TTC ATG GCA GCA GAG ATC GTT ACA GAC ATT TTT ATC TCC TGG 43046 Ala Leu Phe Met Ala Ala Glu Ile Val Thr Asp Ile Phe Ile Ser Trp 61431 TTC CCT TTC TAC TAT GAG ATC AAG ATG GCC TTC GTG CTG TGG CTG CTC 47862 Phe Pro Phe Tyr Tyr Glu Ile Lys Met Ala Phe Val Leu Trp Leu Leu 77479 TCA CCC TAC ACC AAG GGC GCC AGC CTG CTT TAC CGC AAG TTT GTC CAC 52678 Ser Pro Tyr Thr Lys Gly Ala Ser Leu Leu Tyr Arg Lys Phe Val His 93527 CCG TCC CTG TCC CGC CAT GAG AAG GAG ATC GAC GCG TAC ATC GTG CAG 57494 Pro Ser Leu Ser Arg His Glu Lys Glu Ile Asp Ala Tyr Ile Val Gln 109575 GCC AAG GAG CGC AGC TAC GAG ACC GTG CTC AGC TTC GGG AAG CGG GGC 622110 Ala Lys Glu Arg Ser Tyr Glu Thr Val Leu Ser Phe Gly Lys Arg Gly 125623 CTC AAC ATT GCC GCC TCC GCT GCT GTG CAG GCT GCC ACC AAG AGT CAG 670126 Leu Asn Ile Ala Ala Ser Ala Ala Val Gln Ala Ala Thr Lys Ser Gln 141671 GGG GCG CTG GCC GGC AGG CTG CAG AGC TTC TCC ATG CAG GAC CTG CGC 718142 Gly Ala Leu Ala Gly Arg Leu Gln Ser Phe Ser Met Gln Asp Leu Arg 157719 TCC ATC TCT GAC GCA CCT GCC CCT GCC TAC CAT GAC CCC CTC TAC CTG 766158 Ser Ile Ser Asp Ala Pro Ala Pro Ala Tyr His Asp Pro Leu Tyr Leu 173767 GAG GAC CAG GTG TCC CAC CGG AGG CCA CCC ATT GGG TAC CGG GCC GGG 814174 Glu Asp Gln Val Ser His Arg Arg Pro Pro Ile Gly Tyr Arg Ala Gly 189815 GGC CTG CAG GAC AGC GAC ACC GAG GAT GAG TGT TGG TCA GAT ACT GAG 862190 Gly Leu Gln Asp Ser Asp Thr Glu Asp Glu Cys Trp Ser Asp Thr Glu 205863 GCA GTC CCC CGG GCG CCA GCC CGG CCC CGA GAG AAG CCC CTA ATC CGC 910206 Ala Val Pro Arg Ala Pro Ala Arg Pro Arg Glu Lys Pro Leu Ile Arg 221911 AGC CAG AGC CTG CGT GTG GTC AAG AGG AAG CCA CCG GTG CGG GAG GGC 958222 Ser Gln Ser Leu Arg Val Val Lys Arg Lys Pro Pro Val Arg Glu Gly 237959 ACC TCG CGC TCC CTG AAG GTT CGG ACG AGG AAA AAG ACT GTG CCC TCA1006238 Thr Ser Arg Ser Leu Lys Val Arg Thr Arg Lys Lys Thr Val Pro Ser 2531007 GAC GTG GAC AGC TAG GGT CTG CTG CAT CTG CCC CCT TCT TAC CTC GTG1054254 Asp Val Asp Ser *** 2581055 CCC TGC AGG GCT CCA GGG CTA TTT GGA GGG ACC TTG GGC TGC ACA TCT11021103 GGC CTG CCT GCA CCA GCT GCC TGG GCC CCA CCC TCC TGA CTC CTG CTG11501151 ATG GTT AAG GCC CGG AAG CAG AAT GCT GCC AAG GCC ACA ATG CAG GAA11981199 TGC ACC CAC ATT GAC CAA AGC AGC TGG GCC CAG GGT TCT ATT TAT TGC12461247 CTT GCT CTG CCT CTC CTT CCC CGG TTG TGG GAC AAG AAC CCT CCC TTA12941295 ACC CCT GCA ACC CTT CCT GAA CCC CTG CAA ATG AAA CCA AAC GTC CAC13421343 CTG GGT GTG TTC ATT CCT TCC TGT CCT TCA AAA GTA CTT GAT AGC CTT13901391 TCA TAA GGC CTG GCA CAT GTG TCC TGG TTG TGT GTG TGT GTG TTG GTG14381439 AGT GAG GTC AGG TTT GCG AGT GTT TTG ATA AAT AAA TAC ATA AAG GGG14861487 CAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA15341535 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA15821583 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA1615DBlastp結(jié)果Query=PP432[基因=PP432](257個氨基酸)>SW:YSV4_CAEEL Ql00l0 caenorhabditis elegans.hypothetical 26.6 kdprotein t19c3.4 in chromosome ⅲ.11/1997長度=229分值=163 bits(409),預計值=1e-39相同性=77/163(47%),相似性=111/163(67%),缺口=2/163(1%)Query:2 VSWMICRLVVLVFGMLCPAYASYKAVKTKNIREYVRWMMYWIVFALFMAAEIVTDIFIS- 60+S + RL+++ G L PAY SYKAV+TK+ REYV+WMMYWIVFA++ E + D+ ++Sbjct:1 MSETLSRLLIITAGTLYPAYRSYKAVRTKDTREYVKWMMYWIVFAIYSFLENLLDLVLAF 60Query:61 WFPFYYEIKMAFVLWLLSPYTKGASLLYRKFVHPSLSRHEKEIDAYIVQAKERSYETVLS 120WFPFY+++K+ F+ WLLSP+TKGAS+LYRK+VHP+L+RHEK+IDA + AK SY ++Sbjct:61 WFPFYFQLKIVFIFWLLSPWTKGASILYRKWVHPTLNRHEKDIDALLESAKSESYNQLMR 120Query:121 FGKRGLNIXXXXXXXXXTKSQGALAGRLQ-SFSMQDLRSISDA 162G + L +Q L +LQ S+S D+ S +ASbjct:121 IGSKSLVYAKDVVAEAAVRGQQQLVNQLQRSYSANDVGSEREA 163>PIR2:T16888 hypothetical protein T19C3.4-Caenorhabditis elegans長度=229分值=163 bits(409),預計值=1e-39相同性=77/163(47%),相似性=111/163(67%),缺口=2/163(1%)Query:2 VSWMICRLVVLVFGMLCPAYASYKAVKTKNIREYVRWMMYWIVFALFMAAEIVTDIFIS-60+S + RL+++ G L PAY SYKAV+TK+ REYV+WMMYWIVFA++ E + D+ ++Sbjct:1 MSETLSRLLIITAGTLYPAYRSYKAVRTKDTREYVKWMMYWIVFAIYSFLENLLDLVLAF60Query:61 WFPFYYEIKMAFVLWLLSPYTKGASLLYRKFVHPSLSRHEKEIDAYIVQAKERSYETVLS 120WFPFY+++K+ F+ WLLSP+TKGAS+LYRK+VHP+L+RHEK+IDA + AK SY ++Sbjct:61 WFPFYFQLKIVFIFWLLSPWTKGASILYRKWVHPTLNRHEKDIDALLESAKSESYNQLMR 120Query:121 FGKRGLNIXXXXXXXXXTKSQGALAGRLQ-SFSMQDLRSISDA 162G + L +Q L +LQ S+S D+ S +ASbjct:121 IGSKSLVYAKDVVAEAAVRGQQQLVNQLQRSYSANDVGSEREA 1635.PP552A核苷酸序列(SEQ ID NO:1 3)長度1786bp1 TGGTGGCGCA TGTCTGTAAT CCCAGCTACT CGGGAAGCTG AGGCAGGAGA51 ATCGCTTGAA CCCAGGAAGC GGAGGTTGCA GTGAGCCGAG ATCGCGCCAC101 TGCACTCCAA CCTGGGCAAC AATACAAGAC TCCATCTGAA AAAAAAAAGA151 TCACACAGGA AAACAGAAGT TCGATTTTAC GTCGTACACT GCTGTAATTT201 CAGCACATGT GGACTCGTGT AACCAACACC ATAACCTTCC ATCACCCCTG251 AAACTCCCTC CCGCCAGCCC TTTAGGGTTG CCCCTCCCCC CGAACCCCAC301 CAGCCCCTGG TGACCACTGA TCTGTCCTCC AACCCATAGT GTTTTTCCGG351 GAATGTCACA AAAACAGAAG CCGACCATGG GTCACCTTTC TGGCGCCTTT401 CTCCCCGCAC AAAGTCTTTG TCCTTGTGAA GTTGTCACGT GCCAAACGCT451 TGTCCCTTTT TCCTGCTGGG TAATACTCCC GGTGCCGCCC TTGCTGTTCG501 TCGATGCACA TCTGGCTGCT TTTCGCTGGC TGCGAGCGGA GCTGCTAGGG551 ACATGGCCAC GGGGCTGTGA GAGCGGAGTT TCCTCTCTCC GGTGACCCTG601 AGCTGCGCCT TTCTCAGCCG CCTCCCGAGG CCCCAGGCGC TCTGCGGGGG651 CTCTGGCGGG GTTGGTGGGG GTGGGCGTTC TCGTTGTTTC AGCGGCGCTG701 CCCCAGGCCC TGCGGGAGGG ACCGTGGGAC CCGAGACATC CCCGCCTGGC751 CTCCGCTCCC CACCCGGGAG TGGGGCTCGC ACCCCCCCAA CCTCGGGTAA801 AGACGCTTCT GGAAGGAAGG GCGCCCCGCG GACCCCGCCC AACCCTGCCC851 AGCCCAGCCC AGCCCAGCCC AGCCCTTCCC GGGGCGGCGG CGCGGGAAGC901 AGGCGGCGGC GCACGGGCGT CGTCATGGCA ACCCCACCGG CTCCGGGGGC951 CGGGACCGCT GCCCCCTCCG CCCCTCGACC CCCGCCCCCC CGCCCTTCCT1001 GGCTGCGGCT GGACCCGGCT GCGCGGGGCG CGAGGCTGCC TTTCCCGGGA1051 TCACCAGGGA CCACCCGGCG CGCTCCCCGG GAATCCGCAC CCCTGGCCCC1101 AGCGCTCCGG AGCGACCCGG GTCAGCCCCT GGCTGCCTGC AATGGGCCCC1151 CGGGCGAACC CCGGGCGGAC CCAGGAGTGA GCACCCGGTG CGCGGCAACG1201 ATGATCCCGC AAGGGAAGCT CACGGGAGGC AGGAGCTGTG GCAGCCGCCC1251 CAGGATGGGG CGCGGGGAGC GCGCTGAGCT GTCCTTTCCC GCAGCGGCCC1301 CGCGGTTGAA GCGTGGGCTT GGGTTTTGGT TTTTCTTCTG TGGCAACAGT1351 TCTGTTGAGA TATTACTCGC CTGCCATACA ACTCACCCAT TTTAAAAGTA1401 CACCTCAGGG GTCCTGCGTG TATTGACAAA CCCGCCGCCG TCACCACAGC1451 CAATTTCAGA ACATTTTCAT CTCTTCAAAA GAAACCCTGT ACCCTTCAGC1501 TGTCACCCTC CTGGTCCCCA TCCGGTCCTC GTCCCGCCCT CAGCAGCCAC1551 GCACTGCCTG TAAAGTCCCC TGTCCTGCCC TGTAGGTGGA ATCTATACCT1601 TGGGGTCTGT TCTGACGTTC ACCTAACAGC CTTTCCAGGC TCAGCTGTGC1651 TATTGTATGG ACCAGGGGGT TGTTTTGTTT TTGTTGTTTG TTGATTGTGT1701 GTGTGTGTGT GTGTGTGTGT GTGAGCCTGG CGTGGTTGCG GGCGCCTATA1751 ATCCCAGCTG CTCAGGAGGC TGAGGCAGGA GGATCAB氨基酸序列(SEQ ID NO:14)長度156個氨基酸1 MATPPAPGAG TAAPSAPRPP PPRPSWLRLD PAARGARLPF PGSPGTTRRA51 PRESAPLAPA LRSDPGQPLA ACNGPPGEPR ADPGVSTRCA ATMIPQGKLT101 GGRSCGSRPR MGRGERAELS FPAAAPRLKR GLGFWFFFCG NSSVEILLAC151 HTTHPFC核苷酸及氨基酸組合序列(SEQ ID NO:15)克隆號和蛋白名稱PP552起始編碼子925 ATG終止編碼子1395 TAA蛋白質(zhì)分子量16177.611 TGG TGG CGC ATG TCT GTA ATC CCA GCT ACT CGG GAA GCT GAG GCA GGA 4849 GAA TCG CTT GAA CCC AGG AAG CGG AGG TTG CAG TGA GCC GAG ATC GCG 9697 CCA CTG CAC TCC AAC CTG GGC AAC AAT ACA AGA CTC CAT CTG AAA AAA 144145 AAA AGA TCA CAC AGG AAA ACA GAA GTT CGA TTT TAC GTC GTA CAC TGC 192193 TGT AAT TTC AGC ACA TGT GGA CTC GTG TAA CCA ACA CCA TAA CCT TCC 240241 ATC ACC CCT GAA ACT CCC TCC CGC CAG CCC TTT AGG GTT GCC CCT CCC 288289 CCC GAA CCC CAC CAG CCC CTG GTG ACC ACT GAT CTG TCC TCC AAC CCA 336337 TAG TGT TTT TCC GGG AAT GTC ACA AAA ACA GAA GCC GAC CAT GGG TCA 384385 CCT TTC TGG CGC CTT TCT CCC CGC ACA AAG TCT TTG TCC TTG TGA AGT 432433 TGT CAC GTG CCA AAC GCT TGT CCC TTT TTC CTG CTG GGT AAT ACT CCC 480481 GGT GCC GCC CTT GCT GTT CGT CGA TGC ACA TCT GGC TGC TTT TCG CTG 528529 GCT GCG AGC GGA GCT GCT AGG GAC ATG GCC ACG GGG CTG TGA GAG CGG 576577 AGT TTC CTC TCT CCG GTG ACC CTG AGC TGC GCC TTT CTC AGC CGC CTC 624625 CCG AGG CCC CAG GCG CTC TGC GGG GGC TCT GGC GGG GTT GGT GGG GGT 672673 GGG CGT TCT CGT TGT TTC AGC GGC GCT GCC CCA GGC CCT GCG GGA GGG 720721 ACC GTG GGA CCC GAG ACA TCC CCG CCT GGC CTC CGC TCC CCA CCC GGG 768769 AGT GGG GCT CGC ACC CCC CCA ACC TCG GGT AAA GAC GCT TCT GGA AGG 816817 AAG GGC GCC CCG CGG ACC CCG CCC AAC CCT GCC CAG CCC AGC CCA GCC 864865 CAG CCC AGC CCT TCC CGG GGC GGC GGC GCG GGA AGC AGG CGG CGG CGC 912913 ACG GGC GTC GTC ATG GCA ACC CCA CCG GCT CCG GGG GCC GGG ACC GCT 9601 Met Ala Thr Pro Pro Ala Pro Gly Ala Gly Thr Ala 12961 GCC CCC TCC GCC CCT CGA CCC CCG CCC CCC CGC CCT TCC TGG CTG CGG100813 Ala Pro Ser Ala Pro Arg Pro Pro Pro Pro Arg Pro Ser Trp Leu Arg 281009 CTG GAC CCG GCT GCG CGG GGC GCG AGG CTG CCT TTC CCG GGA TCA CCA105629 Leu Asp Pro Ala Ala Arg Gly Ala Arg Leu Pro Phe Pro Gly Ser Pro 441057 GGG ACC ACC CGG CGC GCT CCC CGG GAA TCC GCA CCC CTG GCC CCA GCG110445 Gly Thr Thr Arg Arg Ala Pro Arg Glu Ser Ala Pro Leu Ala Pro Ala 601105 CTC CGG AGC GAC CCG GGT CAG CCC CTG GCT GCC TGC AAT GGG CCC CCG115261 Leu Arg Ser Asp Pro Gly Gln Pro Leu Ala Ala Cys Asn Gly Pro Pro 761153 GGC GAA CCC CGG GCG GAC CCA GGA GTG AGC ACC CGG TGC GCG GCA ACG120077 Gly Glu Pro Arg Ala Asp Pro Gly Val Ser Thr Arg Cys Ala Ala Thr 921201 ATG ATC CCG CAA GGG AAG CTC ACG GGA GGC AGG AGC TGT GGC AGC CGC124893 Met Ile Pro Gln Gly Lys Leu Thr Gly Gly Arg Ser Cys Gly Ser Arg 1081249 CCC AGG ATG GGG CGC GGG GAG CGC GCT GAG CTG TCC TTT CCC GCA GCG1296109 Pro Arg Met Gly Arg Gly Glu Arg Ala Glu Leu Ser Phe Pro Ala Ala 1241297 GCC CCG CGG TTG AAG CGT GGG CTT GGG TTT TGG TTT TTC TTC TGT GGC1344125 Ala Pro Arg Leu Lys Arg Gly Leu Gly Phe Trp Phe Phe Phe Cys Gly 1401345 AAC AGT TCT GTT GAG ATA TTA CTC GCC TGC CAT ACA ACT CAC CCA TTT1392141 Asn Ser Ser Val Glu Ile Leu Leu Ala Cys His Thr Thr His Pro Phe 1561393 TAA AAG TAC ACC TCA GGG GTC CTG CGT GTA TTG ACA AAC CCG CCG CCG1440157 *** 1571441 TCA CCA CAG CCA ATT TCA GAA CAT TTT CAT CTC TTC AAA AGA AAC CCT14881489 GTA CCC TTC AGC TGT CAC CCT CCT GGT CCC CAT CCG GTC CTC GTC CCG15361537 CCC TCA GCA GCC ACG CAC TGC CTG TAA AGT CCC CTG TCC TGC CCT GTA15841585 GGT GGA ATC TAT ACC TTG GGG TCT GTT CTG ACG TTC ACC TAA CAG CCT16321633 TTC CAG GCT CAG CTG TGC TAT TGT ATG GAC CAG GGG GTT GTT TTG TTT16801681 TTG TTG TTT GTT GAT TGT GTG TGT GTG TGT GTG TGT GTG TGT GAG CCT17281729 GGC GTG GTT GCG GGC GCC TAT AAT CCC AGC TGC TCA GGA GGC TGA GGC17761777 AGG AGG ATC A 17866.PP591A核苷酸序列(SEQ ID NO:16)長度1838bp1 GAAAGAGCCG GTGAAGGGGC AGAACAGGCA GGTTCCCTCG ACCCAGGACC51 CCCTGTTCCC AGGCTATGGC CCCCAGTGCC CTGTAGACCT GGCAGGCCCC101 CCGTGCTTGC GACCCCTATT TGGGGGTCTG GGTGGCTACT GGAGGGCCTT151 GCAGAGGGGC AGAGAAGGCA GGACCATGAC ATCTAGGGCC TCTGAACTTT201 CTCCGGGGCG CAGCGTGACG GCTGGCATCA TCATTGTTGG AGATGAGATC251 CTTAAGTTGG AAACAACAAA TGGCTTTTGA GTCCAAGAGT GATGCAATCA301 CAGTGACGCA TTAAAACGGT TACTCCGGAG ACATCAGAGC ACTGTGGCTG351 GAGGCTGGGA GCCTGGCCAG GAAGCTGTCG CCATTGTCCA GGTGAAAGGT401 GCTAAGGACC TGCTTGGTGG CAGTGGGGAC AGAAAGAAGA AAGCAGGCCA451 GGCGTGGTGG CTCACACCTA TAATTCCAGC ACTTTGGGAG GCTGAGGCAG501 GAGGATCACT TGAGACCAGG AATTCAACAC CAGCCTGGGC AACATGGCAA551 GACCCCATTT CTACAAAAAA AATTTAAAAT GAGCTGAATG TGGTGGCACG601 CGCCTGTAGT CCCAGCTACT CGGAAGGCTG GGGTGGCCCT TGAAGCCAGG651 AGGTTGAGGC TGCAGTGAAC TGTGACTGAG CCACTATACT CCAGCCTGGG701 TGACAGAGAC CCAGCTTTAA AACCAAACAA AFGGATTTTC CCACTCTTGT751 GTCCAGTCCA GGCCCCTCAG CAGCCTGAGG TGGTGTCCTT CAAAGAGCAG801 AGCACTGCAT CATCAGGTGG ATGCAGCCAT CATCTTCAAC CCCTCCCCTT851 CATCCCTACA GTACTGATGG CCTCATCTTC CCCTTCAACC CCCAGGGACA901 CACTCAGGAC ACCAACACCT TCTTTCTGTG CCGGACACTG CGCTCCCTAG951 GGGTCCAGGT TTGCCGAGTC TCAGTTGTAC CTGATGAGGT AGCCACCATT1001 GCAGCTGAGG TCACTTCTTT CTCCAACCGC TTCACCCATG TCCTCACAGC1051 AGGGGGCATC GGCCCCACTC ATGATGATGT GACCTTTGAG GCAGTGGCAC1101 AGGCCTTTGG AGATGAGCTG AAGCCACACC CCAAGTTGGA AGCAGCCACC1151 AAAGCCCTAG GAGGGGAAGG CTGGGAGAAG CTATCATTGG TGCCCTCCTC1201 TGCCCGCCTG CATTATGGCA CAGATCCTTG CACTGGTCAA CCTTTCAGAT1251 TCCCTCTGGT CTCCGTCCGA AACGTCTACC TCTTCCCAGG CATTCCAGAG1301 CTGCTGCGGC GGGTGCTGGA GGGGATGAAG GGACTATTCC AAAACCCAGC1351 TGTTCAGTTC CACTCAAAGG AGCTATATGT GGCTGCTGAT GAAGCCTCCA1401 TCGCCCCCAT TCTGGCTGAG GCCCAGGCCC ACTTTGGACG TAGGCTTGGC1451 CTGGGTTCCT ACCCTGACTG GGGCAGCAAC TACTATCAGG TGAAGCTGAC1501 TCTAGACTCA GAGGAAGAAG GACCCCTGGA GGAATGCTTG GCCTACCTGA1551 CTGCCCGTTT GCCCCAGGGA TCGCTGGTCC CCTACATGCC CAACGCTGTG1601 GAGCAGGCCA GTGAGGCTGT ATACAAACTC GCTGAATCAG GTAGGGACCT1651 TATGGAGGAG GGGCATTATG CCCAAAGCCA TTGGTGGCAC CCCAGATCTC1701 AGTAATGCAG GGGCTGTTGG GTGCTTCCTG CAAATCCCTG AGAGGGCAGA1751 AGATAGCTTC TGTTAATTCA TTATTCTTCC AATAAATGTT GATTGAGTAC1801 CTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAB氨基酸序列(SEQ ID NO:17)長度294個氨基酸1 MQPSSSTPPL HPYSTDGLIF PFNPQGHTQD TNTFFLCRTL RSLGVQVCRV51 SVVPDEVATI AAEVTSFSNR FTHVLTAGGI GPTHDDVTFE AVAQAFGDEL101 KPHPKLEAAT KALGGEGWEK LSLVPSSARL HYGTDPCTGQ PFRFPLVSVR151 NVYLFPGIPE LLRRVLEGMK GLFQNPAVQF HSKELYVAAD EASIAPILAE201 AQAHFGRRLG LGSYPDWGSN YYQVKLTLDS EEEGPLEECL AYLTARLPQG251 SLVPYMPNAV EQASEAVYKL AESGRDLMEE GHYAQSHWWH PRSQC核苷酸及氨基酸組合序列(SEQ ID NO:18)克隆號和蛋白名稱PP591起始編碼子821 ATG終止編碼子1705 TAA蛋白質(zhì)分子量32367.761G AAA GAG CCG GTG AAG GGG CAG AAC AGG CAG GTT CCC TCG ACC CAG 4647 GAC CCC CTG TTC CCA GGC TAT GGC CCC CAG TGC CCT GTA GAC CTG GCA 9495 GGC CCC CCG TGC TTG CGA CCC CTA TTT GGG GGT CTG GGT GGC TAC TGG 142143 AGG GCC TTG CAG AGG GGC AGA GAA GGC AGG ACC ATG ACA TCT AGG GCC 190191 TCT GAA CTT TCT CCG GGG CGC AGC GTG ACG GCT GGC ATC ATC ATT GTT 238239 GGA GAT GAG ATC CTT AAG TTG GAA ACA ACA AAT GGC TTT TGA GTC CAA 286287 GAG TGA TGC AAT CAC AGT GAC GCA TTA AAA CGG TTA CTC CGG AGA CAT 334335 CAG AGC ACT GTG GCT GGA GGC TGG GAG CCT GGC CAG GAA GCT GTC GCC 382383 ATT GTC CAG GTG AAA GGT GCT AAG GAC CTG CTT GGT GGC AGT GGG GAC 430431 AGA AAG AAG AAA GCA GGC CAG GCG TGG TGG CTC ACA CCT ATA ATT CCA 478479 GCA CTT TGG GAG GCT GAG GCA GGA GGA TCA CTT GAG ACC AGG AAT TCA 526527 ACA CCA GCC TGG GCA ACA TGG CAA GAC CCC ATT TCT ACA AAA AAA ATT 574575 TAA AAT GAG CTG AAT GTG GTG GCA CGC GCC TGT AGT CCC AGC TAC TCG 622623 GAA GGC TGG GGT GGC CCT TGA AGC CAG GAG GTT GAG GCT GCA GTG AAC 670671 TGT GAC TGA GCC ACT ATA CTC CAG CCT GGG TGA CAG AGA CCC AGC TTT 718719 AAA ACC AAA CAA ATG GAT TTT CCC ACT CTT GTG TCC AGT CCA GGC CCC 766767 TCA GCA GCC TGA GGT GGT GTC CTT CAA AGA GCA GAG CAC TGC ATC ATC 814815 AGG TGG ATG CAG CCA TCA TCT TCA ACC CCT CCC CTT CAT CCC TAC AGT 8621 Met Gln Pro Ser Ser Ser Thr Pro Pro Leu His Pro Tyr Ser 14863 ACT GAT GGC CTC ATC TTC CCC TTC AAC CCC CAG GGA CAC ACT CAG GAC 91015 Thr Asp Gly Leu Ile Phe Pro Phe Asn Pro Gln Gly His Thr Gln Asp 30911 ACC AAC ACC TTC TTT CTG TGC CGG ACA CTG CGC TCC CTA GGG GTC CAG 95831 Thr Asn Thr Phe Phe Leu Cys Arg Thr Leu Arg Ser Leu Gly Val Gln 46959 GTT TGC CGA GTC TCA GTT GTA CCT GAT GAG GTA GCC ACC ATT GCA GCT100647 Val Cys Arg Val Ser Val Val Pro Asp Glu Val Ala Thr Ile Ala Ala 621007 GAG GTC ACT TCT TTC TCC AAC CGC TTC ACC CAT GTC CTC ACA GCA GGG105463 Glu Val Thr Ser Phe Ser Asn Arg Phe Thr His Val Leu Thr Ala Gly 781055 GGC ATC GGC CCC ACT CAT GAT GAT GTG ACC TTT GAG GCA GTG GCA CAG110279 Gly Ile Gly Pro Thr His Asp Asp Val Thr Phe Glu Ala Val Ala Gln 941103 GCC TTT GGA GAT GAG CTG AAG CCA CAC CCC AAG TTG GAA GCA GCC ACC115095 Ala Phe Gly Asp Glu Leu Lys Pro His Pro Lys Leu Glu Ala Ala Thr 1101151 AAA GCC CTA GGA GGG GAA GGC TGG GAG AAG CTA TCA TTG GTG CCC TCC1198111 Lys Ala Leu Gly Gly Glu Gly Trp Glu Lys Leu Ser Leu Val Pro Ser 1261199 TCT GCC CGC CTG CAT TAT GGC ACA GAT CCT TGC ACT GGT CAA CCT TTC1246127 Ser Ala Arg Leu His Tyr Gly Thr Asp Pro Cys Thr Gly Gln Pro Phe 1421247 AGA TTC CCT CTG GTC TCC GTC CGA AAC GTC TAC CTC TTC CCA GGC ATT1294143 Arg Phe Pro Leu Val Ser Val Arg Asn Val Tyr Leu Phe Pro Gly Ile 1581295 CCA GAG CTG CTG CGG CGG GTG CTG GAG GGG ATG AAG GGA CTA TTC CAA1342159 Pro Glu Leu Leu Arg Arg Val Leu Glu Gly Met Lys Gly Leu Phe Gln 1741343 AAC CCA GCT GTT CAG TTC CAC TCA AAG GAG CTA TAT GTG GCT GCT GAT1390175 Asn Pro Ala Val Gln Phe His Ser Lys Glu Leu Tyr Val Ala Ala Asp 1901391 GAA GCC TCC ATC GCC CCC ATT CTG GCT GAG GCC CAG GCC CAC TTT GGA1438191 Glu Ala Ser Ile Ala Pro Ile Leu Ala Glu Ala Gln Ala His Phe Gly 2061439 CGT AGG CTT GGC CTG GGT TCC TAC CCT GAC TGG GGC AGC AAC TAC TAT1486207 Arg Arg Leu Gly Leu Gly Ser Tyr Pro Asp Trp Gly Ser Asn Tyr Tyr 2221487 CAG GTG AAG CTG ACT CTA GAC TCA GAG GAA GAA GGA CCC CTG GAG GAA1534223 Gln Val Lys Leu Thr Leu Asp Ser Glu Glu Glu Gly Pro Leu Glu Glu 2381535 TGC TTG GCC TAC CTG ACT GCC CGT TTG CCC CAG GGA TCG CTG GTC CCC1582239 Cys Leu Ala Tyr Leu Thr Ala Arg Leu Pro Gln Gly Ser Leu Val Pro 2541583 TAC ATG CCC AAC GCT GTG GAG CAG GCC AGT GAG GCT GTA TAC AAA CTC1630255 Tyr Met Pro Asn Ala Val Glu Gln Ala Ser Glu Ala Val Tyr Lys Leu 2701631 GCT GAA TCA GGT AGG GAC CTT ATG GAG GAG GGG CAT TAT GCC CAA AGC1678271 Ala Glu Ser Gly Arg Asp Leu Met Glu Glu Gly His Tyr Ala Gln Ser 2861679 CAT TGG TGG CAC CCC AGA TCT CAG TAA TGC AGG GGC TGT TGG GTG CTT1726287 His Trp Trp His Pro Arg Ser Gln *** 2951727 CCT GCA AAT CCC TGA GAG GGC AGA AGA TAG CTT CTG TTA ATT CAT TAT17741775 TCT TCC AAT AAA TGT TGA TTG AGT ACC TAA AAA AAA AAA AAA AAA AAA18221823 AAA AAA AAA AAA AAA A 1838DBlastp結(jié)果Query=PP591[基因=PP591](294個氨基酸)>SP_IN:Q22017 Q22017 caenorhabditis elegans.r53.1 protein.1/1999長度=519分值=132 bits(329),預計值=3e-30相同性=86/260(33%),相似性=138/260(53%),缺口=28/260(10%)Query:25 QGHTQDTNTFFLCRTLRSLGVQVCRVSVVPDEVATIAAEVTSFSNRFTHVLTAGGIGPTH 84+G T+DTN+ FLC+ L LGV + ++SV+ D+++ I+ EV S S + +V+T+GG+GPTHSbjct:27 KGTTRDTNSHFLCKRLHKLGVNIRKISVIGDDISEISREVQSASGAYDYVITSGGVGPTH 86Query:85 DDVTFEAVAQAFGDELKPH-----------PKLEAATKALG-GEG--------WEKLSLV 124DD T+ +A AF D+++ P A +AG GEG EKL +Sbjct:87 DDKTYLGLAHAFTDQMQFSDEIRQAVNRFLPTYTAKKRAEGVGEGLEEAVRLATEKLCTI 146Query:125 PSSARLHYGTDPCTGQPFRFPLVSVRNVYLFPGIPELLRRVLEGMKG-LFQNPAVQFHS- 182P ++L +GTGFP+V + NV PG+P+ R + ++ LF P + SSbjct:147 PKMSQLLWGTQKINGSLSTFPVVRISNVVALPGVPKFCERAFDELQDQLF--PIEERQSL 204Query:183 --KELYVAADEASIAPILAEAQAHF-GRRLGLGSYPDWGSNYYQVKLTLDSEEEGPLEEC 239+ LY DE + L +A F R + +GSYP+ + +++ KLT+++E+ESbjct:205 CFETLYTDLDEFDFSKKLTDLAAQFEDRNVQIGSYPELKNKFFKTKLTIETESSETMEAV 264Query:240 LAYLTARLPQGSLVPYMPNA 259+ L L G +V Y +ASbjct:265 VTSL-RELLAGHIVYYDSHA 283>SW:YM44_YEAST Q03219 saccharomyces cerevisiae(baker's yeast).
hypothetical 31.1 kd protein in sip18-spt21 intergenicregion.11/1997長度=274分值=67.9 bits(163),預計值=1e-10相同性=49/234(20%),相似性=106/234(44%),缺口=27/234(11%)Query:26 GHTQDTNTFFLCRTLRSLGVQVCRVSVVPDEVATIAAEVTSFSNRFTHVLTAGGIGPTHD 85G DTN+ F + G+Q+ ++ + D+ I V + +++ GGIGPTHDSbjct:18 GKVVDTNSTFFADYCFDHGIQLKEIATIGDDETQIVDTVRRLVKNYDFIISTGGIGPTHD 77Query:86 DVTFEAVAQAFG----------DELKPHPKLEAATKALGGEGWEKLSLVP--SSARLHYG 133D+T+E +A++F + ++ EA A + +++ +P ++ + +YSbjct:78 DITYECMAKSFNLPCELDEECKERMRHKSDPEARLDADALKAHYQMATMPKGTNVKNYYV 137Query:134 TDPCTGQPFRFPLVSV-RNVYLFPGIPELLRRVLEG-------MKGLFQNPA--VQFHSK 183D P+ S+ +Y +PGIP+L R+L++ L ++P V++ +Sbjct:138 CD-----DLWVPICSISHKMYILPGIPQLFARMLKAFTPTLKKIYNLDKDPREYVRYFVR 192Query:184 ELYVAADEASIAPILAEAQAHFGRRLGLGSYPDWGSNYYQVKLTLDSEEEGPLE 237++ +++ + +GSYP +G + V + + +++ L+Sbjct:193 THLTESQISKELKLIQDESTKVSEAIKIGSYPHFGMGFNTVSILGEKKDDSYLK 2467.PP603A核苷酸序列(SEQ ID NO:19)長度1619bp1 GCGCGGCGCG CTTAGTTGCC GGAGCTGAAC GGCGCGGAGC TGGTCTGAGG51 CGAGCCGAGC CGAGCGAGCG CGGCGGTGGG GCCGAGAGGA CGCGCAGGTG101 GCGGCGTTGC CATGTCGCAC GGTCACAGCC ACGGCGGGGG TGGCTGCCGC151 TGCGCCGCCG AACGGGAGGA GCCGCCCGAG CAGCGCGGCC TGGCCTACGG201 CCTGTACCTG CGCATCGACC TGGAGCGGCT GCAATGCCTT AACGAGAGCC251 GCGAGGGCAG CGGCCGCGGC GTCTTCAAGC CGTGGGAGGA GCGGACCGAC301 CGCTCCAAGT TTGTTGAAAG TGATGCAGAT GAAGAGCTTC TGTTTAATAT351 TCCATTTACG GGCAATGTCA AGCTCAAAGG CATCATTATA ATGGGAGAGG401 ATGATGACTC ACACCCCTCT GAGATGAGAC TGTACAAGAA TATTCCACAG451 ATGTCCTTTG ATGATACAGA AAGGGAGCCA GATCAGACCT TTAGTCTGAA501 CCGGGATCTT ACAGGAGAAT TAGAGTATGC TACAAAAATT TCTCGTTTTT551 CAAATGTCTA TCATCTCTCA ATTCATATTT CAAAAAACTT CGGAGCAGAT601 ACGACAAAGG TCTTTTATAT TGGCCTGAGA GGAGAGTGGA CTGAGCTTCG651 CCGACACGAG GTGACCATCT GCAATTACGA AGCATCTGCC AACCCAGCAG701 ACCATAGGGT CCATCAGGTT ACCCCACAGA CACACTTTAT TTCCTAAGGG751 CTGGCCAAGG CTCCCATAGA GGCGCTGTGT CAGTGAAGAT GTACGACTAC801 CTGTTGGGAA GGACAAAGGG ATGAGGCTCC AGAGAGAGTT GGCTGCCACA851 GCCTCTGCCA AGCTTTGTCT TTGGGGCTTG CTGCAGAAAC CTGGCCTACG901 GAAGATACGA CACCACTGGG AGGGTTGTGT AGGTGCCAGG GGACCATCGT951 GGTTCTCTAG GGCGCTGTGG AAATTGGGTC TTGGGCTGGG TGGCATCTGG1001 CAGTCATGGA TAACACTTGC TTTTCCAGTT AATGTGGCCA TGTGATTCCA1051 AGTGTCATGT TGCTTTGTGG CAAGATTGTT GTGTGACTTG TTTTTTTGTT1101 TTTGTTTTTG TTTTTTTAAA GGAAACTATT TGTGGGCTAT AGGAAACTTT1151 CTGATGCCTC CGGATTGTGT TAGTAGTAGC CATCAGGAGG GTCTCCAACT1201 AAAACACTTG TTCCTGCTTG CTCCTTTCCC CTCTCATTGT TCAGCATTCT1251 TGTCAAGTTG CCCAGCTTGG AGTTGTCTGT CACGCACATG TGTCCTGTGG1301 TTATAGCTAG AAGGACAGGA GTCTCCTGCT GATGCGTGAT AGCTTAAGCT1351 TGGGGAGAAG GTCTTTTCCA CTGCCTAGCT AAGCAGTCTG GGGAGAGCAT1401 GGGGATCATT TCTATGTGTG TGGGTAATCT GGTCAGTAAG ATTGAGACTT1451 AGTTAAGATT CCCCTTGGAA ATTCCTTAAT GTTTATTAGC TTCTAACTAG1501 TGTTGTAAGT CCGATGCCAG AATTTGGAGA TTTGAGTTCT TCTTTTCATG1551 GCTTTTATTC ACTGTGACTA ATAAGCTTCC TAATAAATCC TTGCCAGACT1601 TAAAAAAAA AAAAAAAB氨基酸序列(SEQ ID NO:20)長度211個氨基酸1 MSHGHSHGGG GCRCAAEREE PPEQRGLAYG LYLRIDLERL QCLNESREGS51 GRGVFKPWEE RTDRSKFVES DADEELLFNI PFTGNVKLKG IIIMGEDDDS101 HPSEMRLYKN IPQMSFDDTE REPDQTFSLN RDLTGELEYA TKISRFSNVY151 HLSIHISKNF GADTTKVFYI GLRGEWTELR RHEVTICNYE ASANPADHRV201 HQVTPQTHFI SC核苷酸及氨基酸組合序列(SEQ ID NO:21)克隆號和蛋白名稱PP603起始編碼子112 ATG終止編碼子747 TAA蛋白質(zhì)分子量24176.621 GCG CGG CGC GCT TAG TTG CCG GAG CTG AAC GGC GCG GAG CTG GTC TGA 4849 GGC GAG CCG AGC CGA GCG AGC GCG GCG GTG GGG CCG AGA GGA CGC GCA 9697 GGT GGC GGC GTT GCC ATG TCG CAC GGT CAC AGC CAC GGC GGG GGT GGC 1441 Met Ser His Gly His Ser His Gly Gly Gly Gly 11145 TGC CGC TGC GCC GCC GAA CGG GAG GAG CCG CCC GAG CAG CGC GGC CTG 19212 Cys Arg Cys Ala Ala Glu Arg Glu Glu Pro Pro Glu Gln Arg Gly Leu 27193 GCC TAC GGC CTG TAC CTG CGC ATC GAC CTG GAG CGG CTG CAA TGC CTT 24028 Ala Tyr Gly Leu Tyr Leu Arg Ile Asp Leu Glu Arg Leu Gln Cys Leu 43241 AAC GAG AGC CGC GAG GGC AGC GGC CGC GGC GTC TTC AAG CCG TGG GAG 28844 Ash Glu Ser Arg Glu Gly Ser Gly Arg Gly Val Phe Lys Pro Trp Glu 59289 GAG CGG ACC GAC CGC TCC AAG TTT GTT GAA AGT GAT GCA GAT GAA GAG 33660 Glu Arg Thr Asp Arg Ser Lys Phe Val Glu Ser Asp Ala Asp Glu Glu 75337 CTT CTG TTT AAT ATT CCA TTT ACG GGC AAT GTC AAG CTC AAA GGC ATC 38476 Leu Leu Phe Asn Ile Pro Phe Thr Gly Asn Val Lys Leu Lys Gly Ile 91385 ATT ATA ATG GGA GAG GAT GAT GAC TCA CAC CCC TCT GAG ATG AGA CTG 43292 Ile Ile Met Gly Glu Asp Asp Asp Ser His Pro Ser Glu Met Arg Leu 107433 TAC AAG AAT ATT CCA CAG ATG TCC TTT GAT GAT ACA GAA AGG GAG CCA 480108 Tyr Lys Asn Ile Pro Gln Met Ser Phe Asp Asp Thr Glu Arg Glu Pro 123481 GAT CAG ACC TTT AGT CTG AAC CGG GAT CTT ACA GGA GAA TTA GAG TAT 528124 Asp Gln Thr Phe Ser Leu Asn Arg Asp Leu Thr Gly Glu Leu Glu Tyr 139529 GCT ACA AAA ATT TCT CGT TTT TCA AAT GTC TAT CAT CTC TCA ATT CAT 576140 Ala Thr Lys Ile Ser Arg Phe Ser Asn Val Tyr His Leu Ser Ile His 155577 ATT TCA AAA AAC TTC GGA GCA GAT ACG ACA AAG GTC TTT TAT ATT GGC 624156 Ile Ser Lys Asn Phe Gly Ala Asp Thr Thr Lys Val Phe Tyr Ile Gly 171625 CTG AGA GGA GAG TGG ACT GAG CTT CGC CGA CAC GAG GTG ACC ATC TGC 672172 Leu Arg Gly Glu Trp Thr Glu Leu Arg Arg His Glu Val Thr Ile Cys 187673 AAT TAC GAA GCA TCT GCC AAC CCA GCA GAC CAT AGG GTC CAT CAG GTT 720188 Asn Tyr Glu Ala Ser Ala Asn Pro Ala Asp His Arg Val His Gln Val 203721 ACC CCA CAG ACA CAC TTT ATT TCC TAA GGG CTG GCC AAG GCT CCC ATA 768204 Thr Pro Gln Thr His Phe Ile Ser *** 212769 GAG GCG CTG TGT CAG TGA AGA TGT ACG ACT ACC TGT TGG GAA GGA CAA 816817 AGG GAT GAG GCT CCA GAG AGA GTT GGC TGC CAC AGC CTC TGC CAA GCT 864865 TTG TCT TTG GGG CTT GCT GCA GAA ACC TGG CCT ACG GAA GAT ACG ACA 912913 CCA CTG GGA GGG TTG TGT AGG TGC CAG GGG ACC ATC GTG GTT CTC TAG 960961 GGC GCT GTG GAA ATT GGG TCT TGG GCT GGG TGG CAT CTG GCA GTC ATG10081009 GAT AAC ACT TGC TTT TCC AGT TAA TGT GGC CAT GTG ATT CCA AGT GTC10561057 ATG TTG CTT TGT GGC AAG ATT GTT GTG TGA CTT GTT TTT TTG TTT TTG 11041105 TTT TTG TTT TTT TAA AGG AAA CTA TTT GTG GGC TAT AGG AAA CTT TCT 11521153 GAT GCC TCC GGA TTG TGT TAG TAG TAG CCA TCA GGA GGG TCT CCA ACT 12001201 AAA ACA CTT GTT CCT GCT TGC TCC TTT CCC CTC TCA TTG TTC AGC ATT 12481249 CTT GTC AAG TTG CCC AGC TTG GAG TTG TCT GTC ACG CAC ATG TGT CCT 12961297 GTG GTT ATA GCT AGA AGG ACA GGA GTC TCC TGC TGA TGC GTG ATA GCT 13441345 TAA GCT TGG GGA GAA GGT CTT TTC CAC TGC CTA GCT AAG CAG TCT GGG 13921393 GAG AGC ATG GGG ATC ATT TCT ATG TGT GTG GGT AAT CTG GTC AGT AAG 14401441 ATT GAG ACT TAG TTA AGA TTC CCC TTG GAA ATT CCT TAA TGT TTA TTA 14881489 GCT TCT AAC TAG TGT TGT AAG TCC GAT GCC AGA ATT TG6 AGA TTT GAG 15361537 TTC TTC TTT TCA TGG CTT TTA TTC ACT GTG ACT AAT AAG CTT CCT AAT 15841585 AAA TCC TTG CCA GAC TTA AAA AAA AAA AAA AAA AA 1619DBlastp結(jié)果Query=PP603[基因=PP603](211個氨基酸)>SW:YOJ1_CAEEL P34624 caenorhabditis elegans.hypothetical 63.5 kdprotein zk353.1 in chromosome ⅲ.6/1994長度=548分值=179 bits(449),預計值=2e-44相同性=89/187(47%),相似性=124/187(65%),缺口=1/187(0%)Query:14 CAAER-EEPPEQRGLAYGLYLRIDLERLQCLNESREGSGRGVFKPWEERTDRSKFVESDA 72CAAE E P Y + ID+E++ LNE +G+G+ VFK E+R DR ++VESDSbjct:350 CAAEHIPEVPGDDVYRYDMVSYIDMEKVTTLNESVDGAGKKVFKVMEKRDDRLEYVESDC 409Query:73 DEELLFNIPFTGNVKLKGIIIMGEDDDSHPSEMRLYKNIPQMSFDDTEREPDQTFSLNRD 132D ELLFNIPFTG+V+L G+ I+G++D SHP+++RL+K+ MSFDD E DQ L +DSbjct:410 DHELLFNIPFTGHVRLTGLSIIGDEDGSHPAKIRLFKDREAMSFDDCSIEADQEIDLKQD 469Query:133 LTGELEYATKISRFSNVYHLSIHISKNFGADTTKVFYIGLRGEWTELRRHEVTICNYEAS 192G ++Y K S+F N+++LSI + NFG D TK++YIGLRGE+R + I YE+Sbjct:470 PQGLVDYPLKASKFGNIHNLSILVDANFGEDETKIYYIGLRGEFQHEFRQRIAIATYESR 529Query:193 ANPADHR 199A DH+Sbjct:530 AQLKDHK 5368.PP632A核苷酸序列(SEQ ID N0:22)長度1854bp1 GGAGAGCCCG GCCCGCGGGC CGTCCGTCCC CCACAGGAAA CCGCCGGGGA51 GGCCGCGGCA GGGACCCGCC CCCAGGCCAC TAACAGCAAC AACAGAGAGG101 CTGGAGCTCT GCCTGCGTGC GGGCCAAGGG CTAAACCTTG GACAGGTTCT151 TTCACTTACT CCGCCTGACA ACCCTGCGAC GTGATACCAT TATCCCCACT201 TCGCAGATCA AATAAACGGA GTCTTGGAGA GATTGAATTG ACTTTACCAA251 AACCGTCAGG ATTTGAATCT GCTGCTCTCT GATCCTAAAG CCTGAGCTAG301 AAACCACCGC TCCCCCTCCT AGGAGGCCCC TTCCAGGGGC TTGCCGTGGC351 CAAGCCAGGC CAGGTGGGAG AAGCGGCAGC CTTGCCCTGG AGGGTTTTGA401 GAAGCACTGC TCCTGGAGGC CCTGGGGAAG GTCCCTGAAA CCTTTGGCCA451 ATGTGGCTGT CCCCATGGTC CACATGCCCT CCCCACCCCC TGCCTAGCTG501 CTTGACTGCC TGCTGCTCCC CAGCCCACCA GCCTGTCCGT GGGTCAGCCC551 AGCCACCCGC TTCGGATCTC TGCACGTGTG TCACCTGCTG TTCTGGCCCT601 CATCCCAACT ATCCACCTGC CCATCTCCTC CCTACCTCCT CGCTGCCTAT651 CTGCCCAGGA CTTATCTGCT GTCTGCTCAC CTGCCTGCTT GTTGACTGCT701 TCTCTGCCCT CCTATCTGCC TGTGAGACTA GAGATTTGTC ACCTTGGAAA751 GCACGGAGAG TACTGCTAAG ATGAAACACA GGAAGGACAG GCCTTGATGG801 AAGGTTGGGG GGCCGAGAGA TCCAGAGCCT ATGGGAGGGG ACTTGTGAGT851 GCTGGCATAT TCAGGACCCA GTGCAAACCC AAGCACAGCT CTGCTCCCGG901 CCCCAGTGGC CAAACTGAAG GCTTGCCCTG GCTATTCTGC CGTTGACATG951 GGCCTCACCC TACCACGGGG ATAGGTCTTG GATGGAGGGA AGAGGGAGAC1001 TCACCGGGGG CCTCCTGAGT CCTTTGAGTG TCCCCATGAC CCCAGCACCT1051 GGGACAGCTG CTGGAAAGAG GGTACTGGCA AAAATTTGCT AAATGGACAA1101 TCATAGGCCC AGTGTGGTGG CTCACGTCTG TAATCCCAGC ACTTTGGGAG1151 GCCGAGGTGT GCAGATCACT GAAGTCCAGG AGTTTGAGAC CAGCCTGGGC1201 AACATGGTGA AACCCCATTT CTACAGAAAA CTACAAAAAT TAGCTGGACA1251 CGGTAGCACA CACCTATAGT TCCTGCTACT CAGGAGGCTA AGGTGGGAGG1301 ATCGCTTGAG CCCAGGAGAT CAAGGCTATG GTGAGCCGTG ATCGTGCCAC1351 TGTACTCCAG CCTGGATGAC AGAGGAAGAC CCTGTCTCAA AACAAACAAA1401 ACAACAGCAA CAACAAGAAA ACAATAATAG GGACATTGAG TACCCTTTCT1451 GGCACCTGGC ACTCTGCCAA ATGCTATGCA CACTCCGCCC TTCAGTCTTC1501 CCAGGAACCC TGTGCAGTTT GTAGCGTGGC TCACATTTGC CAAGAAGGAA1551 GTGAGGCTCA GCGAGGTTAA GCAGTGCCTG TGGAGTCACA TGGCTGCAAG1601 TAGTGGCCTG GACTGGACTG CAGAGCCCAT GCTCCCCACC GCTTTCCATG1651 GGGCAACTCT AGGCCATCAT TCTCCACCCC TCAGACCCAA AGCTGCCTTT1701 TCATAATGCT TGCTGTTGCT CCCTTTATGC TCCTGAAATG AAATTTATGG1751 CTAATATGCC AGCCTTTACA TCTAATTAAA AATCATCCAA TGGTTTTTTT1801 GTCCTTATTA ATATATAAGA AATAAAAGGT AATGATAAAA AAAAAAAAAA1851 AAAAB氨基酸序列(SEQ ID NO:23)長度107個氨基1 MWLSPWSTCP PHPLPSCLTA CCSPAHQPVR GSAQPPASDL CTCVTCCSGP51 HPNYPPAHLL PTSSLPICPG LICCLLTCLL VDCFSALLSA CETRDLSPWK101 ARRVLLRC.核苷酸及氨基酸組合序列(SEQ ID NO:24)克隆號和蛋白名稱PP632起始編碼子451 ATG終止編碼子774 TGA蛋白質(zhì)分子量11453.91 GGA GAG CCC GGC CCG CGG GCC GTC CGT CCC CCA CAG GAA ACC GCC GGG 4849 GAG GCC GCG GCA GGG ACC CGC CCC CAG GCC ACT AAC AGC AAC AAC AGA 9697 GAG GCT GGA GCT CTG CCT GCG TGC GGG CCA AGG GCT AAA CCT TGG ACA 144145 GGT TCT TTC ACT TAC TCC GCC TGA CAA CCC TGC GAC GTG ATA CCA TTA 192193 TCC CCA CTT CGC AGA TCA AAT AAA CGG AGT CTT GGA GAG ATT GAA TTG 240241 ACT TTA CCA AAA CCG TCA GGA TTT GAA TCT GCT GCT CTC TGA TCC TAA 288289 AGC CTG AGC TAG AAA CCA CCG CTC CCC CTC CTA GGA GGC CCC TTC CAG 336337 GGG CTT GCC GTG GCC AAG CCA GGC CAG GTG GGA GAA GCG GCA GCC TTG 384385 CCC TGG AGG GTT TTG AGA AGC ACT GCT CCT GGA GGC CCT GGG GAA GGT 432433 CCC TGA AAC CTT TGG CCA ATG TGG CTG TCC CCA TGG TCC ACA TGC CCT 4801 Met Trp Leu Ser Pro Trp Ser Thr Cys Pro 10481 CCC CAC CCC CTG CCT AGC TGC TTG ACT GCC TGC TGC TCC CCA GCC CAC 52811 Pro His Pro Leu Pro Ser Cys Leu Thr Ala Cys Cys Ser Pro Ala His 26529 CAG CCT GTC CGT GGG TCA GCC CAG CCA CCC GCT TCG GAT CTC TGC ACG 57627 Gln Pro Val Arg Gly Ser Ala Gln Pro Pro Ala Ser Asp Leu Cys Thr 42577 TGT GTC ACC TGC TGT TCT GGC CCT CAT CCC AAC TAT CCA CCT GCC CAT 62443 Cys Val Thr Cys Cys Ser Gly Pro His Pro Ash Tyr Pro Pro Ala His 58625 CTC CTC CCT ACC TCC TCG CTG CCT ATC TGC CCA GGA CTT ATC TGC TGT 67259 Leu Leu Pro Thr Ser Ser Leu Pro Ile Cys Pro Gly Leu Ile Cys Cys 74673 CTG CTC ACC TGC CTG CTT GTT GAC TGC TTC TCT GCC CTC CTA TCT GCC 72075 Leu Leu Thr Cys Leu Leu Val Asp Cys Phe Ser Ala Leu Leu Ser Ala 90721 TGT GAG ACT AGA GAT TTG TCA CCT TGG AAA GCA CGG AGA GTA CTG CTA 76891 Cys Glu Thr Arg Asp Leu Ser Pro Trp Lys Ala Arg Arg Val Leu Leu 106769 AGA TGA AAC ACA GGA AGG ACA GGC CTT GAT GGA AGG TTG GGG GGC CGA 816107 Arg *** 108817 GAG ATC CAG AGC CTA TGG GAG GGG ACT TGT GAG TGC TGG CAT ATT CAG 864865 GAC CCA GTG CAA ACC CAA GCA CAG CTC TGC TCC CGG CCC CAG TGG CCA 912913 AAC TGA AGG CTT GCC CTG GCT ATT CTG CCG TTG ACA TGG GCC TCA CCC 960961 TAC CAC GGG GAT AGG TCT TGG ATG GAG GGA AGA GGG AGA CTC ACC GGG10081009 GGC CTC CTG AGT CCT TTG AGT GTC CCC ATG ACC CCA GCA CCT GGG ACA10561057 GCT GCT GGA AAG AGG GTA CTG GCA AAA ATT TGC TAA ATG GAC AAT CAT11041105 AGG CCC AGT GTG GTG GCT CAC GTC TGT AAT CCC AGC ACT TTG GGA GGC11521153 CGA GGT GTG CAG ATC ACT GAA GTC CAG GAG TTT GAG ACC AGC CTG GGC12001201 AAC ATG GTG AAA CCC CAT TTC TAC AGA AAA CTA CAA AAA TTA GCT GGA12481249 CAC GGT AGC ACA CAC CTA TAG TTC CTG CTA CTC AGG AGG CTA AGG TGG12961297 GAG GAT CGC TTG AGC CCA GGA GAT CAA GGC TAT GGT GAG CCG TGA TCG13441345 TGC CAC TGT ACT CCA GCC TGG ATG ACA GAG GAA GAC CCT GTC TCA AAA13921393 CAA ACA AAA CAA CAG CAA CAA CAA GAA AAC AAT AAT AGG GAC ATT GAG14401441 TAC CCT TTC TGG CAC CTG GCA CTC TGC CAA ATG CTA TGC ACA CTC CGC14881489 CCT TCA GTC TTC CCA GGA ACC CTG TGC AGT TTG TAG CGT GGC TCA CAT15361537 TTG CCA AGA AGG AAG TGA GGC TCA GCG AGG TTA AGC AGT GCC TGT GGA15841585 GTC ACA TGG CTG CAA GTA GTG GCC TGG ACT GGA CTG CAG AGC CCA TGC16321633 TCC CCA CCG CTT TCC ATG GGG CAA CTC TAG GCC ATC ATT CTC CAC CCC16801681 TCA GAC CCA AAG CTG CCT TTT CAT AAT GCT TGC TGT TGC TCC CTT TAT17281729 GCT CCT GAA ATG AAA TTT ATG GCT AAT ATG CCA GCC TTT ACA TCT AAT17761777 TAA AAA TCA TCC AAT GGT TTT TTT GTC CTT ATT AAT ATA TAA GAA ATA18241825 AAA GGT AAT GAT AAA AAA AAA AAA AAA AAA1854DBlastp結(jié)果Query=PP632AA(107個氨基酸)>SP_IN:045021 045021 caenorhabditis elegans.zc123.1 protein.
11/1998長度=768分值=35.6 bits(80),預計值=0.16相同性=19/50(38%),相似性=21/50(42%),缺口=8/50(16%)Query:5 PWSTCPPHPLPSCLTACCSPAHQPVRGSAQPPASDLCTCVTCCSGPHPNY54P + CPP P P CC PA P PA+ CCC G P YSbjct:106 PLACCPPPPPPK---PCCQPAFGPCC-----PATPNCCPKPCCRGRRPEY 147>SP_IN:Q17982 Q17982 caenorhabditis elegans.similarity to erbb-3receptor protein-tyrosine kinase.11/1998長度=654分值=31.7 bits(70),預計值=2.3相同性=24/99(24%),相似性=32/99(32%),缺口=15/99(15%)Query:7 STCPPHPLPSCLTACCSPAHQPVRGS------------AQPPASDLCTCVTCCSGPHPNY 54S+C PSC+ C+PA QP S QPP SC + CPSbjct:352 SSCMPACQSSCVQQACAPACQPKCSSQCVEQQQAQIVVVQPPTSSSNNCASSCM---PQC 408Query:55 PPAHLLPTSSXXXXXXXXXXXXXXXXVDCFSALLSACET 93P + + C A L +CE+Sbjct:409 TPQCVQQQTICAAACQPSCQSSCSSNAQCVQACLPSCES 447>PIR2:A60533 tumor-associated antigen DF3-human長度=256分值=30.5 bits(67),預計值=5.2相同性=19/56(33%),相似性=25/56(43%),缺口=7/56(12%)Query:4 SPWSTCPP-HPLPSCLTACCSPAHQPVRGSAQPPASDLCTCVTCCSGPHPNYPPAH 58+P ST PP H + S +P +P GS PPA + + PPPAHSbjct:130 APGSTAPPAHRVTS----APESRPAPGSTAPPAHRVTSAPESRPAPGSTAPPAH 1799.PP844A核苷酸序列(SEQ ID NO:25)長度1843bp1 TGAAGGCCGA TGCTGTGGGG GTGGGCGTGG AGAGAATTCT TCTGTGGGTC51 CTCTGGTGTT GAGTGGTCGG CTTGGTGTGG TGTGCGGAGG AGCTCCAGGC101 CCGTCGGCGC GGAGGGTCTT GCTGTGTTGC CCAGCCTGGT CTTGAATTCC151 TGGACTCAAG TGATGCTCCT GCCTTGGCTT CCCAAACTCC TGGAATTACA201 ACTTGGTCTC ACGTGTGAAA CATGGCTACA GATTGGCTGG GAAGTATTGT251 GTCCATCAAT TGTGGAGATA GCTTGGGTGT CTATCAGGGA AGAGTGTCAG301 CTGTGGATCA GGTCAGCCAG ACCATTTCTC TCACCCGGCC TTTCCATAAT351 GGAGTGAAGT GTCTTGTTCC AGAAGTCACC TTCAGGGCAG GTGACATTAC401 GGAGTTAAAA ATTCTGGAGA TACCAGGACC TGGAGACAAC CAACATTTTG451 GAGACCTTCA TCAAACAGAA TTAGGCCCCT CTGGTGCTGG CTGCCAAGTG501 GGCATCAATC AGAATGGCAC AGGCAAGTTT GTCAAGAAGC CAGCCTCTTC551 CAGCAGTGCC CCTCAGAATA TCCCTAAGAG GACAGATGTG AAGAGCCAGG601 ATGTTGCCGT TTCCCCGCAG CAGCAACAGT GCTCAAAGAG CTATGTCGAC651 AGGCACATGG AATCCTTGAG TCAGTCCAAA AGTTTCCGTC GTCGGCACAA701 CTCCTGGTCA TCTAGTAGCA GGCACCCAAA TCAGGCAACT CCCAAGAAAA751 GTGGTTTAAA GAATGGCCAG ATGAAGAATA AAGATGACGA GTGCTTCGGG801 GATGATATTG AGGAGATCCC AGACACAGAT TTTGATTTTG AAGGGAACCT851 GGCTCTTTTT GACAAGGCAG CTGTGTTTGA GGAGATTGAT ACCTATGAAA901 GGAGAAGTGG TACCCGTTCC CGGGGCATCC CAAATGAAAG GCCCACTCGG951 TACCGCCATG ATGAGAACAT CTTGGAGTCC GAGCCCATTG TCTATCGACG1001 GATCATAGTG CCCCACAACG TGAGCAAGGA GTTCTGCACG GACTCTGGCC1051 TGGTTGTCCC AAGTATTTCC TATGAGCTGC ATAAAAAGCT GTTGTCCGTG1101 GCTGAGAAGC ATGGGCTGAC CCTTGAGCGG AGACTGGAGA TGACAGGTGT1151 GTGTGCCAGT CAGATGGCAC TGACCCTCCT CGGAGGACCT AACAGGTTGA1201 ATCCCAAAAA TGTTCACCAG AGGCCTACAG TGGCTCTACT GTGTGGACCT1251 CATGTGAAGG GGGCTCAGGG TATCAGCTGT GGAAGGCACC TAGCCAACCA1301 TGATGTCCAG GTCATCCTTT TCCTGCCCAA TTTTGTCAAG ATGTTGGAAT1351 CTATCACCAA TGAGCTGTCG CTCTTCAGCA AGACCCAAGG CCAACAAGTG1401 TCTAGCCTCA AAGATCTGCC CACTAGCCCT GTGGACCTGG TCATCAACTG1451 CCTGGATTGC CCTGAGAACG TCTTCCTGCG CGATCAACCC TGGTACAAGG1501 CAGCTGTGGC CTGGGCCAAC CAGAACCGGG CACCAGTACT CAGCATAGAC1551 CCTCCTGTGC ATGAAGTCGA ACAGGGCATT GATGCCAAAT GGTCACTGGC1601 ACTGGGCCTG CCTCTGCCAC TGGGGGAGCA CGCAGGCCGT ATCTATTTGT1651 GCGACATTGG CATTCCCCAG CAGGTCTTCC AGGAGGTGGG CATCAACTAC1701 CACTCGCCCT TTGGCTGCAA GTTTGTTATC CCACTGCACT CTGCTTAAAG1751 GGTTCCTGCG CAGGCAGGAC TCTGCTGTCC CCTGCTGCTC CTGATAACAA1801 ACGCGTTAAG GTTTTGTAAA AAAAAAAAAA AAAAAAAAAA AAAB氨基酸序列(SEQ ID NO:26)長度508個氨基酸1 MATDWLGSIV SINCGDSLGV YQGRVSAVDQ VSQTISLTRP FHNGVKCLVP51 EVTFRAGDIT ELKILEIPGP GDNQHFGDLH QTELGPSGAG CQVGINQNGT101 GKFVKKPASS SSAPQNIPKR TDVKSQDVAV SPQQQQCSKS YVDRHMESLS151 QSKSFRRRHN SWSSSSRHPN QATPKKSGLK NGQMKNKDDE CFGDDIEEIP201 DTDFDFEGNL ALFDKAAVFE EIDTYERRSG TRSRGIPNER PTRYRHDENI251 LESEPIVYRR IIVPHNVSKE FCTDSGLVVP SISYELHKKL LSVAEKHGLT301 LERRLEMTGV CASQMALTLL GGPNRLNPKN VHQRPTVALL CGPHVKGAQG351 ISCGRHLANH DVQVILFLPN FVKMLESITN ELSLFSKTQG QQVSSLKDLP401 TSPVDLVINC LDCPENVFLR DQPWYKAAVA WANQNRAPVL SIDPPVHEVE451 QGIDAKWSLA LGLPLPLGEH AGRIYLCDIG IPQQVFQEVG INYHSPFGCK501 FVIPLHSAC核苷酸及氨基酸組合序列(SEQ ID NO:27)克隆號和蛋白名稱PP844起始編碼子222 ATG終止編碼子1748 TAA蛋白質(zhì)分子量56074.681 TG AAG GCC GAT GCT GTG GGG GTG GGC GTG GAG AGA ATT CTT CTG TGG 4748 GTC CTC TGG TGT TGA GTG GTC GGC TTG GTG TGG TGT GCG GAG GAG CTC 9596 CAG GCC CGT CGG CGC GGA GGG TCT TGC TGT GTT GCC CAG CCT GGT CTT 143144 GAA TTC CTG GAC TCA AGT GAT GCT CCT GCC TTG GCT TCC CAA ACT CCT 191192 GGA ATT ACA ACT TGG TCT CAC GTG TGA AAC ATG GCT ACA GAT TGG CTG 2391 Met Ala Thr Asp Trp Leu 6240 GGA AGT ATT GTG TCC ATC AAT TGT GGA GAT AGC TTG GGT GTC TAT CAG 2877 Gly Ser Ile Val Ser Ile Asn Cys Gly Asp Ser Leu Gly Val Tyr Gln 22288 GGA AGA GTG TCA GCT GTG GAT CAG GTC AGC CAG ACC ATT TCT CTC ACC 33523 Gly Arg Val Ser Ala Val Asp Gln Val Ser Gln Thr Ile Ser Leu Thr 38336 CGG CCT TTC CAT AAT GGA GTG AAG TGT CTT GTT CCA GAA GTC ACC TTC 38339 Arg Pro Phe His Asn Gly Val Lys Cys Leu Val Pro Glu Val Thr Phe 54384 AGG GCA GGT GAC ATT ACG GAG TTA AAA ATT CTG GAG ATA CCA GGA CCT 43155 Arg Ala Gly Asp Ile Thr Glu Leu Lys Ile Leu Glu Ile Pro Gly Pro 70432 GGA GAC AAC CAA CAT TTT GGA GAC CTT CAT CAA ACA GAA TTA GGC CCC 47971 Gly Asp Asn Gln His Phe Gly Asp Leu His Gln Thr Glu Leu Gly Pro 86480 TCT GGT GCT GGC TGC CAA GTG GGC ATC AAT CAG AAT GGC ACA GGC AAG 52787 Ser Gly Ala Gly Cys Gln Val Gly Ile Asn Gln Asn Gly Thr Gly Lys 102528 TTT GTC AAG AAG CCA GCC TCT TCC AGC AGT GCC CCT CAG AAT ATC CCT 575103 Phe Val Lys Lys Pro Ala Ser Ser Ser Ser Ala Pro Gln Asn Ile Pro 118576 AAG AGG ACA GAT GTG AAG AGC CAG GAT GTT GCC GTT TCC CCG CAG CAG 623119 Lys Arg Thr Asp Val Lys Ser Gln Asp Val Ala Val Ser Pro Gln Gln 134624 CAA CAG TGC TCA AAG AGC TAT GTC GAC AGG CAC ATG GAA TCC TTG AGT 671135 Gln Gln Cys Ser Lys Ser Tyr Val Asp Arg His Met Glu Ser Leu Ser 150672 CAG TCC AAA AGT TTC CGT CGT CGG CAC AAC TCC TGG TCA TCT AGT AGC 719151 Gln Ser Lys Ser Phe Arg Arg Arg His Asn Ser Trp Ser Ser Ser Ser 166720 AGG CAC CCA AAT CAG GCA ACT CCC AAG AAA AGT GGT TTA AAG AAT GGC 767167 Arg His Pro Asn Gln Ala Thr Pro Lys Lys Ser Gly Leu Lys Asn Gly 182768 CAG ATG AAG AAT AAA GAT GAC GAG TGC FTC GGG GAT GAT ATT GAG GAG 815183 Gln Met Lys Asn Lys Asp Asp Glu Cys Phe Gly Asp Asp Ile Glu Glu 198816 ATC CCA GAC ACA GAT TTT GAT TTT GAA GGG AAC CTG GCT CTT TTT GAC 863199 Ile Pro Asp Thr Asp Phe Asp Phe Glu Gly Asn Leu Ala Leu Phe Asp 214864 AAG GCA GCT GTG TTT GAG GAG ATT GAT ACC TAT GAA AGG AGA AGT GGT 911215 Lys Ala Ala Val Phe Glu Glu Ile Asp Thr Tyr Glu Arg Arg Ser Gly 230912 ACC CGT TCC CGG GGC ATC CCA AAT GAA AGG CCC ACT CGG TAC CGC CAT 959231 Thr Arg Ser Arg Gly Ile Pro Asn Glu Arg Pro Thr Arg Tyr Arg His 246960 GAT GAG AAC ATC TTG GAG TCC GAG CCC ATT GTC TAT CGA CGG ATC ATA1007247 Asp Glu Asn Ile Leu Glu Ser Glu Pro Ile Val Tyr Arg Arg Ile Ile 2621008 GTG CCC CAC AAC GTG AGC AAG GAG TTC TGC ACG GAC TCT GGC CTG GTT1055263 Val Pro His Asn Val Ser Lys Glu Phe Cys Thr Asp Ser Gly Leu Val 2781056 GTC CCA AGT ATT TCC TAT GAG CTG CAT AAA AAG CTG TTG TCC GTG GCT1103279 Val Pro Ser Ile Ser Tyr Glu Leu His Lys Lys Leu Leu Ser Val Ala 2941104 GAG AAG CAT GGG CTG ACC CTT GAG CGG AGA CTG GAG ATG ACA GGT GTG1151295 Glu Lys His Gly Leu Thr Leu Glu Arg Arg Leu Glu Met Thr Gly Val 3101152 TGT GCC AGT CAG ATG GCA CTG ACC CTC CTC GGA GGA CCT AAC AGG TTG1199311 Cys Ala Ser Gln Met Ala Leu Thr Leu Leu Gly Gly Pro Asn Arg Leu 3261200 AAT CCC AAA AAT GTT CAC CAG AGG CCT ACA GTG GCT CTA CTG TGT GGA1247327 Asn Pro Lys Asn Val His Gln Arg Pro Thr Val Ala Leu Leu Cys Gly 3421248 CCT CAT GTG AAG GGG GCT CAG GGT ATC AGC TGT GGA AGG CAC CTA GCC1295343 Pro His Val Lys Gly Ala Gln Gly Ile Ser Cys Gly Arg His Leu Ala 3581296 AAC CAT GAT GTC CAG GTC ATC CTT TTC CTG CCC AAT TTT GTC AAG ATG1343359 Asn His Asp Val Gln Val Ile Leu Phe Leu Pro Asn Phe Val Lys Met 3741344 TTG GAA TCT ATC ACC AAT GAG CTG TCG CTC TTC AGC AAG ACC CAA GGC1391375 Leu Glu Ser Ile Thr Asn Glu Leu Ser Leu Phe Ser Lys Thr Gln Gly 3901392 CAA CAA GTG TCT AGC CTC AAA GAT CTG CCC ACT AGC CCT GTG GAC CTG1439391 Gln Gln Val Ser Ser Leu Lys Asp Leu Pro Thr Ser Pro Val Asp Leu 4061440 GTC ATC AAC TGC CTG GAT TGC CCT GAG AAC GTC TTC CTG CGC GAT CAA1487407 Val Ile Asn Cys Leu Asp Cys Pro Glu Asn Val Phe Leu Arg Asp Gln 4221488 CCC TGG TAC AAG GCA GCT GTG GCC TGG GCC AAC CAG AAC CGG GCA CCA1535423 Pro Trp Tyr Lys Ala Ala Val Ala Trp Ala Asn Gln Asn Arg Ala Pro 4381536 GTA CTC AGC ATA GAC CCT CCT GTG CAT GAA GTC GAA CAG GGC ATT GAT1583439 Val Leu Ser Ile Asp Pro Pro Val His Glu Val Glu Gln Gly Ile Asp 4541584 GCC AAA TGG TCA CTG GCA CTG GGC CTG CCT CTG CCA CTG GGG GAG CAC1631455 Ala Lys Trp Ser Leu Ala Leu Gly Leu Pro Leu Pro Leu Gly Glu His 4701632 GCA GGC CGT ATC TAT TTG TGC GAC ATT GGC ATT CCC CAG CAG GTC TTC1679471 Ala Gly Arg Ile Tyr Leu Cys Asp Ile Gly Ile Pro Gln Gln Val Phe 4861680 CAG GAG GTG GGC ATC AAC TAC CAC TCG CCC TTT GGC TGC AAG TTT GTT1727487 Gln Glu Val Gly Ile Asn Tyr His Ser Pro Phe Gly Cys Lys Phe Val 5021728 ATC CCA CTG CAC TCT GCT TAA AGG GTT CCT GCG CAG GCA GGA CTC TGC1775503 Ile Pro Leu His Ser Ala *** 5091776 TGT CCC CTG CTG CTC CTG ATA ACA AAC GCC TTA AGG TTT TGT AAA AAA18231824 AAA AAA AAA AAA AAA AAA AA 1843D:Blastp結(jié)果Query=PP844[基因=PP844](508個氨基酸)>SP_FUN:094752 094752 schizosaccharomyces pombe(fission yeast).
hypothetical 49.2 kd protein.5/1999長度=454分值=77.6 bits(188),預計值=2e-13相同性=80/347(23%),相似性=138/347(39%),缺口=63/347(18%)Query:201 DTDFDFEGNLALFDKAAVFEEIDTYERRSGTRSRGIPNERPTR-YRHDENILE--------252D +FDF NL FDK VF E+++ + N+ P R Y H +N+LSbjct:96 DEEFDFAANLEKFDKKQVFAEFREKDKKDPAKLLVSHNKSPNRNYHHKQNVLGPSVKDEF 155Query:253---------------------------------SEPIVYRRIIVPHNVSKEFCTDSGLVVP 280S + ++ V N+ E T +G ++Sbjct:156 VDLPSAGSQINGIDAVLSSSSNGHVTPGSKKGSRETLKKKPFVDENIPAELHTTTGDILK 215Query:281 SISYELHKKLLSVAEKHGLTLERRLEMTGVCASQMALTLLGGPNRLNPKNVHQRPTVALL 340I+ E + +++A T + +E SQ ++LGG RL+ +N + +P V +LSbjct:216 PITPEQLSQGIALAIAKTST-DIVVENAAQLLSQFVFSVLGGHKRLSSRNHNSQPLVCIL 274Query:341 CGPHVKGAQGISCGRHLANHDVQVILFLPNFVKMLESITNELSLFSKTQG----QQVSSL 396G H + ++ GR L++V+L L ++L +FG+Sbjct:275 VGSHDHASAAVAAGRRLCAIGIKVVLRL---LTPFNVDNRQLLMFQAAGGYIPTENFDQF 331Query:397 KDLPTSPVDLVINCLDCPENVFLRDQPWYKAAVAWANQNRAPVLSIDPP----VHEVEQG 452+ TSP++LV++ L++ A + WAN +LS+D PV +Sbjct:332 LNKLTSPIELVVDVLTGFHPSIDKNS---HALIQWANDLNVLILSVDIPSGYTVQKKNTA 388Query:453 IDAKWSLALGLPLPLGEHAG--------RIYLCDIGIPQQVFQEVGI 491I KW+LALGA +++ ++G Q + E+GISbjct:389 ILPKWTLALGAVTTTLAQAALVKQAAGVSVFVGNLGTGSQTWAELGI 435>SW:YNUO YEAST P40165 saccharomyces cerevisiae(baker's yeast).
hypothetical 27.5 kd protein in spx19-gcr2 intergenicregion.7/1998長度=246分值=35.6 bits(80),預計值=1.0相同性=51/212(24%),相似性=89/212(41%),缺口=28/212(13%)Query:277 LVVPSISYELHKKLLSVAEKHGLTLERRLEMTGVCASQMALTLLGGPNRLNPKNVHQRPT 336+V ++ E+ K+L++ G TL++ +E +G +QP R +Sbjct:6 VVSSKLAAEIDKELMG--PQIGFTLQQLMELAGFSVAQAVCRQF--PLR-GKTETEKGKH 60Query:337 VALLCGPHVKGAQGISCGRHLANHDVQVILFLP------NFVKMLESITN--ELSLFSKT 388V ++ GP G G+ C RHL ++F P F K LN ++ + S+Sbjct:61 VFVIAGPGNNGGDGLVCARHLKLFGYNPVVFYPKRSERTEFYKQLVHQLNFFKVPVLSQD 120Query:389 QGQQVSSLKDLPT-SPVDLVINCLDCPENVFLRDQPWYKAAV--AWANQNRAPVLSIDPP 445+G + LK T VD + P+R+ +K V QN P++S+D PSbjct:121 EGNWLEYLKPEKTLCIVDAIFGFSFKPP---MREP--FKGIVEELCKVQNIIPIVSVDVP 175Query:446 V-HEVEQG------IDAKWSLALGLPLPLGEH 470+V++G I+++L +P P HSbjct:176 TGWDVDKGPISQPSINPAVLVSLTVPKPCSSH 207>SP IN:P91255 P91255 caenorhabditis elegans.f12f3.2 protein.5/1999長度=2783分值=35.2 bits(79),預計值=1.3相同性=25/92(27%),相似性=39/92(42%),缺口=8/92(8%)Query:48 LVPEVTFRAGDITELKILEIPGPGDNQHFGDLHQTELGPSGAGCQVGINQNGTGKFVKKP 107+VP++ IL ++N F L +ELG + A CQV I KPSbjct:1536 IVPDEKIDVATTSTSSILNLKSQEENGTFNCLIENELGQASASCQVTI--------FNKP 1587Query:108 ASSSSAPQNIPKRTDVKSQDVAVSPQQQQCSK 139AS S P + +R V + A++ + Q +Sbjct:1588 ASLQSTPDHSLERNLVPTLQKALNNESAQAGQ 1619>SP_IN:Q21740 Q21740 caenorhabditis elegans.r05d11.8 protein.
1/1999長度=566分值=34.4 bits(77),預計值=2.3相同性=31/138(22%),相似性=64/138(45%),缺口=13/138(9%)Query:6 LGSIVSINCGDSLGVYQGRVSAVDQVSQTISLTRPFHNGV---KCLVPEVTFRAGDITEL 62+GS++SD VYQG+++ D + +++ NG+ +C T + DI +LSbjct:6 IGSVISTETKDG-NVYQGKLTTYDTNNGNLTMANVIKNGLPLHRCF----TLSSSDISRL 60Query:63 KILEIPGPGDNQHFGDLHQTELGPSGAGCQVGINQNGTGKFVKKPASSSSAPQNIPKRTD 122K+ I G + + +Q + ++ V +++SS+ ++P +Sbjct:61 KV--IRGATQSTQKSQPLPVQNSSNSVNKQRPIKKSAEST-VSSTSTASSSASSVPDSS- 116Query:123 VKSQDVAVSPQQQQCSKS 140+++ VAVSPQ++SSbjct:117 -RNRSVAVSPQKSAKGRS 13310.PP928A核苷酸序列(SEQ ID NO:28)長度1964bp1 GTCCAGCCCA GCCACTCACC CACCGAGAAC AGCAAAGGCC AAAGCCCACC51 CTCGAAGGAT GGGAGTGGTG ACTACCAGTC TCGTGGGCTG GTAAAGGCCC101 CTGGCAAGAG CTCGTTCACG ATGTTTGTGG ATCTAGGGAT CTACCAGCCT151 GGAGGCAGTG GGGACAGCAT CCCCATCACA GCCCTAGTGG GTGGAGAGGG201 CACTCGGCTC GACCAGCTGC AGTACGACGT GAGGAAGGGT TCTGTGGTCA251 ACGTGAATCC CACCAACACC CGGCCCACAG TGAGACCCCT GAGATCCGGA301 AGTACAAGAA GCGATTCAAC TCCGAGATCC TCTGTGCAGC CCTTTGGGGG351 GTCAACCTGC TGGTGGGCAC GGAGAACGGC TGATGTTGCT GGACCGAAGT401 GGGCAAGGCA AGGTGTATGG ACTCATTGGG CGGCGACGCT TCCAGCAGAT451 GGATGTGCTG GAGGGGCTCA ACCTGCTCAT CACCATCTCA GGGAAAAGGA501 ACAAACTGCG GGTGTATTAC CTGTCCTGGC TCCGGAACAA GATTCTGCAC551 AATGACCCAG AAGTGGAGAA GAAGCAGGGC TGGACCACCG TGGGGGACAT601 GGAGGGCTGC GGGCACTACC GTGTTGTGAA ATACGAGCGG ATTAAGTTCC651 TGGTCATCGC CCTCAAGAGC TCCGTGGAGG TGTATGCCTG GGCCCCCAAA701 CCCTACCACA AATTCATGGC CTTCAAGTCC TTTGCCGACC TCCCCCACCG751 CCCTCTGCTG GTCGACCTGA CAGTAGAGGA GGGGCAGCGG CTCAAGGTCA801 TCTATGGCTC CAGTGCTGGC TTCCATGCTG TGGATGTCGA CTCGGGGAAC851 AGCTATGACA TCTACATCCC TGTGCACATC CAGAGCCAGA TCACGCCCCA901 TGCCATCATC TTCCTCCCCA ACACCGACGG CATGGAGATG CTGCTGTGCT951 ACGAGGACGA GGGTGTCTAC GTCAACACGT ACGGGCGCAT CATTAAGGAT1001 GTGGTGCTGC AGTGGGGGGA GATGCCTACT TCTGTGGCCT ACATCTGCTC1051 CAACCAGATA ATGGGCTGGG GTGAGAAAGC CATTGAGATC CGCTCTGTGG1101 AGACGGGCCA CCTCGACGGG GTCTTCATGC ACAAACGAGC TCAGAGGCTC1151 AAGTTCCTGT GTGAGCGGAA TGACAAGGTG TTTTTTGCCT CAGTCCGCTC1201 TGGGGGCAGC AGCCAAGTTT ACTTCATGAC TCTGAACCGT AACTGCATCA1251 TGAACTGGTG ACGGGGCCCT GGGCTGGGGC TGTCCCACAC TGGACCCAGC1301 TCTCCCCCTG CAGCCAGGCT TCCCGGGCCG CCCCTCTTTC CCCTCCCTGG1351 GCTTTTGCTT TTACTGGTTT GATTTCACTG GAGCCTGCTG GGAACGTGAC1401 CTCTGACCCC TGATGCTTTC GTGATCACGT GACCATCCTC TTCCCCAACA1451 TGTCCTCTTC CCAAAACTGT GCCTGTCCCC AGCTTCTGGG GAGGGACACA1501 GCTTTCCCTT CCCAGGAATT GAGTGGGCCT AGCCCCTCCC CCCTTTTCTC1551 CATTTGAGAG GAGAGTGCTT GGGGCTTGAA CCCCTTACCC CACTCCAGGG1601 GCAGGGACCA TTTCTTCATT TTCTGAAAGC ACTTTAATGA TTCCCCTTCC1651 CCCAAACTCC AGGGAATGGA GGGGGGACCC CGCCAGCCAA AACATTCCCC1701 CCATTCCCGA CCCCCATCTC CTCTTCTAGC CCATGCCCTT CCCCGGCGGA1751 GGGAGGGAGC AGGGAGCCCT CACTCTCCAC GCCCCTTGCT TGCATCTGTA1801 TATAGTGTGA GCAGCAAGTA ACCCTTCTTC TCCCTTCCCC CTCACCCCTT1851 CTCAATGTAG TGGCCTTGGA TATCCCTGTT TGTTAATAAA GACAATTTAA1901 CCAGCTCCCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA1951 AAAAAAAAAA AAAAB氨基酸序列(SEQ ID NO:29)長度292個氨基酸1 MLLDRSGQGK VYGLIGRRRF QQMDVLEGLN LLITISGKRN KLRVYYLSWL51 RNKILHNDPE VEKKQGWTTV GDMEGCGHYR VVKYERIKFL VIALKSSVEV101 YAWAPKPYHK FMAFKSFADL PHRPLLVDLT VEEGQRLKVI YGSSAGFHAV151 DVDSGNSYDI YIPVHIQSQI TPHAIIFLPN TDGMEMLLCY EDEGVYVNTY201 GRIIKDVVLQ WGEMPTSVAY ICSNQIMGWG EKAIEIRSVE TGHLDGVFMH251 KRAQRLKFLC ERNDKVFFAS VRSGGSSQVY FMTLNRNCIM NWC核苷酸及氨基酸組合序列(SEQ ID NO:30)克隆號和蛋白名稱PP928起始編碼子383 ATG終止編碼子1261 TGA蛋白質(zhì)分子量33572.241G TCC AGC CCA GCC ACT CAC CCA CCG AGA ACA GCA AAG GCC AAA GCC 4647 CAC CCT CGA AGG ATG GGA GTG GTG ACT ACC AGT CTC GTG GGC TGG TAA 9495 AGG CCC CTG GCA AGA GCT CGT TCA CGA TGT TTG TGG ATC TAG GGA TCT 142143 ACC AGC CTG GAG GCA GTG GGG ACA GCA TCC CCA TCA CAG CCC TAG TGG 190191 GTG GAG AGG GCA CTC GGC TCG ACC AGC TGC AGT ACG ACG TGA GGA AGG 238239 GTT CTG TGG TCA ACG TGA ATC CCA CCA ACA CCC GGC CCA CAG TGA GAC 286287 CCC TGA GAT CCG GAA GTA CAA GAA GCG ATT CAA CTC CGA GAT CCT CTG 334335 TGC AGC CCT TTG GGG GGT CAA CCT GCT GGT GGG CAC GGA GAA CGG CTG 382383 ATG TTG CTG GAC CGA AGT GGG CAA GGC AAG GTG TAT GGA CTC ATT GGG 4301 Met Leu Leu Asp Arg Ser Gly Gln Gly Lys Val Tyr Gly Leu Ile Gly 16431 CGG CGA CGC TTC CAG CAG ATG GAT GTG CTG GAG GGG CTC AAC CTG CFC 47817 Arg Arg Arg Phe Gln Gln Met Asp Val Leu Glu Gly Leu Asn Leu Leu 32479 ATC ACC ATC TCA GGG AAA AGG AAC AAA CTG CGG GTG TAT TAC CTG TCC 52633 Ile Thr Ile Ser Gly Lys Arg Asn Lys Leu Arg Val Tyr Tyr Leu Ser 48527 TGG CTC CGG AAC AAG ATT CTG CAC AAT GAC CCA GAA GTG GAG AAG AAG 57449 Trp Leu Arg Asn Lys Ile Leu His Asn Asp Pro Glu Val Glu Lys Lys 64575 CAG GGC TGG ACC ACC GTG GGG GAC ATG GAG GGC TGC GGG CAC TAC CGT 62265 Gln Gly Trp Thr Thr Val Gly Asp Met Glu Gly Cys Gly His Tyr Arg 80623 GTT GTG AAA TAC GAG CGG ATT AAG TTC CTG GTC ATC GCC CTC AAG AGC 67081 Val Val Lys Tyr Glu Arg Ile Lys Phe Leu Val Ile Ala Leu Lys Ser 96671 TCC GTG GAG GTG TAT GCC TGG GCC CCC AAA CCC TAC CAC AAA TTC ATG 71897 Ser Val Glu Val Tyr Ala Trp Ala Pro Lys Pro Tyr His Lys Phe Met 112719 GCC TTC AAG TCC TTT GCC GAC CTC CCC CAC CGC CCT CTG CTG GTC GAC 766113 Ala Phe Lys Ser Phe Ala Asp Leu Pro His Arg Pro Leu Leu Val Asp 128767 CTG ACA GTA GAG GAG GGG CAG CGG CTC AAG GTC ATC TAT GGC TCC AGT 814129 Leu Thr Val Glu Glu Gly Gln Arg Leu Lys Val Ile Tyr Gly Ser Ser 144815 GCT GGC TTC CAT GCT GTG GAT GTC GAC TCG GGG AAC AGC TAT GAC ATC 862145 Ala Gly Phe His Ala Val Asp Val Asp Ser Gly Asn Ser Tyr Asp Ile 160863 TAC ATC CCT GTG CAC ATC CAG AGC CAG ATC ACG CCC CAT GCC ATC ATC 910161 Tyr Ile Pro Val His Ile Gln Ser Gln Ile Thr Pro His Ala Ile Ile 176911 TTC CTC CCC AAC ACC GAC GGC ATG GAG ATG CTG CTG TGC TAC GAG GAC 958177 Phe Leu Pro Asn Thr Asp Gly Met Glu Met Leu Leu Cys Tyr Glu Asp 192959 GAG GGT GTC TAC GTC AAC ACG TAC GGG CGC ATC ATT AAG GAT GTG GTG1006193 Glu Gly Val Tyr Val Asn Thr Tyr Gly Arg Ile Ile Lys Asp Val Val 2081007 CTG CAG TGG GGG GAG ATG CCT ACT TCT GTG GCC TAC ATC TGC TCC AAC1054209 Leu Gln Trp Gly Glu Met Pro Thr Ser Val Ala Tyr Ile Cys Ser Asn 2241055 CAG ATA ATG GGC TGG GGT GAG AAA GCC ATT GAG ATC CGC TCT GTG GAG1102225 Gln Ile Met Gly Trp Gly Glu Lys Ala Ile Glu Ile Arg Ser Val Glu 2401103 ACG GGC CAC CTC GAC GGG GTC TTC ATG CAC AAA CGA GCT CAG AGG CTC1150241 Thr Gly His Leu Asp Gly Val Phe Met His Lys Arg Ala Gln Arg Leu 2561151 AAG TTC CTG TGT GAG CGG AAT GAC AAG GTG TTT TTT GCC TCA GTC CGC1198257 Lys Phe Leu Cys Glu Arg Asn Asp Lys Val Phe Phe Ala Ser Val Arg 2721199 TCT GGG GGC AGC AGC CAA GTT TAC TTC ATG ACT CTG AAC CGT AAC TGC1246273 Ser Gly Gly Ser Ser Gln Val Tyr Phe Met Thr Leu Asn Arg Asn Cys 2881247 ATC ATG AAC TGG TGA CGG GGC CCT GGG CTG GGG CTG TCC CAC ACT GGA1294289 Ile Met Asn Trp *** 2931295 CCC AGC TCT CCC CCT GCA GCC AGG CTT CCC GGG CCG CCC CTC TTT CCC13421343 CTC CCT GGG CTT TTG CTT TTA CTG GTT TGA TTT CAC TGG AGC CTG CTG13901391 GGA ACG TGA CCT CTG ACC CCT GAT GCT TTC GTG ATC ACG TGA CCA TCC14381439 TCT TCC CCA ACA TGT CCT CTT CCC AAA ACT GTG CCT GTC CCC AGC TTC14861487 TGG GGA GGG ACA CAG CTT TCC CTT CCC AGG AAT TGA GTG GGC CTA GCC15341535 CCT CCC CCC TTT TCT CCA TTT GAG AGG AGA GTG CTT GGG GCT TGA ACC15821583 CCT TAC CCC ACT CCA GGG GCA GGG ACC ATT TCT TCA TTT TCT GAA AGC16301631 ACT TTA ATG ATT CCC CTT CCC CCA AAC TCC AGG GAA TGG AGG GGG GAC16781679 CCC GCC AGC CAA AAC ATT CCC CCC ATT CCC GAC CCC CAT CTC CTC TTC17261727 TAG CCC ATG CCC TTC CCC GGC GGA GGG AGG GAG CAG GGA GCC CTC ACT 17741775 CTC CAC GCC CCT TGC TTG CAT CTG TAT ATA GTG TGA GCA GCA AGT AAC 18221823 CCT TCT TCT CCC TTC CCC CTC ACC CCT TCT CAA TGT AGT GGC CTT GGA 18701871 TAT CCC TGT TTG TTA ATA AAG ACA ATT TAA CCA GCT CCC AAA AAA AAA 19181919 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 196411.PP1200A核苷酸序列(SEQ ID NO:31)長度2146bp1 AAAAAGGCAT ACTATGATTC AAGAAAAAGT GAAGTCATTA TATCACAGCT51 TTTAGATTCT AAAGCTGGAG ACTTTAATGC CAGCAAAGGA TGGTGTTTTT101 TTTTGTTTGT TTGTTTTTTG TTTTTTTTTT TGAAATGGAG TCTTGCTCTG151 TCATCCAGAC TGGAGTACAG TGGCACTATC TCGGCTCACT GTAACCTCCG201 CCTCCTGGGT TCAAGCAATT CTCCTGCCCC AGCCTCCCAA GTAGTTGTGA251 TTACAGGTGT GTGCCACCAT GCCCGGCTAA TTTTTGTATT TCTTTAGTAG301 AGATGGGGTT TCACCACGTT GGCCAGGCTG CTCTCAAACT CCTGACCTCA351 GGTGATCCAC CCACCTCAGC CTCCCAAAGT GCTGGGATTA TAGAGTGGAA401 CCGCCACACC TGGCCGGTTT GGTAATTTTA GAAAAAGGTT TGGCTTTGAA451 AAAGTCAAGA TAACAGGAGA AACAGCTTCT GCCAGAAAAG AGGCAGTAGG501 TTCCCAAACT ACATTAAGAA AGTTCCCAAA CTATATTAAG TTGTTGGAGA551 AAGAGTATCT GCCTGAACAA GTTTTTAATG CAGACAAAAG TGCCCTATTC601 TGGAAGAAAA AGGTCACAAA GATCTAAGGA AGAAAAGTGA ATACTAGGAC651 TTAAGGCAGG AGGAGATCGT CTAACTTTAC TGTTTTGCGC AAATGCAGTT701 GGGTTTATGA TCAGAACTGC CATTACCTAT AAAGCTGCTA ACCCGAGTCT751 TGAAGGGAAA AAGATAAACA CCAGCTGCCA GTCTTTTGGT TGTACAACAG801 GAACGCTTAA ACAATATGAA CCCTTTTTCG TGATCGGTTA CATTAATGCT851 TTGTCCCTGA AATCAGAAAG TATCTTGCTA GTAAGGGACT GCCTTTTAAA901 GTTGTTGGGT TTTTTTTTTT TTTTTTTTGA GATGGAGTTT CACTCTTTTC951 GCCTAGGCTG GAGTGCAGTG GCGCACTCTC CACTCACTGC AACCTCCTCC1001 TCCTGGGTTC TAGCGATTCT CCTGCCTCAG CCTCCCTAGT AGCTGGGATT1051 ACAGGCGTAC TCCACCATGC CCAACTTAAT TTCTTTGTAT CTTTAGTAGA1101 GACGGGGTTT CGCCCTGTTG GCCAAGCTGG TCCCAAACTC CTGATCGCAG1151 GTGATCCGTC CACCTCAGCC TCCCAAAGTG CTGGGATTAC AGGCACGAGC1201 CACAGCTCCT GGCCTAAAGT TGTTTTGATA TTGGACAATG CCCCTGGCCA1251 CCCAGAACCT CATGAGTTCA GCACTGAAGG CGTCGAAGTG ATAAATGCTT1301 GCTCCCAGAG AAGCATCTCT AATTCAGCCT CTAGGTCAGG GGTCGTAAAG1351 ACCTTTCAGG CTCATTACAC ATGGTACTCT TTGGAAAAGA TTGTCAGTGC1401 TATGGAAGAG AACCCCAATA GAGGGAACAT CATATCATTG AAGATACCAT1451 TATTGTTTAG AAAAAGATGT GAAAGCTCTC AAGCCTGAAA CAACAAATTT1501 TTGCAGGAGA AAACTCCAGA TGTTGTGTGA CTTCACAGGA TGGAAATCAT1551 GATAGAGATT GTGGATATGG CAAGAAAGGT TGGGGGTTAA GGATTTCAAG1601 ATAGGGATCT TGGAGATATT CAGGAGCTAA TAGACACCAA ACCAGAAGAA1651 TTAACAGAAG ATGACTTGAT GGAGATGACT GCTTCCAAAC CACTGCCAGA1701 CAGTGAGGAA GAAGACATAG AAAAAGAGAA AAAGCGTGCA AGAAAACAGA1751 TTGACACTAG ACAGTCTGAC AGAAGGGGTC TGATTATTCA AGACTGCTTT1801 TCACTTCGTT TGCAACGTGG ACCGTTTGGT GATATGAGAA CTGAAACTAA1851 AGCAAATGGT GGAAGAAGGA TTGGTACCAT ATGGAAACAA TTTAGAGAAC1901 AAAAAAGCAG AAAAGTCAGA AATTACATTG TATTTCCATA AAGTTACACC1951 AAGTATGCCT GCCTCTCCAG CCTCCCCTTT TACCTTCTCC ACCTCTGCTA2001 ACCCTGAGAC AGCAAGACCA ACCCCTCTTC TACCTATTCA ATGTAAAGAC2051 AAGGATGAAG ACTTTTATGA TGATCGACTT TCACTTAATG AATAGTAAAT2101 ATATTTTCTC TTCCTTGTGA TTTTCTGAAA AAAAAAAAAA AAAAAAB氨基酸序列(SRQ ID NO:32)長度110個氨基酸1 MIRTAITYKA ANPSLEGKKI NTSCQSFGCT TGTLKQYEPF FVIGYINALS51 LKSESILLVR DCLLKLLGFF FFFFEMEFHS FRLGWSAVAH SPLTATSSSW101 VLAILLPQPPC.核苷酸及氨基酸組合序列(SEQ ID NO:33)克隆號和蛋白名稱PP1200起始編碼子707 ATG終止編碼子1039 TAG蛋白質(zhì)分子量12302.741A AAA AGG CAT ACT ATG ATT CAA GAA AAA GTG AAG TCA TTA TAT CAC 4647 AGC TTT TAG ATT CTA AAG CTG GAG ACT TTA ATG CCA GCA AAG GAT GGT 9495 GTT TTT TTT TGT TTG TTT GTT TTT TGT TTT TTT TTT TGA AAT GGA GTC 142143 TTG CTC TGT CAT CCA GAC TGG AGT ACA GTG GCA CTA TCT CGG CTC ACT 190191 GTA ACC TCC GCC TCC TGG GTT CAA GCA ATT CTC CTG CCC CAG CCT CCC 238239 AAG TAG TTG TGA TTA CAG GTG TGT GCC ACC ATG CCC GGC TAA TTT TTG 286287 TAT TTC TTT AGT AGA GAT GGG GTT TCA CCA CGT TGG CCA GGC TGC TCT 334335 CAA ACT CCT GAC CTC AGG TGA TCC ACC CAC CTC AGC CTC CCA AAG TGC 382383 TGG GAT TAT AGA GTG GAA CCG CCA CAC CTG GCC GGT TTG GTA ATT TTA 430431 GAA AAA GGT TTG GCT TTG AAA AAG TCA AGA TAA CAG GAG AAA CAG CTT 478479 CTG CCA GAA AAG AGG CAG TAG GTT CCC AAA CTA CAT TAA GAA AGT TCC 526527 CAA ACT ATA TTA AGT TGT TGG AGA AAG AGT ATC TGC CTG AAC AAG TTT 574575 TTA ATG CAG ACA AAA GTG CCC TAT TCT GGA AGA AAA AGG TCA CAA AGA 622623 TCT AAG GAA GAA AAG TGA ATA CTA GGA CTT AAG GCA GGA GGA GAT CGT 670671 CTA ACT TTA CTG TTT TGC GCA AAT GCA GTT GGG TTT ATG ATC AGA ACT 7181 Met Ile Arg Thr 4719 GCC ATT ACC TAT AAA GCT GCT AAC CCG AGT CTT GAA GGG AAA AAG ATA 7665 Ala Ile Thr Tyr Lys Ala Ala Asn Pro Ser Leu Glu Gly Lys Lys Ile 20767 AAC ACC AGC TGC CAG TCT TTT GGT TGT ACA ACA GGA ACG CTT AAA CAA 81421 Asn Thr Ser Cys Gln Ser Phe Gly Cys Thr Thr Gly Thr Leu Lys Gln 36815 TAT GAA CCC TTT TTC GTG ATC GGT TAC ATT AAT GCT TTG TCC CTG AAA 86237 Tyr Glu Pro Phe Phe Val Ile Gly Tyr Ile Asn Ala Leu Ser Leu Lys 52863 TCA GAA AGT ATC TTG CTA GTA AGG GAC TGC CTT TTA AAG TTG TTG GGT 91053 Ser Glu Ser Ile Leu Leu Val Arg Asp Cys Leu Leu Lys Leu Leu Gly 68911 TTT TTT TTT TTT TTT TTT GAG ATG GAG TTT CAC TCT TTT CGC CTA GGC 95869 Phe Phe Phe Phe Phe Phe Glu Met Glu Phe His Ser Phe Arg Leu Gly 84959 TGG AGT GCA GTG GCG CAC TCT CCA CTC ACT GCA ACC TCC TCC TCC TGG100685 Trp Ser Ala Val Ala His Ser Pro Leu Thr Ala Thr Ser Ser Ser Trp 1001007 GTT CTA GCG ATT CTC CTG CCT CAG CCT CCC TAG TAG CTG GGA TTA CAG1054101 Val Leu Ala Ile Leu Leu Pro Gln Pro Pro *** 1111055 GCG TAC TCC ACC ATG CCC AAC TTA ATT TCT TTG TAT CTT TAG TAG AGA11021103 CGG GGT TTC GCC CTG TTG GCC AAG CTG GTC CCA AAC TCC TGA TCG CAG11501151 GTG ATC CGT CCA CCT CAG CCT CCC AAA GTG CTG GGA TTA CAG GCA CGA11981199GCC ACA GCT CCT GGC CTA AAG TTG TTT TGA TAT TGG ACA ATG CCC CTG 12461247GCC ACC CAG AAC CTC ATG AGT TCA GCA CTG AAG GCG TCG AAG TGA TAA 12941295ATG CTT GCT CCC AGA GAA GCA TCT CTA ATT CAG CCT CTA GGT CAG GGG 13421343TCG TAA AGA CCT TTC AGG CTC ATT ACA CAT GGT ACT CTT TGG AAA AGA 13901391TTG TCA GTG CTA TGG AAG AGA ACC CCA ATA GAG GGA ACA TCA TAT CAT 14381439TGA AGA TAC CAT TAT TGT TTA GAA AAA GAT GTG AAA GCT CTC AAG CCT 14861487GAA ACA ACA AAT TTT TGC AGG AGA AAA CTC CAG ATG TTG TGT GAC TTC 15341535ACA GGA TGG AAA TCA TGA TAG AGA TTG TGG ATA TGG CAA GAA AGG TTG 15821583GGG GTT AAG GAT TTC AAG ATA GGG ATC TTG GAG ATA TTC AGG AGC TAA 16301631TAG ACA CCA AAC CAG AAG AAT TAA CAG AAG ATG ACT TGA TGG AGA TGA 16781679CTG CTT CCA AAC CAC TGC CAG ACA GTG AGG AAG AAG ACA TAG AAA AAG 17261727AGA AAA AGC GTG CAA GAA AAC AGA TTG ACA CTA GAC AGT CTG ACA GAA 17741775GGG GTC TGA TTA TTC AAG ACT GCT TTT CAC TTC GTT TGC AAC GTG GAC 18221823CGT TTG GTG ATA TGA GAA CTG AAA CTA AAG CAA ATG GTG GAA GAA GGA 18701871TTG GTA CCA TAT GGA AAC AAT TTA GAG AAC AAA AAA GCA GAA AAG TCA 19181919GAA ATT ACA TTG TAT TTC CAT AAA GTT ACA CCA AGT ATG CCT GCC TCT 19661967CCA GCC TCC CCT TTT ACC TTC TCC ACC TCT GCT AAC CCT GAG ACA GCA 20142015AGA CCA ACC CCT CTT CTA CCT ATT CAA TGT AAA GAC AAG GAT GAA GAC 20622063TTT TAT GAT GAT CGA CTT TCA CTT AAT GAA TAG TAA ATA TAT TTT CTC 21102111TTC CTT GTG ATT TTC TGA AAA AAA AAA AAA AAA AAA 214612.PP1226A核苷酸序列(SEQ ID NO:34)長度1588bp1AGCTTGCAAG CATGCTCCGC TGGACCCGAG CCTGGAGGCT CCCGCGTGAG51GGACTCGGCC CCCACGGCCC TAGCTTCGCG AGGGTGCCTG TCGCACCCAG101CAGCAGCAGC GGCGGCCGAG GGGGCGCCGA GCCGAGGCCG CTTCCGCTTT151CCTACAGGCT TCTGGACGGG GAGGCAGCCC TCCCGGCCGT CGTCTTTTTG201CACGGGCTCT TCGGCAGCAA AACTAACTTC AACTCCATCG CCAAGATCTT251GGCCCAGCAG ACAGGCCGTG CTGACGGTGG ATGCTCGTAA CCACGGTGAC301AGCCCCCACA GCCCAGACAT GAGCTACGAG ATCATGAGCC AGGACCTGCA351GGACCTTCTG CCCCAGCTGG GCCTGGTGCC CTGCGTCGTC GTTGGCCACA401GCATGGGAGG AAAGACAGCC ATGCTGCTGG CACTACAGAG GGTGAGCCGC451CCATGTCTGG GGCCTCCTCC CATTCAGTAT ATACCCTGAG GGCCCTGCAG501GCAACCTGGG ACTCACATGA TCGTTGGATG ACCAAGTTCA GGCTCCAGGA551GCCATGCCTG AGACTCCCTA TGTCTGCCTA AGACTGGTCC CAGTTCGGTT601CTCTCCCACA GCCAGAGCTG GTGGAACGTC TCATTGCTGT AGATATCAGC651CCAGTGGAAA GCACAGGTGT CTCCCACTTT GCAACCTATG TGGCAGCCAT701GAGGGCCATC AACATCGCAG ATGAGCTGCC CCGCTCCCGT GCCCGAAAAC751TGGCGGATGA ACAGCTCAGT TCTGTCATCC AGGACATGGC CGTGCGGCAG801CACCTGCTCA CTAACCTGGT AGAGGTAGAC GGGCGCTTCG TGTGGAGGGT851GAACTTGGAT GCCCTGACCC AGCACCTAGA CAAGATCTTG GCTTTCCCAC901AGAGGCAGGA GTCCTACCTC GGGCCAACAC TCTTTCTCCT TGGTGGAAAC951TCCCAGTTCG TGCATCCCAG CCACCACCCT GAGATTATGC GGCTCTTCCC1001TCGGGCCCAG ATGCAGACGG TGCCGAACGC TGGCCACTGG ATCCACGCTG1051ACCGCCCACA GGACTTCATA GCTGCCATCC GAGGCTTCCT GGTCTAAGAG1101TTGCTGGCAA GAAGATGGCC GGGCGTGGTG GCTCATGCCT GTAATTCCAG1151CACTTTGGGA GGCTAAGGCG GGAGGATGAC TTGAGGCCAG GAGTTGGAGA1201CCAGCCTGGC CAACATGGTG AAACCCTGTC TCTACTAAAA ATACAAAAAT1251TAGCCTGGCG TGGTGGTGCA CACCTGTAAT CCCAGCTACT CTGGAGGCTG1301AGGCAGGAGA ATCACTTGAA CCCTGGAGGC AGAGGTTGCA ATGAGCCGAG1351 ATCACACCAC TACACTCCAG CCTAGGCAAC AGAGCAAGAC TCTGTCTCAA1401 AAAAAACAAA ACAAAAAGGA GGCACAAAAC CCCAGGCTTC AAGTCTCTGC1451 AGCCTGCTCC ACATTTGGGC ACAGAAGGAC TCAGACAGGC ACTGTGTGGG1501 CACGAGGTTT TACAGGGGTG GTCAGACCTC AGGCTTTAAT GAATAAAGAC1551 ACTACTCCCC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAB氨基酸序列(SEQ ID NO:35)長度132個氨基酸1 MRAINIADEL PRSRARKLAD EQLSSVIQDM AVRQHLLTNL VEVDGRFVWR51 VNLDALTQHL DKILAFPQRQ ESYLGPTLFL LGGNSQFVHP SHHPEIMRLF101 PRAQMQTVPN AGHWIHADRP QDFIAAIRGF LVC核苷酸及氨基酸組合序列(SEQ ID NO:36)克隆號和蛋白名稱PP1226起始編碼子699 ATG終止編碼子1097 TAA蛋白質(zhì)分子量15123.621 AG CTT GCA AGC ATG CTC CGC TGG ACC CGA GCC TGG AGG CTC CCG CGT 4748 GAG GGA CTC GGC CCC CAC GGC CCT AGC TTC GCG AGG GTG CCT GTC GCA 9596 CCC AGC AGC AGC AGC GGC GGC CGA GGG GGC GCC GAG CCG AGG CCG CTT 143144 CCG CTT TCC TAC AGG CTT CTG GAC GGG GAG GCA GCC CTC CCG GCC GTC 191192 GTC TTT TTG CAC GGG CTC TTC GGC AGC AAA ACT AAC TTC AAC TCC ATC 239240 GCC AAG ATC TTG GCC CAG CAG ACA GGC CGT GCT GAC GGT GGA TGC TCG 287288 TAA CCA CGG TGA CAG CCC CCA CAG CCC AGA CAT GAG CTA CGA GAT CAT 335336 GAG CCA GGA CCT GCA GGA CCT TCT GCC CCA GCT GGG CCT GGT GCC CTG 383384 CGT CGT CGT TGG CCA CAG CAT GGG AGG AAA GAC AGC CAT GCT GCT GGC 431432 ACT ACA GAG GGT GAG CCG CCC ATG TCT GGG GCC TCC TCC CAT TCA GTA 479480 TAT ACC CTG AGG GCC CTG CAG GCA ACC TGG GAC TCA CAT GAT CGT TGG 527528 ATG ACC AAG TTC AGG CTC CAG GAG CCA TGC CTG AGA CTC CCT ATG TCT 575576 GCC TAA GAC TGG TCC CAG TTC GGT TCT CTC CCA CAG CCA GAG CTG GTG 623624 GAA CGT CTC ATT GCT GTA GAT ATC AGC CCA GTG GAA AGC ACA GGT GTC 671672 TCC CAC TTT GCA ACC TAT GTG GCA GCC ATG AGG GCC ATC AAC ATC GCA 7191 Met Arg Ala Ile Asn Ile Ala 7720 GAT GAG CTG CCC CGC TCC CGT GCC CGA AAA CTG GCG GAT GAA CAG CTC 7678 Asp Glu Leu Pro Arg Ser Arg Ala Arg Lys Leu Ala Asp Glu Gln Leu 23768 AGT TCT GTC ATC CAG GAC ATG GCC GTG CGG CAG CAC CTG CTC ACT AAC 81524 Ser Ser Val Ile Gln Asp Met Ala Val Arg Gln His Leu Leu Thr Asn 39816 CTG GTA GAG GTA GAC GGG CGC TTC GTG TGG AGG GTG AAC TTG GAT GCC 86340 Leu Val Glu Val Asp Gly Arg Phe Val Trp Arg Val Asn Leu Asp Ala 55864 CTG ACC CAG CAC CTA GAC AAG ATC TTG GCT TTC CCA CAG AGG CAG GAG 91156 Leu Thr Gln His Leu Asp Lys Ile Leu Ala Phe Pro Gln Arg Gln Glu 71912 TCC TAC CTC GGG CCA ACA CTC TTT CTC CTT GGT GGA AAC TCC CAG TTC 95972 Ser Tyr Leu Gly Pro Thr Leu Phe Leu Leu Gly Gly Asn Ser Gln Phe 87960 GTG CAT CCC AGC CAC CAC CCT GAG ATT ATG CGG CTC TTC CCT CGG GCC100788 Val His Pro Ser His His Pro Glu Ile Met Arg Leu Phe Pro Arg Ala 1031008 CAG ATG CAG ACG GTG CCG AAC GCT GGC CAC TGG ATC CAC GCT GAC CGC1055104 Gln Met Gln Thr Val Pro Asn Ala Gly His Trp Ile His Ala Asp Arg 1191056 CCA CAG GAC TTC ATA GCT GCC ATC CGA GGC TTC CTG GTC TAA GAG TTG1103120 Pro Gln Asp Phe Ile Ala Ala Ile Arg Gly Phe Leu Val *** 1331104 CTG GCA AGA AGA TGG CCG GGC GTG GTG GCT CAT GCC TGT AAT TCC AGC11511152 ACT TTG GGA GGC TAA GGC GGG AGG ATG ACT TGA GGC CAG GAG TTG GAG11991200 ACC AGC CTG GCC AAC ATG GTG AAA CCC TGT CTC TAC TAA AAA TAC AAA12471248 AAT TAG CCT GGC GTG GTG GTG CAC ACC TGT AAT CCC AGC TAC TCT GGA12951296 GGC TGA GGC AGG AGA ATC ACT TGA ACC CTG GAG GCA GAG GTT GCA ATG13431344 AGC CGA GAT CAC ACC ACT ACA CTC CAG CCT AGG CAA CAG AGC AAG ACT13911392 CTG TCT CAA AAA AAA CAA AAC AAA AAG GAG GCA CAA AAC CCC AGG CTT14391440 CAA GTC TCT GCA GCC TGC TCC ACA TTT GGG CAC AGA AGG ACT CAG ACA14871488 GGC ACT GTG TGG GCA CGA GGT TTT ACA GGG GTG GTC AGA CCT CAG GCT15351536 TTA ATG AAT AAA GAC ACT ACT CCC CAA AAA AAA AAA AAA AAA AAA AAA15831584 AAA AA 1588DBlastp結(jié)果Query=PP1226[基因=PP1226](132個氨基酸)>SP_IN:045707 045707 caenorhabditis elegans.r05d7.4 protein.5/1999長度=299分值=110 bits(272),預計值=6e-24相同性=52/121(42%),相似性=78/121(63%),缺口=3/121(2%)Query:14 RARKLADEQLSSVIQDMAVRQHLLTNLV---EVDGRFVWRVNLDALTQHLDKILAFPQRQ 70R RK + L S I D+A+RQ +LTNLE +G+ W++N++ + H+D+IL +Sbjct:177 RTRKEILKDLESAIPDLAMRQFILTNLQPSSENEGQMEWKININTIDSHVDEILGYTLPV 236Query:71 ESYLGPTLFLLGGNSQFVHPSHHPEIMRLFPRAQMQTVPNAGHWIHADRPQDFIAAIRGF 130S+ GPTLFL G NS +V H P+I LFP+ Q +P++GHW+HA++PQ FI ++ FSbjct:237 GSFRGPTLFLHGANSGYVPDDHKPDIKCLFPQVQFDAIPDSGHWVHAEKPQLFINSVYKF 296Query:131 L 131LSbjct:297 L 297>SP_FUN:094437 094437 schizosaccharomyces pombe(fission yeast).
putative abhydrolase.5/1999長度=270分值=61.3 bits(146),預計值=3e-09相同性=36/118(30%),相似性=65/118(54%),缺口=6/118(5%)Query:19 ADEQLSSVIQDMAVRQHLLTNLVEVDGR---FVWRVNLDALTQHLDKILAFPQRQES--Y 73AD+ +S+V +D+ VR LL+NL+F +RV++ +++ L I FP YSbjct:152 ADKMMSTVEKDILVRSFLLSNLKKDSNNSNTFKFRVPIELISKSLKTIEGFPASLNDLVY 211Query:74 LGPTLFLLGGNSQFVHPSHHPEIMRLFPRAQMQTVPNAGHWIHADRPQDFIAAIRGFL 131PTL+ + F+ S P + FP+ ++ ++ + GHW+H ++P++F +I FLSbjct:212 DSPTLVIRALKAPFIPDSALPVFKKFFPKYELVSL-DCGHWVHFEKPKEFSESIINFL 268>SW:YGlL_YEAST P53219 saccharomyces cerevisiae(baker's yeast).
hypothetical 38.5 kd protein in ervl-gls2 intergenicregion.11/1997長度=342分值=45.7 bits(106),預計值=2e-04相同性=42/132(31%),相似性=66/132(49%),缺口=15/132(11%)Query:14 RARKLADEQLSSVIQ-DMAVRQHLLTNL--VEVDGR-------FVWRVNLDALTQHLDK- 62R K ADE L+ I + VR+ LLT L V++D F R+ L L + KSbjct:207 RTLKQADEHLAERIGGNELYRRFLLTALKKVKMDNSSSVSSYTFEERIPLATLKDAIVKG 266Query:63 -ILAFP--QRQESYLGPTLFLLGGNSQFVHPSHHPEIMRLFPRAQMQTVPNAGHWIHADR 119I A+P+E+ P LF+S +V+P I FPR + + + +AGHW++A++Sbjct:267 EIAAWPLDPARERWTRPALFIRATQSHYVVDEYLPIIGAFFPRFETRDI-DAGHWVNAEK 325Query:120 PQDFIAAIRGFL 131P++I F+Sbjct:326 PGECAESIVDFV 33713.PP1292A核苷酸序列(SEQ ID NO:37)長度966bp1 GATGTCTGGG ATGGCACGTG GCCCGACCTC CACAAGCTCC CTCATGCTTC51 CTGTCCCCCG CTTACACGAC AACGGGCCAG ACCACGGGAA GGACGGTGTT101 TGTGTCTGAG GGAGCTGCTG GCCACAGTGA ACACCCACGT TTATTCCTGC151 CTGCTCCGGC CAGGACTGAA CCCCTTCTCC ACACCTGAAC AGTTGGCTCA201 AGGGCCACCA GAAGCATTTC TTTATTATTA TTATTTTTTA ACCTGGACAT251 GCATTAAAGG GTCTATTAGC TTTCTTTCCG TCTGTCTCAA CAGCTGAGAT301 GGGGCCGCCA AGGAGTGCCT TCCTTTTGCT CCCTCCTAGC TGGGAGTGAC351 GGGTGGGAGT GTGTGTGCCC AGGTGGGGGT GTCTCCTGGC TGGGAAGGAG401 GGAAAGGGAG GGAGAGTTTT GCGGGGGTTG GCAGTGAAGA GCAGGCTGGA451 AAGGAGATGG CTAATAGCTG TTTAATGGAA ACCTGCTGGG CTGGAGGGAG501 TTAGGCTGAA TTTCCCGACT TCCTCTGCCA GTTATTGACA CAGCTCTCTT551 TGTAAGAGAG GAAAGAAACT AAACCCACCC AAGGGATGAT TTCAGGGGGA601 GAGGTGGAGG GCAGATGTCC TGGGCAAACC GGGCCCCTTT GCCCACACAC651 CTCACTTGAT CCTTTTGCCA AACTTGTCAA ACTCAGGGGA ACTGGCTTCC701 CAGTTGCCCC TTTGCCATAT TCCAAGTCCC CCTCAGACTT CATGTCTCTG751 CTCATCAGCA CTGTCCCAGG ATCCTGGAGA GGGAGAACCC CTGGCCCCAG801 GGGAAAGAGG GGGGGGTCTC CCGTTTCCTG TGCCTGCACC AGCCCTGCCC851 CCATTGCGTC TGCACACCCC TGCGTGTAAC TGCATTCCAA CCACTAATAA901 AGTGCCTATT GTACAGGTCC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA951 AAAAAAAAAA AAAAAAB氨基酸序列(SEQ ID NO:38)長度97個氨基酸1 MISGGEVEGR CPGQTGPLCP HTSLDPFAKL VKLRGTGFPV APLPYSKSPS51 DFMSLLISTV PGSWRGRTPG PRGKRGGSPV SCACTSPAPI ASAHPCVC.核苷酸及氨基酸組合序列(SEQ ID NO:39)克隆號和蛋白名稱PP1292起始編碼子586 ATG終止編碼子879 TAA蛋白質(zhì)分子量9929.971 GAT GTC TGG GAT GGC ACG TGG CCC GAC CTC CAC AAG CTC CCT CAT GCT 4849 TCC TGT CCC CCG CTT ACA CGA CAA CGG GCC AGA CCA CGG GAA GGA CGG 9697 TGT TTG TGT CTG AGG GAG CTG CTG GCC ACA GTG AAC ACC CAC GTT TAT 144145 TCC TGC CTG CTC CGG CCA GGA CTG AAC CCC TTC TCC ACA CCT GAA CAG 192193 TTG GCT CAA GGG CCA CCA GAA GCA TTT CTT TAT TAT TAT TAT TTT TTA 240241 ACC TGG ACA TGC ATT AAA GGG TCT ATT AGC TTT CTT TCC GTC TGT CTC 288289 AAC AGC TGA GAT GGG GCC GCC AAG GAG TGC CTT CCT TTT GCT CCC TCC 336337 TAG CTG GGA GTG ACG GGT GGG AGT GTG TGT GCC CAG GTG GGG GTG TCT 384385 CCT GGC TGG GAA GGA GGG AAA GGG AGG GAG AGT TTT GCG GGG GTT GGC 432433 AGT GAA GAG CAG GCT GGA AAG GAG ATG GCT AAT AGC TGT TTA ATG GAA 480481 ACC TGC TGG GCT GGA GGG AGT TAG GCT GAA TTT CCC GAC TTC CTC TGC 528529 CAG TTA TTG ACA CAG CTC TCT TTG TAA GAG AGG AAA GAA ACT AAA CCC 576577 ACC CAA GGG ATG ATT TCA GGG GGA GAG GTG GAG GGC AGA TGT CCT GGG 6241 Met Ile Ser Gly Gly Glu Val Glu Gly Arg Cys Pro Gly 13625 CAA ACC GGG CCC CTT TGC CCA CAC ACC TCA CTT GAT CCT TTT GCC AAA 67214 Gln Thr Gly Pro Leu Cys Pro His Thr Ser Leu Asp Pro Phe Ala Lys 29673 CTT GTC AAA CTC AGG GGA ACT GGC TTC CCA GTT GCC CCT TTG CCA TAT 72030 Leu Val Lys Leu Arg Gly Thr Gly Phe Pro Val Ala Pro Leu Pro Tyr 45721 TCC AAG TCC CCC TCA GAC TTC ATG TCT CTG CTC ATC AGC ACT GTC CCA 76846 Ser Lys Ser Pro Ser Asp Phe Met Ser Leu Leu Ile Ser Thr Val Pro 61769 GGA TCC TGG AGA GGG AGA ACC CCT GGC CCC AGG GGA AAG AGG GGG GGG 81662 Gly Ser Trp Arg Gly Arg Thr Pro Gly Pro Arg Gly Lys Arg Gly Gly 77817 TCT CCC GTT TCC TGT GCC TGC ACC AGC CCT GCC CCC ATT GCG TCT GCA 86478 Ser Pro Val Ser Cys Ala Cys Thr Ser Pro Ala Pro Ile Ala Ser Ala 93865 CAC CCC TGC GTG TAA CTG CAT TCC AAC CAC TAA TAA AGT GCC TAT TGT 91294 His Pro Cys Val *** 98913 ACA GGT CCA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 960961 AAA AAA 96614.PP1396A核苷酸序列(SEQ ID NO:40)長度2070bp1 GGACCCCAGG CACCCTCCAG CGTCAGGTGC GGATCCCCCA GATGCCCGCC51 CCGCCCCATC CCAGGACACC CCTGGGGTCT CCAGCTGCGT ACTGGAAACG101 AGTGGGACAC TCTGCATGCT CAGCGTCCTG CGGGAAAGGT GTCTGGCGCC151 CCATTTTCCT CTGCATCTCC CGTGAGTCGG GAGAGGAACT GGATGAACGC201 AGCTGTGCCG CGGGTGCCAG GCCCCCAGCC TCCCCTGAAC CCTGCACGGC251 ACCCCATGCC CCCCATACTG GGAGGCTGGC GAGTGGACAT CCTGCAGCCG301 CTCCTGTGGC CCCGGCACCC AGCACCGCCA GCTGCAGTGC CGGCAGGAAT351 TTGGGGGGGG TGGCTCCTCG GTGCCCCCGG AGCGCTGTGG ACATCTCCCC401 CGGCCCAACA TCACCCAGTC TTGCCAGCTG CGCCTCTGTG GCCATTGGGA451 AGTTGGCTCT CCTTGGAGCC AGTGCTCCGT GCGGTGCGGC CGGGGCCAGA501 GAAGCCGGCA GGTTCGCTGT GTTGGGAACA ACGGTGATGA AGTGAGCGAG551 CAGGAGTGTG CGTCAGGCCC CCCACAGCCC CCCAGCAGAG AGGCCTGTGA601 CATGGGGCCC TGTACTACTG CCTGGTTCCA CAGCGACTGG AGCTCCAAGT651 GCTCAGCCGA GTGTGGGACG GGAATCCAGC GGCGCTCTGT GGTCTGCCTT701 GGGAGTGGGG CAGCCCTCGG GCCAGGCCAG GGGGAAGCAG GAGCAGGAAC751 TGGGCAGAGC TGTCCAACAG GAAGCCGGCC CCCTGACATG CGCGCCTGCA801 GCCTGGGGCC CTGTGAGAGA ACTTGGCGCT GGTACACAGG GCCCTGGGGT851 GAGTGCTCCT CCGAATGTGG CTCTGGCACA CAGCGTAGAG ACATCATCTG901 TGTATCCAAA CTGGGGACGG AGTTCAACGT GACTTCTCCG AGCAACTGTT951 CTCACCTCCC CAGGCCCCCT GCCCTGCAGC CCTGTCAAGG GCAGGCCTGC1001 CAGGACCGAT GGTTTTCCAC GCCCTGGAGC CCATGTTCTC GCTCCTGCCA1051 AGGGGGAACG CAGACACGGG AGGTCCAGTG CCTGAGCACC AACCAGACCC1101 TCAGCACCCG ATGCCCTCCT CAACTGCGGC CCTCCAGGAA GCGCCCCTGT1151 AACAGCCAAC CCTGCAGCCA GCGCCCTGAT GATCAATGCA AGGACAGCTC1201 TCCACATTGC CCCCTGGTGG TACAGGCCCG GCTCTGCGTC TACCCCTACT1251 ACACAGCCAC CTGTTGCCGC TCTTGCGCAC ATGTCCTGGA GCGGTCTCCC1301 CAGGATCCCT CCTGAAAGGG GTCCGGGGCA CCTTCACGGT TTTCTGTGCC1351 ACCATCGGTC ACCCATTGAT CGGCCCACTC TGAACCCCCT GGCTCTCCAG1401 CCTGTCCCAG TCTCAGCAGG GATGTCCTCC AGGTGACAGA GGGTGGCAAG1451 GTGACTGACA CAAAGTGACT TTCAGGGCTG TGGTCAGGCC CATGTGGTGG1501 TGTGATGGGT GTGTGCACAT ATGCCTCAGG TGTGCTTTTG GGACTGCATG1551 GATATGTGTG TGCTCAAACG TGTATCACTT TTCAAAAAGA GGTTACACAG1601 ACTGAGAAGG ACAAGACCTG TTTCCTTGAG ACTTTCCTAG GTGGAAAGGA1651 AAGCAAGTCT GCAGTTCCTT GCTAATCTGA GCTACTTAGA GTGTGGTCTC1701 CCCACCAACT CCAGTTTTGT GCCCTAAGCC TCATTTCTCA TGTTCAGACC1751 TCACATCTTT TAAGCCGCCC TGTGTCTCTG ACCCCTTCTC ATTTGCCTAG1801 TATCTCTGCC CCTGCCTCCC TAATTAGCTA GGGCTGGGGT CAGCCACTGC1851 CAATCCTGCC TTACTCAGGA AGGCAGGAGG AAAGAGACTG CCTCTCCAGA1901 GCAAGGCCCA GCTGGGCAGA GGGTGAAAAA GAGAAATGTG AGCATCCGCT1951 CCCCCACCAC CCCGCCCAGC CCCTAGCCCC ACTCCCTGCC TCCTGAAATG2001 GTTCCCACCC AGAACTAATT TATTTTTTAT TAAAGATGGT CATGACAAAT2051 GAAAAAAAAA AAAAAAAAAAB氨基酸序列(SEQ ID NO:41)長度237個氨基酸1 MGPCTTAWFH SDWSSKCSAE CGTGIQRRSV VCLGSGAALG PGQGEAGAGT51 GQSCPTGSRP PDMRACSLGP CERTWRWYTG PWGECSSECG SGTQRRDIIC101 VSKLGTEFNV TSPSNCSHLP RPPALQPCQG QACQDRWFST PWSPCSRSCQ151 GGTQTREVQC LSTNQTLSTR CPPQLRPSRK RPCNSQPCSQ RPDDQCKDSS201 PHCPLVVQAR LCVYPYYTAT CCRSCAHVLE RSPQDPSC核苷酸及氨基酸組合序列(SEQ ID NO:42)克隆號和蛋白名稱PP1396起始編碼子602 ATG終止編碼子1315 TGA蛋白質(zhì)分子量25657.551G GAC CCC AGG CAC CCT CCA GCG TCA GGT GCG GAT CCC CCA GAT GCN 4647 CGC CCC GCC CCA TCC CAG GAC ACC CCT GGG GTC TCC AGC TGC GTA CTG 9495 GAA ACG AGT GGG ACA CTC TGC ATG CTC AGC GTC CTG CGG GAA AGG TGT 142143 CTG GCG CCC CAT TTT CCT CTG CAT CTC CCG TGA GTC GGG AGA GGA ACT 190191 GGA TGA ACG CAG CTG TGC CGC GGG TGC CAG GCC CCC AGC CTC CCC TGA 238239 ACC CTG CAC GGC ACC CCA TGC CCC CCA TAC TGG GAG GCT GGC GAG TGG 286287 ACA TCC TGC AGC CGC TCC TGT GGC CCC GGC ACC CAG CAC CGC CAG CTG 334335 CAG TGC CGG CAG GAA TTT GGG GGG GGT GGC TCC TCG GTG CCC CCG GAG 382383 CGC TGT GGA CAT CTC CCC CGG CCC AAC ATC ACC CAG TCT TGC CAG CTG 430431 CGC CTC TGT GGC CAT TGG GAA GTT GGC TCT CCT TGG AGC CAG TGC TCC 478479 GTG CGG TGC GGC CGG GGC CAG AGA AGC CGG CAG GTT CGC TGT GTT GGG 526527 AAC AAC GGT GAT GAA GTG AGC GAG CAG GAG TGT GCG TCA GGC CCC CCA 574575 CAG CCC CCC AGC AGA GAG GCC TGT GAC ATG GGG CCC TGT ACT ACT GCC 6221 Met Gly Pro Cys Thr Thr Ala 7623 TGG TTC CAC AGC GAC TGG AGC TCC AAG TGC TCA GCC GAG TGT GGG ACG 6708 Trp Phe His Ser Asp Trp Ser Ser Lys Cys Ser Ala Glu Cys Gly Thr 23671 GGA ATC CAG CGG CGC TCT GTG GTC TGC CTT GGG AGT GGG GCA GCC CTC 71824 Gly Ile Gln Arg Arg Ser Val Val Cys Leu Gly Ser Gly Ala Ala Leu 39719 GGG CCA GGC CAG GGG GAA GCA GGA GCA GGA ACT GGG CAG AGC TGT CCA 76640 Gly Pro Gly Gln Gly Glu Ala Gly Ala Gly Thr Gly Gln Ser Cys Pro 55767 ACA GGA AGC CGG CCC CCT GAC ATG CGC GCC TGC AGC CTG GGG CCC TGT 81456 Thr Gly Ser Arg Pro Pro Asp Met Arg Ala Cys Ser Leu Gly Pro Cys 71815 GAG AGA ACT TGG CGC TGG TAC ACA GGG CCC TGG GGT GAG TGC TCC TCC 86272 Glu Arg Thr Trp Arg Trp Tyr Thr Gly Pro Trp Gly Glu Cys Ser Ser 87863 GAA TGT GGC TCT GGC ACA CAG CGT AGA GAC ATC ATC TGT GTA TCC AAA 91088 Glu Cys Gly Ser Gly Thr Gln Arg Arg Asp Ile Ile Cys Val Ser Lys 103911 CTG GGG ACG GAG TTC AAC GTG ACT TCT CCG AGC AAC TGT TCT CAC CTC 958104 Leu Gly Thr Glu Phe Asn Val Thr Ser Pro Ser Asn Cys Ser His Leu 119959 CCC AGG CCC CCT GCC CTG CAG CCC TGT CAA GGG CAG GCC TGC CAG GAC1006120 Pro Arg Pro Pro Ala Leu Gln Pro Cys Gln Gly Gln Ala Cys Gln Asp 1351007 CGA TGG TTT TCC ACG CCC TGG AGC CCA TGT TCT CGC TCC TGC CAA GGG1054136 Arg Trp Phe Ser Thr Pro Trp Ser Pro Cys Ser Arg Ser Cys Gln Gly 1511055 GGA ACG CAG ACA CGG GAG GTC CAG TGC CTG AGC ACC AAC CAG ACC CTC1102152 Gly Thr Gln Thr Arg Glu Val Gln Cys Leu Ser Thr Asn Gln Thr Leu 1671103 AGC ACC CGA TGC CCT CCT CAA CTG CGG CCC TCC AGG AAG CGC CCC TGT1150168 Ser Thr Arg Cys Pro Pro Gln Leu Arg Pro Ser Arg Lys Arg Pro Cys 1831151 AAC AGC CAA CCC TGC AGC CAG CGC CCT GAT GAT CAA TGC AAG GAC AGC1198184 Asn Ser Gln Pro Cys Ser Gln Arg Pro Asp Asp Gln Cys Lys Asp Ser 1991199 TCT CCA CAT TGC CCC CTG GTG GTA CAG GCC CGG CTC TGC GTC TAC CCC1246200 Ser Pro His Cys Pro Leu Val Val Gln Ala Arg Leu Cys Val Tyr Pro 2151247 TAC TAC ACA GCC ACC TGT TGC CGC TCT TGC GCA CAT GTC CTG GAG CGG1294216 Tyr Tyr Thr Ala Thr Cys Cys Arg Ser Cys Ala His Val Leu Glu Arg 2311295 TCT CCC CAG GAT CCC TCC TGA AAG GGG TCC GGG GCA CCT TCA CGG TTT1342232 Ser Pro Gln Asp Pro Ser *** 2381343 TCT GTG CCA CCA TCG GTC ACC CAT TGA TCG GCC CAC TCT GAA CCC CCT13901391 GGC TCT CCA GCC TGT CCC AGT CTC AGC AGG GAT GTC CTC CAG GTG ACA14381439 GAG GGT GGC AAG GTG ACT GAC ACA AAG TGA CTT TCA GGG CTG TGG TCA14861487 GGC CCA TGT GGT GGT GTG ATG GGT GTG TGC ACA TAT GCC TCA GGT GTG15341535 CTT TTG GGA CTG CAT GGA TAT GTG TGT GCT CAA ACG TGT ATC ACT TTT15821583 CAA AAA GAG GTT ACA CAG ACT GAG AAG GAC AAG ACC TGT TTC CTT GAG16301631 ACT TTC CTA GGT GGA AAG GAA AGC AAG TCT GCA GTT CCT TGC TAA TCT16781679 GAG CTA CTT AGA GTG TGG TCT CCC CAC CAA CTC CAG TTT TGT GCC CTA17261727 AGC CTC ATT TCT CAT GTT CAG ACC TCA CAT CTT TTA AGC CGC CCT GTG17741775 TCT CTG ACC CCT TCT CAT TTG CCT AGT ATC TCT GCC CCT GCC TCC CTA18221823 ATT AGC TAG GGC TGG GGT CAG CCA CTG CCA ATC CTG CCT TAC TCA GGA18701871 AGG CAG GAG GAA AGA GAC TGC CTC TCC AGA GCA AGG CCC AGC TGG GCA19181919 GAG GGT GAA AAA GAG AAA TGT GAG CAT CCG CTC CCC CAC CAC CCC GCC19661967 CAG CCC CTA GCC CCA CTC CCT GCC TCC TGA AAT GGT TCC CAC CCA GAA20142015 CTA ATT TAT TTT TTA TTA AAG ATG GTC ATG ACA AAT GAA AAA AAA AAA20622063 AAA AAA AA 2070DBlastp結(jié)果Query=PPl396[基因=PP1396](237個氨基酸)>SP_IN:Q19791 Q19791 caenorhabditis elegans.f25h8.3 protein.5/1999長度=2165分值=98.7 bits(242),預計值=4e-20相同性=58/205(28%),相似性=86/205(41%),缺口=31/205(15%)Query:8 WFHSDWSSKCSAECGTGIQR-RSVVCLGSXXXXXXXXXXXXXXXXQSCPTGSRPPDMRAC 66W ++W +C A CGT +Q +R+V C++ CRP R CSbjct:1426 WKMAEWE-ECPATCGTHVQQSRNVTCVSAEDGGRTILKDV------DCDVQKRPTSARNC 1478Query:67SLGPC----ERTWRWYTGPWGECSSECGSGTQRRDIICVSKLGTEFNVTSPSNCSHLPRP 122L PCEW G W +CS +CG G +RR + C S S+C+PSbjct:1479 RLEPCPKGEEHIGSWIIGDWSKCSASCGGGWRRRSVSCTS------------SSCDETRKP 1527Query:123 PALQPCQGQAC-----QDRWFSTPWSPCSRSCQGGTQTREVQC---LSTNQTLSTRCPPQL 175C + C +W +PW+ CS SC GG Q R++ C LS + C ++Sbjct:1528 KMFDKCNEELCPPLTNNSWQISPWTHCSVSCGGGVQRRKIWCEDVLSGRKQDDIEC--SEI 1586Query:176 RPSRKRPCNSQPCSQRPDDQCKDSS 200+P +R C PC+++SSbjct:1587 KPREQRDCEMPPCRSHYHNKTSSAS 1611分值=93.6 bits(229),預計值=2e-18相同性=67/219(30%),相似性=88/219(39%),缺口=37/219(16%)Query:4CTTAWFHSDWSSKCSAECGTGIQRRSVVCLGSXXXXXXXXXXXXXXXXQSCPTGSRPPDM 63C+T W D SS CSA+CG+G +R+ V C+C S+P D+Sbjct:958 CSTRWITEDVSS-CSAKCGSGQKRQRVSCV------KMEGDRQTPASEHLCDRNSKPSDI 1010Query:64 RACSLGPCERTWRWYTGPWGECSSECGS-GTQRRDIICV-----------SKLGTEFNVTSP 113+C+ R W + G W CS CGS G R CV S G E+Sbjct:1011 ASCYIDCSGRKWNY--GEWTSCSETCGSNGKMHRKSYCVDDSNRRVDESLCGREQKEATE 1068Query:114 SNCSHLPRPPALQPCQGQACQDRWFSTPWSPCSRSCQGGTQTREVQCL--STNQTLSTRC 171C+ +P P RWWS CSRSC GG + R QCL + +T ++RCSbjct:1069 RECNRIPCP-------------RWVYGHWSECSRSCDGGVKMRHAQCLDAADRETHTSRC 1115Query:172 PPQLRPSRKRPCNSQPCSQRPDDQCKDSSPHCPLVVQAR 210P + CN C+D S C VQ RSbjct:1116 GP---AQTQEHCNEHACTWWQFGVWSDCSAKCGDGVQYR 115115.PP1563A核苷酸序列(SEQ ID NO:43)長度1664bp1 TCGAGTTTTT TTTTTTTTTT TTTAATTAGA GCAGGTATGC TTTTGATGGT51 AGGGAAGGGA TGGAAAAAAG GAAAAGCAAT AGAAACTGTC CAATTCACAT101 CAGTTATCCG TCTGCTTTTT CTTGAGAGCT TGTGGAAGGT GTTAACGTGG151 CTGGGAACAT CAACACCTTG GCATGCATGA ATGTTAAGTC AGGAAGGCCA201 GCGATCACCT TGATAGCTTC TTCACTTAGG TGCTCTTCTC TTTTCGGTTT251 CCTACTGGTA GATGTGCTTG TCTTCTCTAC TGTAGACATG AGTCTTGCAA301 ATGCATCAGT CACTTTGAGG CTTGAGGTGG AGATTTCCAG CTTAGAAGTT351 GTTAACTCAT ACAACTCCGG ATCCACACCT GGGATTGTGG TGCTGCTGCT401 AGAGCTACTG TCATCCACGG GCCCAAAGAA ATCAAGGTTC AGAAGAGTGG451 AACCTCCACT AGCATCTAAA GGGTTAGTAA GGCCACTGCT ACTCCAGTCA501 AACTGGACGG GTGGTAGAGA CTCCTGGAAC TGATCAGATG TACATGTGTT551 CATATCTGGT GACATGGTGG CTGTCTGACC GATGGAAGCT ATTTTTTCTG601 CAGCAGAAAG TGGTTTCAGT GGTTCCTTGG TGGGCTCTAA CATACCCAAT651 CCTGCTGCAT ACATGGGCAC TATAACAGGC TGCTTCTTAT TGCCCGTGAA701 GAGAATGTTT CGGGTGTCTA TTCCCAAGGA GGACAAAAGC TTCTTGTTGC751 TATGGGAGCC GCCCCACTGG TATCTCAAGC CATGTGCATC ATGGATATCC801 TGTAGCTCAG TCCACACATC TAGCAATTCC CCACTTTCAG GTAAGGCCTC851 TCTCGTTTTT ATTGGCAAAG TGCTTGTTTC CAGCAAGTGC TTCAGGGAAG901 TAACTTCCTC TTCAGCATCA GGGACAAGTA TGGAAGGAAA ACATGCTTCG951 AAAATTCGCT CCAGGCGGTT TAATAAAGCT GTCTGAACTG AAGTGGCTGA1001 TTCCTGTAAA TGGCCACTAG CAACTGCTCC TTTGGAAGTT GCTGA4GGTA1051 CACTGTGCGT TTTGGGGGTT CCTGGAGTAT CAATATTTTC ATCTGTCCTA1101 TGTGACTGCC AGGCTTCCTT TCGATGATGA GATTCAGTAG CCTGCTGGTC1151 TCCAAAAGCA GCCCAAGAAC AACTATCTTT TTGTTCATCC TCAAAAGCAT1201 TCCAATCTAC AACTTGGCTA GGACCAGCTG AACTGAAGTC TGCAAAATCA1251 TCAGAGTCTT GAAAACCATT GCAGTCATCC TGAATATTTG GCACAGAATC1301 AAAATGTCCA ATCTCACCTT CTTGCCCATT TTTAAGTTTT GCAACAGGTT1351 CAGTGCCTGT TCCACTAGAT TTTCTTGCCA ATTGACATTC TTCTGATAAA1401 TTATCAGAAG TCTGTTTTAG GTCTGACTTT GTTAATATTG TCTCCTCTTG1451 GCAAGAAACA GCATTTATAT CCCCAAATTC TCCAAAGTCA TCACCTGGTT1501 CACTAAAATG TGGAAAGTGC TCTGAAGACT CTTCAAAAGT GGCATCACTC1551 ATTGAATCTT GAGTACCAGT AACAAAAGGT GGAGTTGAGC CACTGGCAGA1601 GCCAAAGTCA CCAAAATCCC CCAAAAAAAA AAAAAAAAAA AAAAAAAAAA1651 AAAAAAAAAA AAAAB氨基酸序列(SEQ ID NO:44)長度134個氨基酸1 MEAIFSAAES GFSGSLVGSN IPNPAAYMGT ITGCFLLPVK RMFRVSIPKE51 DKSFLLLWEP PHWYLKPCAS WISCSSVHTS SNSPLSGKAS LVFIGKVLVS101 SKCFREVTSS SASGTSMEGK HASKIRSRRF NKAVC核苷酸及氨基酸組合序列(SEQ ID NO:45)克隆號和蛋白名稱PP1563起始編碼子582 ATG終止編碼子986 TGA蛋白質(zhì)分子量14492.981 TC GAG TTT TTT TTT TTT TTT TTT AAT TAG AGC AGG TAT GCT TTT GAT 4748 GGT AGG GAA GGG ATG GAA AAA AGG AAA AGC AAT AGA AAC TGT CCA ATT 9596 CAC ATC AGT TAT CCG TCT GCT TTT TCT TGA GAG CTT GTG GAA GGT GTT 143144 AAC GTG GCT GGG AAC ATC AAC ACC TTG GCA TGC ATG AAT GTT AAG TCA 191192 GGA AGG CCA GCG ATC ACC TTG ATA GCT TCT TCA CTT AGG TGC TCT TCT 239240 CTT TTC GGT TTC CTA CTG GTA GAT GTG CTT GTC TTC TCT ACT GTA GAC 287288 ATG AGT CTT GCA AAT GCA TCA GTC ACT TTG AGG CTT GAG GTG GAG ATT 335336 TCC AGC TTA GAA GTT GTT AAC TCA TAC AAC TCC GGA TCC ACA CCT GGG 383384 ATT GTG GTG CTG CTG CTA GAG CTA CTG TCA TCC ACG GGC CCA AAG AAA 431432 TCA AGG TTC AGA AGA GTG GAA CCT CCA CTA GCA TCT AAA GGG TTA GTA 479480 AGG CCA CTG CTA CTC CAG TCA AAC TGG ACG GGT GGT AGA GAC TCC TGG 527528 AAC TGA TCA GAT GTA CAT GTG TTC ATA TCT GGT GAC ATG GTG GCT GTC 575576 TGA CCG ATG GAA GCT ATT TTT TCT GCA GCA GAA AGT GGT TTC AGT GGT 6231 Met Glu Ala Ile Phe Ser Ala Ala Glu Ser Gly Phe Ser Gly 14624 TCC TTG GTG GGC TCT AAC ATA CCC AAT CCT GCT GCA TAC ATG GGC ACT 67115 Ser Leu Val Gly Ser Asn Ile Pro Asn Pro Ala Ala Tyr Met Gly Thr 30672 ATA ACA GGC TGC TTC TTA TTG CCC GTG AAG AGA ATG TTT CGG GTG TCT 71931 Ile Thr Gly Cys Phe Leu Leu Pro Val Lys Arg Met Phe Arg Val Ser 46720 ATT CCC AAG GAG GAC AAA AGC TTC TTG TTG CTA TGG GAG CCG CCC CAC 76747 Ile Pro Lys Glu Asp Lys Ser Phe Leu Leu Leu Trp Glu Pro Pro His 62768 TGG TAT CTC AAG CCA TGT GCA TCA TGG ATA TCC TGT AGC TCA GTC CAC 81563 Trp Tyr Leu Lys Pro Cys Ala Ser Trp Ile Ser Cys Ser Ser Val His 78816 ACA TCT AGC AAT TCC CCA CTT TCA GGT AAG GCC TCT CTC GTT TTT ATT 86379 Thr Ser Ser Asn Ser Pro Leu Ser Gly Lys Ala Ser Leu Val Phe Ile 94864 GGC AAA GTG CTT GTT TCC AGC AAG TGC TTC AGG GAA GTA ACT TCC TCT 91195 Gly Lys Val Leu Val Ser Ser Lys Cys Phe Arg Glu Val Thr Ser Ser 110912 TCA GCA TCA GGG ACA AGT ATG GAA GGA AAA CAT GCT TCG AAA ATT CGC 959111 Ser Ala Ser Gly Thr Ser Met Glu Gly Lys His Ala Ser Lys Ile Arg 126960 TCC AGG CGG TTT AAT AAA GCT GTC TGA ACT GAA GTG GCT GAT TCC TGT1007127 Ser Arg Arg Phe Asn Lys Ala Val *** 1351008 AAA TGG CCA CTA GCA ACT GCT CCT TTG GAA GTT GCT GAA GGT ACA CTG10551056 TGC GTT TTG GGG GTT CCT GGA GTA TCA ATA TTT TCA TCT GTC CTA TGT11031104 GAC TGC CAG GCT TCC TTT CGA TGA TGA GAT TCA GTA GCC TGC TGG TCT11511152 CCA AAA GCA GCC CAA GAA CAA CTA TCT TTT TGT TCA TCC TCA AAA GCA11991200 TTC CAA TCT ACA ACT TGG CTA GGA CCA GCT GAA CTG AAG TCT GCA AAA12471248 TCA TCA GAG TCT TGA AAA CCA TTG CAG TCA TCC TGA ATA TTT GGC ACA12951296 GAA TCA AAA TGT CCA ATC TCA CCT TCT TGC CCA TTT TTA AGT TTT GCA13431344 ACA GGT TCA GTG CCT GTT CCA CTA GAT TTT CTT GCC AAT TGA CAT TCT13911392 TCT GAT AAA TTA TCA GAA GTC TGT TTT AGG TCT GAC TTT GTT AAT ATT14391440 GTC TCC TCT TGG CAA GAA ACA GCA TTT ATA TCC CCA AAT TCT CCA AAG14871488 TCA TCA CCT GGT TCA CTA AAA TGT GGA AAG TGC TCT GAA GAC TCT TCA15351536 AAA GTG GCA TCA CTC ATT GAA TCT TGA GTA CCA GTA ACA AAA GGT GGA15831584 GTT GAG CCA CTG GCA GAG CCA AAG TCA CCA AAA TCC CCC AAA AAA AAA16311632 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA1664DBlastp結(jié)果Query=PP1563[基因=PP1563](134個氨基酸)>SP_FUN:Q05164 Q05164 saccharomyces cerevisiae(baker's yeast).
aob567,aof1001,aoe110,aoe264 and aoe130 genes.8/1998長度=1001分值=31.3 bits(69),預計值=3.6相同性=22/61(36%),相似性=31/61(50%),缺口=1/61(1%)Query:65 LKPCASWISCSSVHTSSNSPLSGKASLVFIGKVLVSSKCFREVTSSSASGTSMEGKHASK 124+ P SSS TSS S +SG +S+ G +SSE + SSASG+S + SSbjct:81 IAPSTSSSEVSSSITSSGSSVSGSSSITSSGSSVSSSSSATE-SGSSASGSSSATESGSS 139Query:125 I 125+Sbjct:140 V 140>SP_FUN:Q08294 Q08294 saccharomyces cerevisiae(baker's yeast).
chromosome xv reading frame orf yol155c.11/1996長度=967分值=31.3 bits(69),預計值=3.6相同性=22/61(36%),相似性=31/61(50%),缺口=1/61(1%)Query:65 LKPCASWISCSSVHTSSNSPLSGKASLVFIGKVLVSSKCFREVTSSSASGTSMEGKHASK 124+ P SSS TSS S +SG +S+ G + SSE+SSASG+S + SSbjct:81 IAPSTSSSEVSSSITSSGSSVSGSSSITSSGSSVSSSSSATE-SGSSASGSSSATESGSS 139Query:125 I 125+Sbjct:140 V 140>SW:YA06_CAEEL Q20762 caenorhabditis elegans.hypothetical 167.7 kdprotein f54d1.6 in chromosome ⅳ.7/1998長度=1462分值=30.1 bits(66),預計值=8.0相同性=14/56(25%),相似性=29/56(51%),缺口=3/56(5%)Query:63 WYLKPCASWISCSSVHTSSNSPLSGKASLVFIGKVLVSSKC---FREVTSSSASGT 115WY + A W T+S+ P + ++ IG+ + +C FR++T +++ G+Sbjct:673 WYDEDGAQWNFIRDTETNSSCPCIERQAIADIGRFMPHPRCSQAFRDITCTTSIGS 72816.PP1746A核苷酸序列(SEQ ID NO:46)長度1977bp1 GTCCAATGCC CCCCACATCC CTGTGCACCT GGGTGCCATG CAGGAGACGG51 TGCAGTTCCA GATTCAGCAC CTGGGGGCCG ATCTCCACCC TGGCGACGTG101 CTACTGAGCA ACCATCCCAG TGCCGGGGGC AGCCACCTGC CAGACCTGAC151 TGTTATCACA CCGGTGAGGG GTGCTGCCCG CCTGCCTCTG CTGGGGCAGT201 GGTGGCCGAT GCAGCTGACC GTGGCTCTCC ACCCGCTAGG TGTTTTGGCC251 GGGTCAGACG CGGCCTGTGT TCTATGTGGC CAGCCGAGGG CACCACGCAG301 ACATCGGGGG CATCACACCA GGCTCCATGC CCCCCCACTC CACCATGCTG351 CAACAGGAGG GTGCCGTCTT TCTGTCCTTC AAACTTGTCC AGGGGGGCGT401 CTTCCAGGAG GAGGCGGTGA CGGAGGCCCT GCGGGCGCCA GGCAAGGTCC451 CCAACTGCAG CGGAACCAGA AACCTGCACG ACACCTGGAA GATAAACTGA501 AATGCACCAA AGAGGAGCAC CTCTGTACAC AAAGGATGCT GGACCAGACC551 CTGCTTGACC TGAATGAGAT GTAGAACGCC CCAGTCCCAC CCTGCTGCTG501 CTCCTCCCTC TGACCCAGAC TCCGCCTGAG GCCAGCCTGC GGGAAGCTGA651 CCTTTAATTG AGGGCTGATC TTTAACTGGA AGGCTGCTTT CTCCTTTCAC701 CACCCCCTCC TTCCCTGTGT CTTTTTCGCC AAACTGTCTC TGCCTCTTCC751 CGGAGAATCC AGCTGGGCTA GAGGCTGAGC ACCTTTGGAA ACAACATTTA801 AGGGAATGTG AGCACAATGC ATAATGTCTT TAAAAAGCAT GTTGTGATGT851 ACACATTTTG TAATTACCTT TTTTGTTGTT TTGTAGCAAC CATTTGTAAA901 ACATTCCAAA TGGTTGCTCC AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA951 AAAAAAAAAA AAAAAAAACT CGAGGGGGGG CCCGGTACCA GGTAATCTAC1001 TGCCTGCGCT GTCTGGTGGG CCGCGACATC CCACTCAACC AGGGCTGCCT1051 GGCGCCAGTG CGCGTGGTCA TTCCCCGAGG CTCCATCCTG GACCCGTCGC1101 CCGAGGCGGC GGTGGTGGGC GGCAACGTGC TCACGTCGCA GCGCGTGGTG1151 GATGTCATCC TGGGGGCCTT TGGGGCCTGC GCCGCCTCCC AGGTGCGGGG1201 GCGGGGTGGG CGCAGCTCGG GGGCGGACTG GGTGGGCAGG CTGGAGTAGG1251 AGCGGGAGGG CGAGGTGGGG ACGCCCTGCC CCAGCCCAGC GCAGCGACCA1301 GGTGCCCTCA CCAGGGCTGC ATGAACAACG TGACCCTGGG CAACGCCCAC1351 ATGGGCTACT ACGAGACGGT GGCGGGCGGC GCGGGCGCGG GTCCCAGCTG1401 GCACGGGCGC AGCGGTGTGC ACAGCCACAT GACCAACACA CGCATCACCG1451 ACCCTGAGAT CCTGGAGAGC CGGTACCCGG TCATCCTGCG CCGCTTCGAG1501 CTGCGGCGGG GCTCGGGGGG CAGAGGCCGC TFCCGAGGCG GCGACGGCGT1551 CACCCGCGAG CTGCTCTTTC GTGAGGAGGC GCTGCTGTCA GTGCTGACCG1601 AGCGCCGCGC CTTCCGGCCA TACGGGCTCC ACGGGGGCGA GCCTGGCGCC1851 CGCGGCCTAA ACCTGCTGAT CCGCAAAAAC GGCCGGACGG TGAATCTGGG1701 CGGCAAGACG TCGGTGACCG TGTACCCCGG GGATGTGTTC TGTCTCCACA1751 CGCCCGGCGG CGGTGGCTAT GGGGACCCGG AGGACCCCGC CCCACCGCCG1801 GGGTCGCCCC CGCAAGCACT GGCCTTTCCC GAGCACGGCA GCGTCTATGA1851 GTATCGCCGG GCCCAGAAGG CCGTGTGAGG ATCCCGCAAT AAAAATGCCT1901 TAAGTCTCCC GGTTCTGGGG ACGCAGCTAC GGCGCCTTAA AAAAAAAAAA1951 AAAAAAAAAA AAAAAAAAAA AAAAAAAB氨基酸序列(SEQ ID NO:47)長度353個氨基酸1 MHNVFKKHVV MYTFCNYLFC CFVATICKTF QMVAPKKKKK KKKKKKKKKK51 NSRGGPVPGN LLPALSGGPR HPTQPGLPGA SARGHSPRLH PGPVARGGGG101 GRQRAHVAAR GGCHPGGLWG LRRLPGAGAG WAQLGGGLGG QAGVGAGGRG151 GDALPQPSAA TRCPHQGCMN NVTLGNAHMG YYETVAGGAG AGPSWHGRSG201 VHSHMTNTRI TDPEILESRY PVILRRFELR RGSGGRGRFR GGDGVTRELL251 FREEALLSVL TERRAFRPYG LHGGEPGARG LNLLIRKNGR TVNLGGKTSV301 TVYPGDVFCL HTPGGGGYGD PEDPAPPPGS PPQALAFPEH GSVYEYRRAQ351 KAVC核苷酸及氨基酸組合序列(SEQ ID NO:48)克隆號和蛋白名稱PP1746起始編碼子817 ATG終止編碼子1878 TGA蛋白質(zhì)分子量37265.901 GTC CAA TGC CCC CCA CAT CCC TGT GCA CCT GGG TGC CAT GCA GGA GAC 4849 GGT GCA GTT CCA GAT TCA GCA CCT GGG GGC CGA TCT CCA CCC TGG CGA 9697 CGT GCT ACT GAG CAA CCA TCC CAG TGC CGG GGG CAG CCA CCT GCC AGA 144145 CCT GAC TGT TAT CAC ACC GGT GAG GGG TGC TGC CCG CCT GCC TCT GCT 192193 GGG GCA GTG GTG GCC GAT GCA GCT GAC CGT GGC TCT CCA CCC GCT AGG 240241 TGT TTT GGC CGG GTC AGA CGC GGC CTG TGT TCT ATG TGG CCA GCC GAG 288289 GGC ACC ACG CAG ACA TCG GGG GCA TCA CAC CAG GCT CCA TGC CCC CCC 336337 ACT CCA CCA TGC TGC AAC AGG AGG GTG CCG TCT TTC TGT CCT TCA AAC 384385 TTG TCC AGG GGG GCG TCT TCC AGG AGG AGG CGG TGA CGG AGG CCC TGC 432433 GGG CGC CAG GCA AGG TCC CCA ACT GCA GCG GAA CCA GAA ACC TGC ACG 480481 ACA CCT GGA AGA TAA ACT GAA ATG CAC CAA AGA GGA GCA CCT CTG TAC 528529 ACA AAG GAT GCT GGA CCA GAC CCT GCT TGA CCT GAA TGA GAT GTA GAA 576577 CGC CCC AGT CCC ACC CTG CTG CTG CTC CFC CCT CTG ACC CAG ACT CCG 624625 CCT GAG GCC AGC CTG CGG GAA GCT GAC CTT TAA TTG AGG GCT GAT CTT 672673 TAA CTG GAA GGC TGC TTT CTC CTT TCA CCA CCC CCT CCT TCC CTG TGT 720721 CTT TTT CGC CAA ACT GTC TCT GCC TCT TCC CGG AGA ATC CAG CTG GGC 768769 TAG AGG CTG AGC ACC TTT GGA AAC AAC ATT TAA GGG AAT GTG AGC ACA 816817 ATG CAT AAT GTC TTT AAA AAG CAT GTT GTG ATG TAC ACA TTT TGT AAT 8641 Met His Asn Val Phe Lys Lys His Val Val Met Tyr Thr Phe Cys Asn 16865 TAC CTT TTT TGT TGT TTT GTA GCA ACC ATT TGT AAA ACA TTC CAA ATG 91217 Tyr Leu Phe Cys Cys Phe Val Ala Thr Ile Cys Lys Thr Phe Gln Met 32913 GTT GCT CCA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 96033 Val Ala Pro Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 48961 AAA AAA AAC TCG AGG GGG GGC CCG GTA CCA GGT AAT CTA CTG CCT GCG100849 Lys Lys Asn Ser Arg Gly Gly Pro Val Pro Gly Asn Leu Leu Pro Ala 641009 CTG TCT GGT GGG CCG CGA CAT CCC ACT CAA CCA GGG CTG CCT GGC GCC105665 Leu Ser Gly Gly Pro Arg His Pro Thr Gln Pro Gly Leu Pro Gly Ala 801057 AGT GCG CGT GGT CAT TCC CCG AGG CTC CAT CCT GGA CCC GTC GCC CGA110481 Ser Ala Arg Gly His Ser Pro Arg Leu His Pro Gly Pro Val Ala Arg 961105 GGC GGC GGT GGT GGG CGG CAA CGT GCT CAC GTC GCA GCG CGT GGT GGA115297 Gly Gly Gly Gly Gly Arg Gln Arg Ala His Val Ala Ala Arg Gly Gly 1121153 TGT CAT CCT GGG GGC CTT TGG GGC CTG CGC CGC CTC CCA GGT GCG GGG1200113 Cys His Pro Gly Gly Leu Trp Gly Leu Arg Arg Leu Pro Gly Ala Gly 1281201 GCG GGG TGG GCG CAG CTC GGG GGC GGA CTG GGT GGG CAG GCT GGA GTA1248129 Ala Gly Trp Ala Gln Leu Gly Gly Gly Leu Gly Gly Gln Ala Gly Val 1441249 GGA GCG GGA GGG CGA GGT GGG GAC GCC CTG CCC CAG CCC AGC GCA GCG1296145 Gly Ala Gly Gly Arg Gly Gly Asp Ala Leu Pro Gln Pro Ser Ala Ala 1601297 ACC AGG TGC CCT CAC CAG GGC TGC ATG AAC AAC GTG ACC CTG GGC AAC1344161 Thr Arg Cys Pro His Gln Gly Cys Met Asn Asn Val Thr Leu Gly Asn 1761345 GCC CAC ATG GGC TAC TAC GAG ACG GTG GCG GGC GGC GCG GGC GCG GGT1392177 Ala His Met Gly Tyr Tyr Glu Thr Val Ala Gly Gly Ala Gly Ala Gly 1921393 CCC AGC TGG CAC GGG CGC AGC GGT GTG CAC AGC CAC ATG ACC AAC ACA1440193 Pro Ser Trp His Gly Arg Ser Gly Val His Ser His Met Thr Asn Thr 2081441 CGC ATC ACC GAC CCT GAG ATC CTG GAG AGC CGG TAC CCG GTC ATC CTG1488209 Arg Ile Thr Asp Pro Glu Ile Leu Glu Ser Arg Tyr Pro Val Ile Leu 2241489 CGC CGC TTC GAG CTG CGG CGG GGC TCG GGG GGC AGA GGC CGC TTC CGA1536225 Arg Arg Phe Glu Leu Arg Arg Gly Ser Gly Gly Arg Gly Arg Phe Arg 2401537 GGC GGC GAC GGC GTC ACC CGC GAG CTG CTC TTT CGT GAG GAG GCG CTG1584241 Gly Gly Asp Gly Val Thr Arg Glu Leu Leu Phe Arg Glu Glu Ala Leu 2561585 CTG TCA GTG CTG ACC GAG CGC CGC GCC TTC CGG CCA TAC GGG CTC CAC1632257 Leu Ser Val Leu Thr Glu Arg Arg Ala Phe Arg Pro Tyr Gly Leu His 2721633 GGG GGC GAG CCT GGC GCC CGC GGC CTA AAC CTG CTG ATC CGC AAA AAC1680273 Gly Gly Glu Pro Gly Ala Arg Gly Leu Asn Leu Leu Ile Arg Lys Asn 2881681 GGC CGG ACG GTG AAT CTG GGC GGC AAG ACG TCG GTG ACC GTG TAC CCC1728289 Gly Arg Thr Val Asn Leu Gly Gly Lys Thr Ser Val Thr Val Tyr Pro 3041729 GGG GAT GTG TTC TGT CTC CAC ACG CCC GGC GGC GGT GGC TAT GGG GAC1776305 Gly Asp Val Phe Cys Leu His Thr Pro Gly Gly Gly Gly Tyr Gly Asp 3201777 CCG GAG GAC CCC GCC CCA CCG CCG GGG TCG CCC CCG CAA GCA CTG GCC1824321 Pro Glu Asp Pro Ala Pro Pro Pro Gly Ser Pro Pro Gln Ala Leu Ala 3361825 TTT CCC GAG CAC GGC AGC GTC TAT GAG TAT CGC CGG GCC CAG AAG GCC1872337 Phe Pro Glu His Gly Ser Val Tyr Glu Tyr Arg Arg Ala Gln Lys Ala 3521873 GTG TGA GGA TCC CGC AAT AAA AAT GCC TTA AGT CTC CCG GTT CTG GGG1920353 Val *** 3541921 ACG CAG CTA CGG CGC CTT AAA AAA AAA AAA AAA AAAAAA AAA AAA AAA 19681969 AAA AAA AAA1977DBlastp結(jié)果Query=PP1746[基因=PP1746](353個氨基酸)>SW:YAOE_SCHPO Q10093 schizosaccharomyces pombe(fission yeast).
hypothetical 138.8 kd protein clld3.14c in chromosome ⅰ.
12/1998長度=1260分值=134 bits(335),預計值=9e-31相同性=75/157(47%),相似性=92/157(57%),缺口=10/157(6%)Query:166 QGCMNNWTLG----NAHMGY--YETVAGGAGAGPSWHGRSGVHSHMTNTRITDPEILESR 219QGCMNN+T GN G +YET+AGGAGAGP+W+G SGVH+HMTNTRITDPE++E RSbjct:1093 QGCMNNLTFGYDGENGEEGFAMYETIAGGAGAGPTWNGTSGVHTHMTNTRITDPEVVERR 1152Query:220 YPVILXXXXXXXXXXXXXXXXXXXXVTRELLFREEALLSVLTERRAFRPYGLHGGEPGAR 279PVILV R FR S+L+ERR+ PYG++GGE GASbjct:1153 APVILRRFCLRENSGGKGEYHGGDGVIRHFEFRRSMHCSILSERRSRAPYGMNGGEDGAM 1212Query:280 GLNLLIRKNG--------RTVNLGGKTSVTVYPGDVFCLHT 312G+N I+ R VNLGGK V + GD + TSbjct:1213 GVNTWIDCSNPDFPRYVNLGGKNHVLMGKGDHIVIET 1249在本發(fā)明提及的所有文獻都在本申請中引用作為參考,就如同每一篇文獻被單獨引用作為參考那樣。此外應理解,在閱讀了本發(fā)明的上述講授內(nèi)容之后,本領域技術人員可以對本發(fā)明作各種改動或修改,這些等價形式同樣落于本申請所附權利要求書所限定的范圍。
權利要求
1.一種分離的具有抑癌功能的人蛋白,其特征在于,它包含具有選自下組的氨基酸序列的多肽SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ ID NO:11、SEQ ID NO:14、SEQID NO:17、SEQ ID NO:20、SEQ ID NO:23、SEQ ID NO:26、SEQ ID NO:29、SEQID NO:32、SEQID NO:35、SEQ ID NO:38、SEQ ID NO:41、SEQ ID NO:44、SEQ ID NO:47;或其保守性變異多肽、或其活性片段、或其活性衍生物。
2.如權利要求1所述的多肽,其特征在于,該多肽是具有選自下組的氨基酸序列的多肽SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ ID NO:11、SEQIDNO:14、SEQ ID NO:17、SEQ ID NO:20、SEQ ID NO:23、SEQ ID NO:26、SEQ ID NO:29、SEQID NO:32、SEQ ID NO:35、SEQ ID NO:38、SEQ ID NO:41、SEQ ID NO:44、SEQ ID NO:47。
3.一種分離的多核苷酸,其特征在于,它包含一核苷酸序列,該核苷酸序列與選自下組的一種核苷酸序列有至少85%相同性(a)編碼如權利要求1和2所述多肽的多核苷酸;(b)與多核苷酸(a)互補的多核苷酸。
4.如權利要求3所述的多核苷酸,其特征在于,該多核苷酸編碼的多肽具有選自下組的氨基酸序列SEQ ID NO:2、SEQ ID NO:5、SEQ ID NO:8、SEQ ID NO:11、SEQID NO:14、SEQ ID NO:17、SEQ ID NO:20、SEQ ID NO:23、SEQ ID NO:26、SEQ ID NO:29、SEQID NO:32、SEQ ID NO:35、SEQ ID NO:38、SEQ ID NO:4 1、SEQ ID NO:44、SEQ ID NO:47。
5.如權利要求3所述的多核苷酸,其特征在于,該多核苷酸的序列選自下組SEQ ID NO:3、SEQ ID NO:6、SEQ ID NO:9、SEQ ID NO:12、SEQID NO:15、SEQ ID NO:18、SEQ ID NO:21、SEQ ID NO:24、SEQ ID NO:27、SEQ ID NO:30、SEQIDNO:33、SEQ ID NO:36、SEQ ID NO:39、SEQ ID NO:42、SEQ ID NO:45、SEQ ID NO:48的編碼區(qū)序列或全長序列。
6.一種載體,其特征在于,它含有權利要求3所述的多核苷酸。
7.一種遺傳工程化的宿主細胞,其特征在于,它是選自下組的一種宿主細胞(a)用權利要求6所述的載體轉(zhuǎn)化或轉(zhuǎn)導的宿主細胞;(b)用權利要求3所述的多核苷酸轉(zhuǎn)化或轉(zhuǎn)導的宿主細胞。
8.一種具有具有抑癌功能的人蛋白活性的多肽的制備方法,其特征在于,該方法包含(a)在適合表達具有抑癌功能的人蛋白的條件下,培養(yǎng)權利要求7所述的宿主細胞;(b)從培養(yǎng)物中分離出具有具有抑癌功能的人蛋白活性的多肽。
9.一種能與權利要求1所述的具有抑癌功能的人蛋白特異性結(jié)合的抗體。
10.一種核酸分子,它含有權利要求3所述的多核苷酸中連續(xù)的10-800個核苷酸。
11.一種藥物組合物,其特征在于,它含有安全有效量的權利要求1所述的多肽以及藥學上可接受的載體。
全文摘要
本發(fā)明公開了一類新的具有抑癌功能的人蛋白,編碼此多肽的多核苷酸和經(jīng)重組技術產(chǎn)生該多肽的方法。本發(fā)明還公開了此多肽用于治療多種疾病如癌癥等的方法。本發(fā)明還公開了抗此多肽的拮抗劑及其治療作用。本發(fā)明還公開了編碼這類新的具有抑癌功能的人蛋白的多核苷酸的用途。
文檔編號C07K14/435GK1313318SQ0011199
公開日2001年9月19日 申請日期2000年3月14日 優(yōu)先權日2000年3月14日
發(fā)明者顧健人, 楊勝利 申請人:上海市腫瘤研究所
網(wǎng)友詢問留言 已有0條留言
  • 還沒有人留言評論。精彩留言會獲得點贊!
1